Skip to main navigation Skip to search Skip to main content

Enhancing Scene Coordinate Regression With Efficient Keypoint Detection and Sequential Information

  • Kuan Xu
  • , Zeyu Jiang
  • , Haozhi Cao
  • , Shenghai Yuan
  • , Chen Wang
  • , Lihua Xie
  • Nanyang Technological University

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Scene Coordinate Regression (SCR) is a visual localization technique that utilizes deep neural networks (DNN) to directly regress 2D-3D correspondences for camera pose estimation. However, current SCR methods often face challenges in handling repetitive textures and meaningless areas due to their reliance on implicit triangulation. In this letter, we propose an efficient and accurate SCR system. Compared to existing SCR methods, we propose a unified architecture for both scene encoding and salient keypoint detection, allowing our system to prioritize the encoding of informative regions. This design significantly improves computational efficiency. Additionally, we introduce a mechanism that utilizes sequential information during both mapping and relocalization. The proposed method enhances the implicit triangulation, especially in environments with repetitive textures. Comprehensive experiments conducted across indoor and outdoor datasets demonstrate that the proposed system outperforms state-of-the-art (SOTA) SCR methods. Our single-frame relocalization mode improves the recall rate of our baseline by 6.4% and increases the running speed from 56 Hz to 90 Hz. Furthermore, our sequence-based mode increases the recall rate by 11% while maintaining the original efficiency.

Original languageEnglish
Pages (from-to)9932-9939
Number of pages8
JournalIEEE Robotics and Automation Letters
Volume10
Issue number10
DOIs
StatePublished - 2025

Keywords

  • Visual learning
  • localization
  • mapping

Fingerprint

Dive into the research topics of 'Enhancing Scene Coordinate Regression With Efficient Keypoint Detection and Sequential Information'. Together they form a unique fingerprint.

Cite this