상세 보기
- Kim, Kyoung Eun;
- Sim, Joo Yong
WEB OF SCIENCE
0SCOPUS
0초록
Recent advancements in cost-effective image-based localization using 2D maps have garnered significant attention, inspired by humans' ability to navigate with such maps. This study addresses the limitations of monocular vision-based systems, specifically inaccurate depth information and loss of geometric details, which hinder precise localization. We propose a novel neural network framework that incorporates a pretrained metric depth estimation model, such as Zoedepth, to accurately measure absolute distances and enhance map matching between 2D maps and images. Our approach introduces two key modules: an Explicit Depth Prior Fusion (EDPF) module, which constructs a depth score volume using depth maps, and an Implicit Depth Prior Fusion (IDPF) module, which integrates depth and semantic features early through positional encoding. These modules enable a single-layer-scale classifier to learn essential features for effective localization. Notably, the IDPF model with positional encoding showed over 10% performance improvement on the Mapillary dataset compared to the baseline, underscoring the advantages of combining semantic and geometric information. The proposed DP-Loc approach provides a cost-efficient solution for visual localization by leveraging publicly accessible 2D maps and monocular image inputs, making it applicable to autonomous driving, robotics, and augmented reality. © 2024, Institute of Electrical and Electronics Engineers Inc. All rights reserved.
키워드
- 제목
- DP-Loc: Visual Localization in 2D Maps using an Embedded Depth Prior
- 저자
- Kim, Kyoung Eun; Sim, Joo Yong
- 발행일
- 2024-12
- 유형
- Article
- 저널명
- IEEE Access
- 권
- 12
- 페이지
- 181570 ~ 181578