ViPR: Visual-Odometry-aided Pose Regression for 6DoF Camera Localization
Abstract
Visual Odometry (VO) accumulates positional drift in long-term robot navigation tasks. Although Convolutional Neural Networks (CNNs) improve VO in various aspects, VO still suffers from moving obstacles, discontinuous observation of features, and poor textures or visual information. While recent approaches estimate a 6DoF pose either directly from (a series of) images or by merging depth maps with optical flow (OF), research that combines absolute pose regression with OF is limited. We propose ViPR, a novel modular architecture for long-term 6DoF VO that leverages temporal information and synergies between absolute pose estimates (from PoseNet-like modules) and relative pose estimates (from FlowNet-based modules) by combining both through recurrent layers. Experiments on known datasets and on our own Industry dataset show that our modular design outperforms the state of the art in long-term navigation tasks.
More information
Title according to WOS: | ID WOS:000788279000021 Not found in local WOS DB |
Journal title: | 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW |
Publisher: | IEEE COMPUTER SOC |
Publication date: | 2020 |
Start page: | 187 |
End page: | 198 |
DOI: | 10.1109/CVPRW50498.2020.00029 |
Notes: | ISI |