博客
关于我
论文 :pix-loc
阅读量:776 次
发布时间:2019-03-24

本文共 3276 字,大约阅读时间需要 10 分钟。

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose

Camera pose estimation in known scenes can be improved by focusing on learning robust and invariant visual features while leaving geometric estimation to principled algorithms.

Our approach leverages direct alignment of multiscale deep features, framing camera localization as a metric learning problem while also enhancing sparse feature matching accuracy.

Inspired by direct image alignment [22, 26, 27, 63, 90, 91] and learned image representations for outlier rejection [42], we advocate that end-to-end visual localization algorithms should prioritize representation learning.

By not requiring pose regression itself, the network can extract suitable features, ensuring accurate and scene-agnostic performance.

PixLoc achieves localization by aligning query and reference images based on the known 3D structure of the scene.

Motivation: In absolute pose and scene coordinate regression from a single image, a deep neural network learns to:

i) Recognize the approximate location in a scene,

ii) Recognize robust visual features tailored to this scene, and

iii) Regress accurate geometric quantities like pose or coordinates.

Given CNNs' ability to learn generalizable features, i) and ii) do not need to be scene-specific, and i) is already addressed by image retrieval.

On the other hand, iii) can be effectively handled by classical geometry using feature matching [19, 20, 28] or image alignment [4, 26, 27, 51] combined with 3D representation.

Therefore, focusing on learning robust and generalizable features is key, enabling scene-agnostic and tightly-constrained pose estimation by geometry.

The challenge lies in defining effective features for localization. We solve this by making geometric estimation differentiable and only supervising the final pose estimate.

Section 3.1: Localization as Image Alignment

Image Representation: Sparse alignment is performed over learned feature representations, utilizing CNNs' ability to extract hierarchical features at multiple levels.

The features are L2-normalized along channels to enhance robustness and generalization across datasets.

This representation, inspired by past works on handcrafted and learned features for camera tracking [22, 52, 63, 85, 90, 93], is robust to significant illumination and viewpoint changes, providing meaningful gradients for successful alignments despite initial pose inaccuracies.

Direct Alignment: The geometric optimization aims to find the pose (R, t), aligning query and reference images based on scene structure.

Visual Priors: Combining pointwise uncertainties of query and reference images into per-residual weights allows the network to learn uncertainty, such as in domain shift scenarios, similar to aleatoric uncertainty [36].

This weighting captures multiple scenarios, enhancing pose accuracy across different conditions.

Experiments: The refinement improves performance on RobotCar Night, which faces motion blur and challenges in sparse keypoint detection, while showing no improvement on RobotCar Day or being detrimental on Aachen at 0.25m, potentially due to limited ground truth accuracy or camera intrinsics.

The difficulty of RobotCar Oxford dataset may also contribute to these results.

转载地址:http://jiokk.baihongyu.com/

你可能感兴趣的文章
Objective-C实现Euclidean GCD欧几里得最大公约数算法(附完整源码)
查看>>
Objective-C实现euclideanDistance欧氏距离算法(附完整源码)
查看>>
Objective-C实现euler method欧拉法算法(附完整源码)
查看>>
Objective-C实现eulerianPath欧拉路径算法(附完整源码)
查看>>
Objective-C实现eval函数功能(附完整源码)
查看>>
Objective-C实现Exceeding words超词(差距是ascii码的距离) 算法(附完整源码)
查看>>
Objective-C实现extended euclidean algorithm扩展欧几里得算法(附完整源码)
查看>>
Objective-C实现Factorial digit sum阶乘数字和算法(附完整源码)
查看>>
Objective-C实现factorial iterative阶乘迭代算法(附完整源码)
查看>>
Objective-C实现factorial recursive阶乘递归算法(附完整源码)
查看>>
Objective-C实现FigurateNumber垛积数算法(附完整源码)
查看>>
Objective-C实现Gale-Shapley盖尔-沙普利算法(附完整源码)
查看>>
Objective-C实现hamiltonianCycle哈密尔顿图算法(附完整源码)
查看>>
Objective-C实现hamming numbers汉明数算法(附完整源码)
查看>>
Objective-C实现hanning 窗(附完整源码)
查看>>
Objective-C实现hanoiTower汉诺塔算法(附完整源码)
查看>>
Objective-C实现hardy ramanujana定理算法(附完整源码)
查看>>
Objective-C实现highest response ratio next高响应比优先调度算法(附完整源码)
查看>>
Objective-C实现hill climbing爬山法用来寻找函数的最大值算法(附完整源码)
查看>>
Objective-C实现hornerMethod霍纳法算法(附完整源码)
查看>>