Visual Mapping

2022

thumbs DM-VIO: Delayed Marginalization Visual-Inertial Odometry DSO-IMUS

  • Multi-stage IMU initializer. Dynamic photometric weight (decrease weight for overall bad image)
  • Delayed marginalization (marginalization cannot be undone, but it can be delayed)

thumbs Long-term Visual Map Sparsification with Heterogeneous GNN use GNN to substitute the ILP method. compare with the result using Keep it brief for map summarization.

2021

chrown0 GVINS: Tightly Coupled GNSS-Visual-Inertial Fusion for Smooth and Consistent State Estimation It offers a complete model of GPS measurement. Makes fusion with GPS very solid.

unhappy DSP-SLAM: Object Oriented SLAM with Deep Shape Priors ORBSLAM2 + object tracking

chrown0 V-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping Plucher coordinate line only has normal residual term, cannot fix degeneracy cases (line on epipolar plane). This paper introduces a new residual based on vanishing point measurements.

2020

chrown hloc Hierarchical-Localization. CVPR2020 winner.

../_images/hloc.png

chrown Online Invariance Selection for Local Feature Descriptors Mainly for image retrieval. A light-weight meta descriptor approach to automatically select the best invariance of the local descriptors given the context. Learning the best invariance for local descriptors.

chrown0 Online Visual Place Recognition via Saliency Re-identification. github project .

  • Perform both saliency detection and retrieval in frequency domain (2D Fourier transformation).
  • Saliency map : IFFT of the difference w.r.t. average filtered log spectral. Kernel cross-correlator (KCC) to match.
  • No offline trainning needed. Low cost, higher recall rate than DBoW2 (as shown in the paper).

chrown0 Learning Feature Descriptors using Camera Pose Supervision, use camera pose (re-projected distance to epipolar line) error as loss function to train NN.

chrown0 Kapture: Robust Image Retrieval-based Visual Localization using Kapture data-driven features. Instead of manually describing how keypoints or image descriptions should look like, a large amount of data is used to train an algorithm to make this decision by itself.

chrown0 Multi-View Optimization of Local Feature Geometry Refining the geometry of local image features from multiple views without known scene or camera geometry. Optimize feature keypoints’ position based on multiple views.

  • Process feature extraction and feature matching steps.
  • Calculate visual flow \(T_{u\to v}(x_{u})\) between feature matches (as the jacobians), using CNN method.
  • Perform optimization for each feature track. Residual is weighted \((x_{v} - x_{u} - T_{u\to v}(x_{u}))\)
thumbs Cross-Descriptor Visual Localization and Mapping.
“translates” descriptors from one representation to another, using NN method.

unhappy Attention Guided Camera Localization. Roughly speaking, MapNet 2018 with attention.

2019

thumbs OANet Learning Two-View Correspondences and Geometry Using Order-Aware Network. In short, GNN based feature matches outlier rejection.

unhappy DIFL-FCL Domain-Invariant Feature Learning with Feature Consistency Loss. Train DL features which are robust to environment change (using GAN to generate train set). It may help when we are lack of real training images, while mostly it won’t happen.

unhappy Multi-Process Fusion . Ensemble methods for image retrieval process.

chrown Large-scale, real-time visual-inertial localization revisited review of different methods, finally use Keep it brief for map summarization.

2018

thumbs ToDayGAN. Use GAN to transform night image to bright day, then use the transformed image for image retrieval task.

unhappy Efficient adaptive non-maximal suppression algorithms for homogeneous spatial keypoint distribution

  • ANMS(Adaptive non-maximal suppression) based on Tree Data Structure (TDS).
  • Suppression via Square Covering (SSC)

2015

chrown chrown Keep it brief: Scalable creation of compressed localization maps use ILP (integral linear programming) to solve the summerization problem. (worth try)

../_images/vision_ilp.png
  • use google ortools to solve the ILP problem.
  • use SNAP to analysis the vision map graph.
  • tried this method in our benchmarks (keep 10% the map points, mean image observations drop from 1300 to 200), the localization result dropped within 10%.