Updated on 2025.12.26

2023-7

Pose Estimation

Publish Date Title Authors PDF Code
2023-08-03 Sim-to-Real Vision-depth Fusion CNNs for Robust Pose Estimation Aboard Autonomous Nano-quadcopter Luca Crupi et.al. 2308.01833 null
2023-08-03 Active Acoustic Sensing for Robot Manipulation Shihan Lu et.al. 2308.01600 null
2023-08-02 HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions Andrew Guo et.al. 2308.01477 null
2023-08-01 Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes Bohao Fan et.al. 2308.00628 link
2023-08-01 Markerless human pose estimation for biomedical applications: a survey Andrea Avogaro et.al. 2308.00519 null
2023-08-01 Kidnapping Deep Learning-based Multirotors using Optimized Flying Adversarial Patches Pia Hanfeld et.al. 2308.00344 null
2023-08-01 Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis Asish Bera et.al. 2308.00323 null
2023-08-01 Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF) Chaochao Zhou et.al. 2308.00214 null
2023-07-31 Lightweight Super-Resolution Head for Human Pose Estimation Haonan Wang et.al. 2307.16765 link
2023-07-31 DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation Runyang Feng et.al. 2307.16687 null
2023-07-30 Touch if it’s transparent! ACTOR: Active Tactile-based Category-Level Transparent Object Reconstruction Prajval Kumar Murali et.al. 2307.16254 null
2023-07-30 Successive Pose Estimation and Beam Tracking for mmWave Vehicular Communication Systems Cen Liu et.al. 2307.16117 null
2023-07-29 Iterative Graph Filtering Network for 3D Human Pose Estimation Zaedul Islam et.al. 2307.16074 link
2023-07-29 HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation Zuyan Liu et.al. 2307.16061 null
2023-07-29 Effective Whole-body Pose Estimation with Two-stages Distillation Zhendong Yang et.al. 2307.15880 link
2023-07-28 Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation Jaime Corsetti et.al. 2307.15514 null
2023-07-28 Robust Visual Sim-to-Real Transfer for Robotic Manipulation Ricardo Garcia et.al. 2307.15320 null
2023-07-27 Weakly Supervised Multi-Modal 3D Human Body Pose Estimation for Autonomous Driving Peter Bauer et.al. 2307.14889 null
2023-07-26 Attention of Robot Touch: Tactile Saliency Prediction for Robust Sim-to-Real Tactile Control Yijiong Lin et.al. 2307.14510 null
2023-07-28 CBGL: Fast Monte Carlo Passive Global Localisation of 2D LIDAR Sensor Alexandros Filotheou et.al. 2307.14247 link
2023-07-26 Deep Robust Multi-Robot Re-localisation in Natural Environments Milad Ramezani et.al. 2307.13950 null
2023-07-25 Of Mice and Pose: 2D Mouse Pose Estimation from Unlabelled Data and Synthetic Prior Jose Sosa et.al. 2307.13361 null
2023-07-23 TransNet: Transparent Object Manipulation Through Category-Level Pose Estimation Huijie Zhang et.al. 2307.12400 null
2023-07-25 FDCT: Fast Depth Completion for Transparent Objects Tianan Li et.al. 2307.12274 link
2023-07-22 Challenges for Monocular 6D Object Pose Estimation in Robotics Stefan Thalhammer et.al. 2307.12172 null
2023-07-22 Pyramid Semantic Graph-based Global Point Cloud Registration with Low Overlap Zhijian Qiao et.al. 2307.12116 link
2023-07-22 Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence Yang Tian et.al. 2307.12106 null

Visual Localization

Publish Date Title Authors PDF Code
2023-08-14 Global Features are All You Need for Image Retrieval and Reranking Shihao Shao et.al. 2308.06954 link
2023-08-14 MixBCT: Towards Self-Adapting Backward-Compatible Training Yu Liang et.al. 2308.06948 link
2023-08-10 KS-APR: Keyframe Selection for Robust Absolute Pose Regression Changkun Liu et.al. 2308.05459 null
2023-08-09 AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities Jingdan Zhang et.al. 2308.04992 link
2023-08-08 Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval Yi Bin et.al. 2308.04343 link
2023-08-08 Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval Yunquan Zhu et.al. 2308.04008 link
2023-08-05 A Comprehensive Analysis of Real-World Image Captioning and Scene Identification Sai Suprabhanu Nallapaneni et.al. 2308.02833 null
2023-08-03 Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies Eunsuk Seo et.al. 2308.01871 null
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688 link
2023-07-31 Guiding Image Captioning Models Toward More Specific Captions Simon Kornblith et.al. 2307.16686 null
2023-07-31 Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks Kousik Rajesh et.al. 2307.16395 null
2023-07-28 D2S: Representing local descriptors and global scene coordinates for camera relocalization Bach-Thuan Bui et.al. 2307.15250 null
2023-07-26 Neural-based Cross-modal Search and Retrieval of Artwork Yan Gong et.al. 2307.14244 null
2023-07-26 Boon: A Neural Search Engine for Cross-Modal Information Retrieval Yan Gong et.al. 2307.14240 null
2023-07-25 Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network Chull Hwan Song et.al. 2307.13254 null
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-19 Quantum Optics based Algorithm for Measuring the Similarity between Images Vivek Mehta et.al. 2307.09789 null
2023-07-18 Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments Max Moebius et.al. 2307.09172 null
2023-07-19 Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation Rundong Luo et.al. 2307.08779 null
2023-07-17 Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition Gabriele Trivigno et.al. 2307.08417 null
2023-07-17 Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification Tengfei Liang et.al. 2307.08316 null

Keypoint Detection

Publish Date Title Authors PDF Code
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-20 Neural Interactive Keypoint Detection Jie Yang et.al. 2308.10174 link
2023-08-19 ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment Bingyang Zhou et.al. 2308.09987 null
2023-08-16 DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479 link
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926 link
2023-08-15 ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition Wenyuan Xue et.al. 2308.07743 null
2023-08-14 DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport Sk Aziz Ali et.al. 2308.07153 null
2023-08-14 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds Minhao Li et.al. 2308.05667 null
2023-08-02 Automated Hit-frame Detection for Badminton Match Analysis Yu-Hang Chien et.al. 2307.16000 link
2023-07-25 Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception Chuanyu Luo et.al. 2307.13300 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698 link
2023-07-19 SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid Zi Li et.al. 2307.09727 null
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-26 CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild Li Ding et.al. 2306.15073 null
2023-06-28 Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset Ziqiao Weng et.al. 2306.07089 link
2023-06-07 Learning Probabilistic Coordinate Fields for Robust Correspondences Weiyue Zhao et.al. 2306.04231 null
2023-06-03 LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues Amitabha Dey et.al. 2306.02193 null
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null

2023-6

Keypoint Detection

Publish Date Title Authors PDF Code
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926 link
2023-08-15 ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition Wenyuan Xue et.al. 2308.07743 null
2023-08-14 DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport Sk Aziz Ali et.al. 2308.07153 null
2023-08-14 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds Minhao Li et.al. 2308.05667 null
2023-08-02 Automated Hit-frame Detection for Badminton Match Analysis Yu-Hang Chien et.al. 2307.16000 link
2023-07-25 Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception Chuanyu Luo et.al. 2307.13300 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698 link
2023-07-19 SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid Zi Li et.al. 2307.09727 null
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-26 CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild Li Ding et.al. 2306.15073 null
2023-06-28 Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset Ziqiao Weng et.al. 2306.07089 link
2023-06-07 Learning Probabilistic Coordinate Fields for Robust Correspondences Weiyue Zhao et.al. 2306.04231 null
2023-06-03 LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues Amitabha Dey et.al. 2306.02193 null
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null

2023-8

Pose Estimation

Publish Date Title Authors PDF Code
2023-09-05 A Robust Localization Solution for an Uncrewed Ground Vehicle in Unstructured Outdoor GNSS-Denied Environments W. Jacob Wagner et.al. 2309.02569 null
2023-09-05 GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction Youmin Zhang et.al. 2309.02436 null
2023-09-05 DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation Lei Zhou et.al. 2309.01925 null
2023-09-04 On the Query Strategies for Efficient Online Active Distillation Michele Boldo et.al. 2309.01612 null
2023-09-04 DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion Cédric Rommel et.al. 2309.01575 null
2023-09-06 Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation Hanbing Liu et.al. 2309.01365 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null
2023-09-02 Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for Pitch Analysis Jerrin Bright et.al. 2309.01010 null
2023-09-01 Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture Shaohua Pan et.al. 2309.00310 link
2023-08-31 EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild Manuel Kaufmann et.al. 2308.16894 link
2023-08-31 SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects Ning Gao et.al. 2308.16528 null
2023-08-30 Two-Stage Violence Detection Using ViTPose and Classification Models at Smart Airports İrem Üstek et.al. 2308.16325 link
2023-08-30 SignDiff: Learning Diffusion Models for American Sign Language Production Sen Fang et.al. 2308.16082 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 null
2023-08-30 Reconstructing Groups of People with Hypergraph Relational Reasoning Buzhen Huang et.al. 2308.15844 null
2023-08-29 3D-MuPPET: 3D Multi-Pigeon Pose Estimation and Tracking Urs Waldmann et.al. 2308.15316 null
2023-08-29 Spatio-temporal MLP-graph network for 3D human pose estimation Tanvir Hassan et.al. 2308.15313 link
2023-08-29 Pose-Free Neural Radiance Fields via Implicit Pose Regularization Jiahui Zhang et.al. 2308.15049 null
2023-08-28 R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras Aron Schmied et.al. 2308.14713 null
2023-08-28 Video-Based Hand Pose Estimation for Remote Assessment of Bradykinesia in Parkinson’s Disease Gabriela T. Acevedo Trebbau et.al. 2308.14679 null
2023-08-28 Active Pose Refinement for Textureless Shiny Objects using the Structured Light Camera Jun Yang et.al. 2308.14665 null
2023-08-28 CPFES: Physical Fitness Evaluation Based on Canadian Agility and Movement Skill Assessment Pengcheng Dong et.al. 2308.14324 null
2023-08-27 LDL: Line Distance Functions for Panoramic Localization Junho Kim et.al. 2308.13989 null
2023-08-26 Prior-guided Source-free Domain Adaptation for Human Pose Estimation Dripta S. Raychaudhuri et.al. 2308.13954 null
2023-08-26 Vision-Based Human Pose Estimation via Deep Learning: A Survey Gongjin Lan et.al. 2308.13872 null
2023-08-24 POCO: 3D Pose and Shape Estimation with Confidence Sai Kumar Dwivedi et.al. 2308.12965 null
2023-08-24 Robot Pose Nowcasting: Forecast the Future to Improve the Present Alessandro Simoni et.al. 2308.12914 null
2023-08-23 Certifiably Optimal Rotation and Pose Estimation Based on the Cayley Map Timothy D Barfoot et.al. 2308.12418 null
2023-08-22 Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape Jiacong Xu et.al. 2308.11737 null
2023-08-22 TrackFlow: Multi-Object Tracking with Normalizing Flows Gianluca Mancusi et.al. 2308.11513 null
2023-08-22 A LiDAR-Inertial SLAM Tightly-Coupled with Dropout-Tolerant GNSS Fusion for Autonomous Mine Service Vehicles Yusheng Wang et.al. 2308.11492 null
2023-08-22 PoseGraphNet++: Enriching 3D Human Pose with Orientation Estimation Soubarna Banik et.al. 2308.11440 null
2023-08-22 Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views Wentian Qu et.al. 2308.11198 null
2023-08-21 Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images Tze Ho Elden Tse et.al. 2308.11015 null
2023-08-21 Polarimetric Information for Multi-Modal 6D Pose Estimation of Photometrically Challenging Objects with Limited Data Patrick Ruhkamp et.al. 2308.10627 null
2023-08-21 GaitPT: Skeletons Are All You Need For Gait Recognition Andy Catruna et.al. 2308.10623 null
2023-08-21 Approximately Equivariant Graph Networks Ningyuan Huang et.al. 2308.10436 link
2023-08-21 In-Rack Test Tube Pose Estimation Using RGB-D Data Hao Chen et.al. 2308.10411 null
2023-08-20 Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video Yingxuan You et.al. 2308.10305 link
2023-08-20 OCHID-Fi: Occlusion-Robust Hand Pose Estimation in 3D via RF-Vision Shujie Zhang et.al. 2308.10146 null
2023-08-19 3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation Yi Zhang et.al. 2308.10123 link
2023-08-19 Pseudo Flow Consistency for Self-Supervised 6D Object Pose Estimation Yang Hai et.al. 2308.10016 link
2023-08-19 UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning Meiqi Sun et.al. 2308.09953 null
2023-08-22 Scene-Aware Feature Matching Xiaoyong Lu et.al. 2308.09949 null
2023-08-18 PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation Hanbing Liu et.al. 2308.09678 link
2023-08-18 Improving 3D Pose Estimation for Sign Language Maksym Ivashechkin et.al. 2308.09525 null
2023-08-18 Denoising Diffusion for 3D Hand Pose Estimation from Images Maksym Ivashechkin et.al. 2308.09523 null
2023-08-18 ResQ: Residual Quantization for Video Perception Davide Abati et.al. 2308.09511 null
2023-08-17 MovePose: A High-performance Human Pose Estimation Algorithm on Mobile and Edge Devices Dongyang Yu et.al. 2308.09084 null
2023-08-17 Pedestrian Environment Model for Automated Driving Adrian Holzbock et.al. 2308.09080 null
2023-08-17 Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction Yuhao Yang et.al. 2308.08518 null
2023-08-16 View Consistent Purification for Accurate Cross-View Localization Shan Wang et.al. 2308.08110 null
2023-08-15 Learning Better Keypoints for Multi-Object 6DoF Pose Estimation Yangzheng Wu et.al. 2308.07827 null
2023-08-14 Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation Huan Liu et.al. 2308.07313 link
2023-08-12 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion Guirong Zhuo et.al. 2308.06573 null
2023-08-17 EgoPoser: Robust Real-Time Ego-Body Pose Estimation in Large Scenes Jiaxi Jiang et.al. 2308.06493 null
2023-08-11 Aggressive Aerial Grasping using a Soft Drone with Onboard Perception Samuel Ubellacker et.al. 2308.06351 null
2023-08-11 VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields Dominic Maggio et.al. 2308.05939 null
2023-08-10 Toward Globally Optimal State Estimation Using Automatically Tightened Semidefinite Relaxations Frederike Dümbgen et.al. 2308.05783 null
2023-08-10 KS-APR: Keyframe Selection for Robust Absolute Pose Regression Changkun Liu et.al. 2308.05459 null
2023-08-10 How-to Augmented Lagrangian on Factor Graphs Barbara Bazzana et.al. 2308.05444 null
2023-08-10 Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose Estimation Jun Zhou et.al. 2308.05438 link
2023-08-10 Robust Localization with Visual-Inertial Odometry Constraints for Markerless Mobile AR Changkun Liu et.al. 2308.05394 null
2023-08-10 Double-chain Constraints for 3D Human Pose Estimation in Images and Videos Hongbo Kang et.al. 2308.05298 link
2023-08-09 ACE-HetEM for ab initio Heterogenous Cryo-EM 3D Reconstruction Weijie Chen et.al. 2308.04956 null
2023-08-07 SEM-GAT: Explainable Semantic Pose Estimation using Learned Graph Attention Efimia Panagiotaki et.al. 2308.03718 null
2023-08-07 A Horse with no Labels: Self-Supervised Horse Pose Estimation from Unlabelled Images and Synthetic Prior Jose Sosa et.al. 2308.03411 null
2023-08-06 Source-free Domain Adaptive Human Pose Estimation Qucheng Peng et.al. 2308.03202 null
2023-08-04 Diffusion-Augmented Depth Prediction with Sparse Annotations Jiaqi Li et.al. 2308.02283 null
2023-08-04 DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via Deformable Template Field Haowen Wang et.al. 2308.02239 null
2023-08-07 Robust Self-Supervised Extrinsic Self-Calibration Takayuki Kanai et.al. 2308.02153 null
2023-08-03 Sim-to-Real Vision-depth Fusion CNNs for Robust Pose Estimation Aboard Autonomous Nano-quadcopter Luca Crupi et.al. 2308.01833 null
2023-08-03 Active Acoustic Sensing for Robot Manipulation Shihan Lu et.al. 2308.01600 null
2023-08-02 HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions Andrew Guo et.al. 2308.01477 null
2023-08-06 Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes Bohao Fan et.al. 2308.00628 link
2023-08-01 Markerless human pose estimation for biomedical applications: a survey Andrea Avogaro et.al. 2308.00519 null
2023-08-01 Kidnapping Deep Learning-based Multirotors using Optimized Flying Adversarial Patches Pia Hanfeld et.al. 2308.00344 null
2023-08-01 Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis Asish Bera et.al. 2308.00323 null
2023-08-01 Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF) Chaochao Zhou et.al. 2308.00214 null
2023-07-31 Lightweight Super-Resolution Head for Human Pose Estimation Haonan Wang et.al. 2307.16765 link
2023-07-31 DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation Runyang Feng et.al. 2307.16687 null
2023-07-30 Touch if it’s transparent! ACTOR: Active Tactile-based Category-Level Transparent Object Reconstruction Prajval Kumar Murali et.al. 2307.16254 null
2023-07-30 Successive Pose Estimation and Beam Tracking for mmWave Vehicular Communication Systems Cen Liu et.al. 2307.16117 null
2023-07-29 Iterative Graph Filtering Network for 3D Human Pose Estimation Zaedul Islam et.al. 2307.16074 link

Visual Localization

Publish Date Title Authors PDF Code
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-13 RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline Mirko Usuelli et.al. 2309.07094 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438 link
2023-09-08 Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning Hiroki Nakamura et.al. 2309.04148 null
2023-09-05 Dual Relation Alignment for Composed Image Retrieval Xintong Jiang et.al. 2309.02169 null
2023-09-04 NLLB-CLIP – train performant multilingual image retrieval model on a budget Alexander Visheratin et.al. 2309.01859 null
2023-09-04 Target-Guided Composed Image Retrieval Haokun Wen et.al. 2309.01366 null
2023-09-02 Deep supervised hashing for fast retrieval of radio image cubes Steven Ndung’u et.al. 2309.00932 null
2023-08-31 Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval Prateksha Udhayanan et.al. 2308.16649 null
2023-08-28 Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics Nils Böhne et.al. 2308.14786 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746 link
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-26 Learning Efficient Representations for Image-Based Patent Retrieval Hongsong Wang et.al. 2308.13749 null
2023-08-25 Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers Mohammad Javad Rajabi et.al. 2308.13671 null
2023-08-24 Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities Jinze Bai et.al. 2308.12966 link
2023-08-23 Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval Huafeng Li et.al. 2308.11994 null
2023-08-23 OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes Tao Xie et.al. 2308.11928 link
2023-08-22 Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features Alberto Baldrati et.al. 2308.11485 link
2023-08-22 GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training Xinchi Deng et.al. 2308.11331 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-21 EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition Gabriele Berton et.al. 2308.10832 link
2023-08-20 FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory Anwesan Pal et.al. 2308.10170 null
2023-08-18 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion Yanmei Jiao et.al. 2308.09566 null
2023-08-17 FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Yulin Su et.al. 2308.09012 link
2023-08-16 Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval Aishwarya Venkataramanan et.al. 2308.08431 link
2023-08-16 Ranking-aware Uncertainty for Text-guided Image Retrieval Junyang Chen et.al. 2308.08131 null
2023-08-19 Global Features are All You Need for Image Retrieval and Reranking Shihao Shao et.al. 2308.06954 link
2023-08-14 MixBCT: Towards Self-Adapting Backward-Compatible Training Yu Liang et.al. 2308.06948 link
2023-08-10 KS-APR: Keyframe Selection for Robust Absolute Pose Regression Changkun Liu et.al. 2308.05459 null
2023-08-09 AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities Jingdan Zhang et.al. 2308.04992 link
2023-08-08 Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval Yi Bin et.al. 2308.04343 link
2023-08-08 Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval Yunquan Zhu et.al. 2308.04008 link
2023-08-05 A Comprehensive Analysis of Real-World Image Captioning and Scene Identification Sai Suprabhanu Nallapaneni et.al. 2308.02833 null
2023-08-03 Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies Eunsuk Seo et.al. 2308.01871 null
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688 link
2023-07-31 Guiding Image Captioning Models Toward More Specific Captions Simon Kornblith et.al. 2307.16686 null
2023-07-31 Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks Kousik Rajesh et.al. 2307.16395 null
2023-07-28 D2S: Representing local descriptors and global scene coordinates for camera relocalization Bach-Thuan Bui et.al. 2307.15250 null
2023-07-26 Neural-based Cross-modal Search and Retrieval of Artwork Yan Gong et.al. 2307.14244 null
2023-07-26 Boon: A Neural Search Engine for Cross-Modal Information Retrieval Yan Gong et.al. 2307.14240 null
2023-07-25 Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network Chull Hwan Song et.al. 2307.13254 null
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-19 Quantum Optics based Algorithm for Measuring the Similarity between Images Vivek Mehta et.al. 2307.09789 null

Keypoint Detection

Publish Date Title Authors PDF Code
2023-09-26 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 null
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563 null
2023-09-17 CryoAlign: feature-based method for global and local 3D alignment of EM density maps Bintao He et.al. 2309.09217 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null
2023-09-12 Improving the matching of deformable objects by learning to detect keypoints Felipe Cadar et.al. 2309.00434 link
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen et.al. 2308.16876 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 null
2023-08-29 A lightweight 3D dense facial landmark estimation model from position map data Shubhajit Basak et.al. 2308.15170 null
2023-08-27 Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors Francesco Pirotti et.al. 2308.14047 null
2023-08-24 VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition Gengxuan Tian et.al. 2308.12870 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-20 Neural Interactive Keypoint Detection Jie Yang et.al. 2308.10174 link
2023-08-19 ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment Bingyang Zhou et.al. 2308.09987 null
2023-09-03 DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479 link
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926 link
2023-08-15 ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition Wenyuan Xue et.al. 2308.07743 null
2023-08-14 DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport Sk Aziz Ali et.al. 2308.07153 null
2023-08-14 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds Minhao Li et.al. 2308.05667 null
2023-08-02 Automated Hit-frame Detection for Badminton Match Analysis Yu-Hang Chien et.al. 2307.16000 link
2023-07-25 Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception Chuanyu Luo et.al. 2307.13300 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698 link
2023-07-19 SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid Zi Li et.al. 2307.09727 null
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-26 CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild Li Ding et.al. 2306.15073 null
2023-06-28 Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset Ziqiao Weng et.al. 2306.07089 link
2023-06-07 Learning Probabilistic Coordinate Fields for Robust Correspondences Weiyue Zhao et.al. 2306.04231 null
2023-06-03 LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues Amitabha Dey et.al. 2306.02193 null

2023-9

Pose Estimation

Publish Date Title Authors PDF Code
2023-10-04 Condition numbers in multiview geometry, instability in relative pose estimation, and RANSAC Hongyi Fan et.al. 2310.02719 null
2023-10-04 USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields Moyang Li et.al. 2310.02687 null
2023-10-03 Beyond the Benchmark: Detecting Diverse Anomalies in Videos Yoav Arad et.al. 2310.01904 link
2023-10-03 MFOS: Model-Free & One-Shot Object Pose Estimation JongMin Lee et.al. 2310.01897 null
2023-10-02 LEAP: Liberate Sparse-view 3D Modeling from Camera Poses Hanwen Jiang et.al. 2310.01410 null
2023-10-02 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 null
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-09-30 Diff-DOPE: Differentiable Deep Object Pose Estimation Jonathan Tremblay et.al. 2310.00463 null
2023-09-29 Diver Identification Using Anthropometric Data Ratios for Underwater Multi-Human-Robot Collaboration Jungseok Hong et.al. 2310.00146 null
2023-09-29 Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation Zhuoran Yu et.al. 2310.00099 null
2023-09-29 Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head Qian Wu et.al. 2309.17143 link
2023-09-29 AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi Yunjiao Zhou et.al. 2309.16964 null
2023-09-28 End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon Guillaume Bono et.al. 2309.16634 null
2023-09-28 Off-the-shelf bin picking workcell with visual pose estimation: A case study on the world robot summit 2018 kitting task Frederik Hagelskjær et.al. 2309.16221 null
2023-09-28 Cloth2Body: Generating 3D Human Body Mesh from 2D Clothing Lu Dai et.al. 2309.16189 null
2023-09-28 Laboratory Automation: Precision Insertion with Adaptive Fingers utilizing Contact through Sliding with Tactile-based Pose Estimation Sameer Pai et.al. 2309.16170 null
2023-09-28 CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting Shaoxiang Guo et.al. 2309.16140 null
2023-09-28 A Modular Bio-inspired Robotic Hand with High Sensitivity Chao Liu et.al. 2309.16081 null
2023-09-27 Handbook on Leveraging Lines for Two-View Relative Pose Estimation Petr Hruby et.al. 2309.16040 null
2023-09-27 Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature Shengze Jin et.al. 2309.16023 null
2023-09-27 Analysis on Multi-robot Relative 6-DOF Pose Estimation Error Based on UWB Range Xinran Li et.al. 2309.15367 null
2023-09-26 Unsupervised Reconstruction of 3D Human Pose Interactions From 2D Poses Alone Peter Hardy et.al. 2309.14865 null
2023-09-26 Learning Vision-Based Bipedal Locomotion for Challenging Terrain Helei Duan et.al. 2309.14594 null
2023-09-25 Spring-IMU Fusion Based Proprioception for Feedback Control of Soft Manipulators Yinan Meng et.al. 2309.14279 null
2023-09-25 Industrial Application of 6D Pose Estimation for Robotic Manipulation in Automotive Internal Logistics Philipp Quentin et.al. 2309.14265 null
2023-09-25 BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation Uyoung Jeong et.al. 2309.14072 link
2023-09-24 Towards Subcentimeter Accuracy Digital-Twin Tracking via An RGBD-based Transformer Model and A Comprehensive Mobile Dataset Zixun Huang et.al. 2309.13570 null
2023-09-21 ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding Yu Cheng et.al. 2309.12183 null
2023-09-21 ZS6D: Zero-shot 6D Object Pose Estimation using Vision Transformers Philipp Ausserlechner et.al. 2309.11986 null
2023-09-21 Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views Taeho Kang et.al. 2309.11962 null
2023-09-21 A Real-Time Multi-Task Learning System for Joint Detection of Face, Facial Landmark and Head Pose Qingtian Wu et.al. 2309.11773 null
2023-09-20 Understanding Pose and Appearance Disentanglement in 3D Human Pose Estimation Krishna Kanth Nakka et.al. 2309.11667 null
2023-09-20 Online Supervised Training of Spaceborne Vision during Proximity Operations using Adaptive Kalman Filtering Tae Ha Park et.al. 2309.11645 null
2023-09-20 OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving Heng Li et.al. 2309.11011 null
2023-09-19 Language-Conditioned Affordance-Pose Detection in 3D Point Clouds Toan Nguyen et.al. 2309.10911 null
2023-09-19 MAGIC-TBR: Multiview Attention Fusion for Transformer-based Bodily Behavior Recognition in Group Settings Surbhi Madan et.al. 2309.10765 link
2023-09-19 SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction Anilkumar Swamy et.al. 2309.10748 null
2023-09-20 GloPro: Globally-Consistent Uncertainty-Aware 3D Human Pose Estimation & Tracking in the Wild Simon Schaefer et.al. 2309.10369 null
2023-09-19 RGB-based Category-level Object Pose Estimation via Decoupled Metric Scale Recovery Jiaxin Wei et.al. 2309.10255 null
2023-09-18 Hierarchical Attention and Graph Neural Networks: Toward Drift-Free Pose Estimation Kathia Melbouci et.al. 2309.09934 null
2023-09-18 Application-driven Validation of Posteriors in Inverse Problems Tim J. Adler et.al. 2309.09764 null
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563 null
2023-09-18 Sparse and Privacy-enhanced Representation for Human Pose Estimation Ting-Ying Lin et.al. 2309.09515 null
2023-09-19 RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation Lijun Li et.al. 2309.09301 link
2023-09-16 Optimal Initialization Strategies for Range-Only Trajectory Estimation Abhishek Goudar et.al. 2309.09011 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 null
2023-09-16 Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning Pengyu Yin et.al. 2309.08914 null
2023-09-15 Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild Sungchan Park et.al. 2309.08644 null
2023-09-15 YCB-Ev: Event-vision dataset for 6DoF object pose estimation Pavel Rojtberg et.al. 2309.08482 link
2023-09-15 Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM Chenghao Shi et.al. 2309.08086 null
2023-09-14 Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success Gergely Sóti et.al. 2309.08040 null
2023-09-14 TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting Rohan Choudhury et.al. 2309.07910 null
2023-09-14 Towards Robust and Unconstrained Full Range of Rotation Head Pose Estimation Thorsten Hempel et.al. 2309.07654 link
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-14 Unleashing the Power of Depth and Pose Estimation Neural Networks by Designing Compatible Endoscopic Images Junyang Wu et.al. 2309.07390 null
2023-09-13 LInKs “Lifting Independent Keypoints” – Partial Pose Lifting for Occlusion Handling with Improved Accuracy in 2D-3D Human Pose Estimation Peter Hardy et.al. 2309.07243 null
2023-09-13 3D Active Metric-Semantic SLAM Yuezhan Tao et.al. 2309.06950 null
2023-09-11 ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion Hongyu Li et.al. 2309.05662 null
2023-09-11 Towards Intuitive HMI for UAV Control Filip Zoric et.al. 2309.05460 null
2023-09-12 FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild Jiong Wang et.al. 2309.05073 link
2023-09-09 Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation Boyuan Jiang et.al. 2309.04756 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 null
2023-09-08 Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-07 ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation Hui Zhang et.al. 2309.03891 null
2023-09-05 An automated, high-resolution phenotypic assay for adult Brugia malayi and microfilaria Upender Kalwa et.al. 2309.03235 null
2023-09-05 A Robust Localization Solution for an Uncrewed Ground Vehicle in Unstructured Outdoor GNSS-Denied Environments W. Jacob Wagner et.al. 2309.02569 null
2023-09-05 GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction Youmin Zhang et.al. 2309.02436 link
2023-09-05 DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation Lei Zhou et.al. 2309.01925 link
2023-09-04 On the Query Strategies for Efficient Online Active Distillation Michele Boldo et.al. 2309.01612 null
2023-09-04 DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion Cédric Rommel et.al. 2309.01575 null
2023-09-06 Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation Hanbing Liu et.al. 2309.01365 link
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null
2023-09-02 Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for Pitch Analysis Jerrin Bright et.al. 2309.01010 null
2023-09-01 Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture Shaohua Pan et.al. 2309.00310 link
2023-08-31 EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild Manuel Kaufmann et.al. 2308.16894 link
2023-08-31 SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects Ning Gao et.al. 2308.16528 null
2023-08-30 Two-Stage Violence Detection Using ViTPose and Classification Models at Smart Airports İrem Üstek et.al. 2308.16325 link
2023-08-30 SignDiff: Learning Diffusion Models for American Sign Language Production Sen Fang et.al. 2308.16082 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 null
2023-08-30 Reconstructing Groups of People with Hypergraph Relational Reasoning Buzhen Huang et.al. 2308.15844 null
2023-08-29 3D-MuPPET: 3D Multi-Pigeon Pose Estimation and Tracking Urs Waldmann et.al. 2308.15316 null
2023-08-29 Spatio-temporal MLP-graph network for 3D human pose estimation Tanvir Hassan et.al. 2308.15313 link
2023-08-29 Pose-Free Neural Radiance Fields via Implicit Pose Regularization Jiahui Zhang et.al. 2308.15049 null

Keypoint Detection

Publish Date Title Authors PDF Code
2023-11-06 TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains Alexander Naumann et.al. 2311.03124 null
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842 null
2023-10-20 Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification Mateus Roder et.al. 2310.13490 null
2023-10-12 UniPose: Detecting Any Keypoints Jie Yang et.al. 2310.08530 link
2023-10-10 l-dyno: framework to learn consistent visual features using robot’s motion Kartikeya Singh et.al. 2310.06249 null
2023-10-10 Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face Hao Zhang et.al. 2310.05056 null
2023-10-13 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 link
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563 null
2023-09-17 CryoAlign: feature-based method for global and local 3D alignment of EM density maps Bintao He et.al. 2309.09217 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null
2023-09-12 Improving the matching of deformable objects by learning to detect keypoints Felipe Cadar et.al. 2309.00434 link
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen et.al. 2308.16876 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 null
2023-08-29 A lightweight 3D dense facial landmark estimation model from position map data Shubhajit Basak et.al. 2308.15170 null
2023-08-27 Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors Francesco Pirotti et.al. 2308.14047 null
2023-08-24 VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition Gengxuan Tian et.al. 2308.12870 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-20 Neural Interactive Keypoint Detection Jie Yang et.al. 2308.10174 link
2023-08-19 ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment Bingyang Zhou et.al. 2308.09987 null
2023-09-03 DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479 link

Visual Localization

Publish Date Title Authors PDF Code
2023-10-06 ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer Yifan Xu et.al. 2310.04099 null
2023-10-06 Sub-token ViT Embedding via Stochastic Resonance Transformers Dong Lao et.al. 2310.03967 null
2023-10-04 Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach Matthew Hanlon et.al. 2310.02650 null
2023-10-02 NEUCORE: Neural Concept Reasoning for Composed Image Retrieval Shu Zhao et.al. 2310.01358 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-05 PlaceNav: Topological Navigation through Place Recognition Lauri Suomela et.al. 2309.17260 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992 link
2023-09-28 Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning Albert Mohwald et.al. 2309.16351 link
2023-09-28 FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding Pengxiang Wu et.al. 2309.16249 link
2023-09-28 Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2309.16137 null
2023-09-27 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization Vicente Vivanco Cepeda et.al. 2309.16020 null
2023-09-27 Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization Zhenbo Song et.al. 2309.15556 null
2023-09-26 Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features Hila Levi et.al. 2309.14999 null
2023-09-23 Resolving References in Visually-Grounded Dialogue via Text Generation Bram Willemsen et.al. 2309.13430 link
2023-09-21 Face Identity-Aware Disentanglement in StyleGAN Adrian Suwała et.al. 2309.12033 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883 null
2023-09-20 2D-3D Pose Tracking with Multi-View Constraints Huai Yu et.al. 2309.11335 null
2023-09-19 VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition Adam D. Hines et.al. 2309.10225 link
2023-09-11 Introspective Deep Metric Learning Chengkun Wang et.al. 2309.09982 null
2023-09-18 Decompose Semantic Shifts for Composed Image Retrieval Xingyu Yang et.al. 2309.09531 null
2023-09-16 Efficient Object Rearrangement via Multi-view Fusion Dehao Huang et.al. 2309.08994 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 null
2023-09-15 Active Learning for Fine-Grained Sketch-Based Image Retrieval Himanshu Thakur et.al. 2309.08743 null
2023-09-15 Optimization of Rank Losses for Image Retrieval Elias Ramzi et.al. 2309.08250 link
2023-09-18 Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer Yaoting Wang et.al. 2309.07929 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-13 RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline Mirko Usuelli et.al. 2309.07094 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438 link
2023-09-08 Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning Hiroki Nakamura et.al. 2309.04148 null
2023-09-05 Dual Relation Alignment for Composed Image Retrieval Xintong Jiang et.al. 2309.02169 null
2023-09-04 NLLB-CLIP – train performant multilingual image retrieval model on a budget Alexander Visheratin et.al. 2309.01859 null
2023-09-04 Target-Guided Composed Image Retrieval Haokun Wen et.al. 2309.01366 null
2023-09-02 Deep supervised hashing for fast retrieval of radio image cubes Steven Ndung’u et.al. 2309.00932 null
2023-08-31 Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval Prateksha Udhayanan et.al. 2308.16649 null
2023-08-28 Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics Nils Böhne et.al. 2308.14786 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746 link
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-26 Learning Efficient Representations for Image-Based Patent Retrieval Hongsong Wang et.al. 2308.13749 null
2023-08-25 Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers Mohammad Javad Rajabi et.al. 2308.13671 null

2023-10

Visual Localization

Publish Date Title Authors PDF Code
2023-11-13 Pretrain like You Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval Junyang Chen et.al. 2311.07622 null
2023-11-13 VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search Shuting He et.al. 2311.07514 null
2023-11-10 Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval Xin Lu et.al. 2311.06067 null
2023-11-08 Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model Junya Shiraishi et.al. 2311.04788 null
2023-11-08 Training CLIP models on Data from Scientific Papers Calvin Metzger et.al. 2311.04711 link
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098 link
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345 null
2023-11-06 FocusTune: Tuning Visual Localization through Focus-Guided Sampling Son Tung Nguyen et.al. 2311.02872 link
2023-11-01 DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing Gaoshuang Huang et.al. 2311.00230 null
2023-10-29 Identifiable Contrastive Learning with Automatic Feature Importance Discovery Qi Zhang et.al. 2310.18904 link
2023-10-27 LipSim: A Provably Robust Perceptual Similarity Metric Sara Ghazanfari et.al. 2310.18274 link
2023-10-27 Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation Susu Fang et.al. 2310.17879 null
2023-10-25 FoundLoc: Vision-based Onboard Aerial Localization in the Wild Yao He et.al. 2310.16299 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504 null
2023-10-23 Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval Xu Yuan et.al. 2310.14637 link
2023-10-21 Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation Anastasia Kritharoula et.al. 2310.14025 link
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-20 CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants Shaoan Wang et.al. 2310.13320 link
2023-10-27 Representation Learning via Consistent Assignment of Views over Random Partitions Thalles Silva et.al. 2310.12692 link
2023-10-18 Evaluating the Fairness of Discriminative Foundation Models in Computer Vision Junaid Ali et.al. 2310.11867 null
2023-10-17 Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification Shuanglin Yan et.al. 2310.11210 null
2023-10-16 Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People Dharmateja Adapa et.al. 2310.10290 null
2023-10-16 EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge Tom Bryan et.al. 2310.10050 null
2023-10-15 CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes Yulei Qin et.al. 2310.09761 link
2023-10-13 Pairwise Similarity Learning is SimPLE Yandong Wen et.al. 2310.09449 null
2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval Shyamgopal Karthik et.al. 2310.09291 null
2023-10-12 Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning Shiyang Yan et.al. 2310.08390 null
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization Le Chen et.al. 2310.06984 null
2023-10-10 Distillation Improves Visual Place Recognition for Low-Quality Queries Anbang Yang et.al. 2310.06906 null
2023-10-10 Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets Jiajun Zhang et.al. 2310.06566 null
2023-10-10 Topological RANSAC for instance verification and retrieval without fine-tuning Guoyuan An et.al. 2310.06486 null
2023-10-10 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments Ghanta Sai Krishna et.al. 2310.06385 null
2023-10-09 Collaborative Visual Place Recognition Yiming Li et.al. 2310.05541 null
2023-10-09 Sentence-level Prompts Benefit Composed Image Retrieval Yang Bai et.al. 2310.05473 link
2023-10-08 AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition Feng Lu et.al. 2310.05184 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-10-12 ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer Yifan Xu et.al. 2310.04099 null
2023-10-06 Sub-token ViT Embedding via Stochastic Resonance Transformers Dong Lao et.al. 2310.03967 null
2023-10-04 Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach Matthew Hanlon et.al. 2310.02650 null
2023-10-02 NEUCORE: Neural Concept Reasoning for Composed Image Retrieval Shu Zhao et.al. 2310.01358 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-05 PlaceNav: Topological Navigation through Place Recognition Lauri Suomela et.al. 2309.17260 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992 link
2023-09-28 Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning Albert Mohwald et.al. 2309.16351 link
2023-09-28 FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding Pengxiang Wu et.al. 2309.16249 link
2023-09-28 Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2309.16137 null
2023-09-27 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization Vicente Vivanco Cepeda et.al. 2309.16020 null
2023-09-27 Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization Zhenbo Song et.al. 2309.15556 null
2023-09-26 Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features Hila Levi et.al. 2309.14999 null
2023-09-23 Resolving References in Visually-Grounded Dialogue via Text Generation Bram Willemsen et.al. 2309.13430 link
2023-09-21 Face Identity-Aware Disentanglement in StyleGAN Adrian Suwała et.al. 2309.12033 null

Pose Estimation

Publish Date Title Authors PDF Code
2023-11-06 A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation Qitao Zhao et.al. 2311.03312 null
2023-11-06 Enabling In-Situ Resources Utilisation by leveraging collaborative robotics and astronaut-robot interaction Silvia Romero-Azpitarte et.al. 2311.03146 null
2023-11-06 Simultaneous Time Synchronization and Mutual Localization for Multi-robot System Xiangyong Wen et.al. 2311.02948 null
2023-11-06 Initialisation of Autonomous Aircraft Visual Inspection Systems via CNN-Based Camera Pose Estimation Xueyan Oh et.al. 2311.02900 null
2023-11-06 Efficient, Self-Supervised Human Pose Estimation with Inductive Prior Tuning Nobline Yoo et.al. 2311.02815 link
2023-11-03 Generating Unbiased Pseudo-labels via a Theoretically Guaranteed Chebyshev Constraint to Unify Semi-supervised Classification and Regression Jiaqi Wu et.al. 2311.01782 link
2023-11-03 Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation Jiaqi Wu et.al. 2311.01770 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380 link
2023-11-01 A Spatial-Temporal Transformer based Framework For Human Pose Assessment And Correction in Education Scenarios Wenyang Hu et.al. 2311.00401 null
2023-10-31 HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception Junkun Yuan et.al. 2310.20695 link
2023-10-31 Pose-to-Motion: Cross-Domain Motion Retargeting with Pose Prior Qingqing Zhao et.al. 2310.20249 null
2023-10-30 FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound Chaoyu Chen et.al. 2310.19293 null
2023-10-29 Distributed Nonlinear Filtering using Triangular Transport Maps Daniel Grange et.al. 2310.19000 null
2023-10-29 TIC-TAC: A Framework To Learn And Evaluate Your Covariance Megh Shukla et.al. 2310.18953 link
2023-10-29 Improving Multi-Person Pose Tracking with A Confidence Network Zehua Fu et.al. 2310.18920 null
2023-10-29 HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration Weiyi Xue et.al. 2310.18874 null
2023-10-27 ProcNet: Deep Predictive Coding Model for Robust-to-occlusion Visual Segmentation and Pose Estimation Michael Zechmair et.al. 2310.18009 null
2023-10-26 Learning Extrinsic Dexterity with Parameterized Manipulation Primitives Shih-Min Yang et.al. 2310.17785 null
2023-10-26 6-DoF Stability Field via Diffusion Models Takuma Yoneda et.al. 2310.17649 null
2023-10-26 SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation Haobo Jiang et.al. 2310.17359 null
2023-10-26 Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs Ryota Tanaka et.al. 2310.17193 link
2023-10-25 Real-time 6-DoF Pose Estimation by an Event-based Camera using Active LED Markers Gerald Ebmer et.al. 2310.16618 null
2023-10-25 ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors Xiaoxuan Ma et.al. 2310.16447 link
2023-10-25 MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network Soroush Mehraban et.al. 2310.16288 link
2023-10-25 TransPose: 6D Object Pose Estimation with Geometry-Aware Transformer Xiao Lin et.al. 2310.16279 null
2023-10-23 Converting Depth Images and Point Clouds for Feature-based Pose Estimation Robert Lösch et.al. 2310.14924 link
2023-10-23 Object Pose Estimation Annotation Pipeline for Multi-view Monocular Camera Systems in Industrial Settings Hazem Youssef et.al. 2310.14914 null
2023-10-23 Player Re-Identification Using Body Part Appearences Mahesh Bhosale et.al. 2310.14469 null
2023-10-20 LanPose: Language-Instructed 6D Object Pose Estimation for Robotic Assembly Bowen Fu et.al. 2310.13819 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-20 ColAG: A Collaborative Air-Ground Framework for Perception-Limited UGVs’ Navigation Zhehan Li et.al. 2310.13324 link
2023-10-20 CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants Shaoan Wang et.al. 2310.13320 link
2023-10-19 Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey Lijuan Zhou et.al. 2310.13039 null
2023-10-19 FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects Mayank Lunayach et.al. 2310.12974 null
2023-10-18 Mesh Represented Recycle Learning for 3D Hand Pose and Mesh Estimation Bosang Kim et.al. 2310.12189 null
2023-10-18 One-Shot Imitation Learning: A Pose Estimation Perspective Pietro Vitiello et.al. 2310.12077 null
2023-10-18 ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map Ahmed Tawfik Aboukhadra et.al. 2310.11811 null
2023-10-17 Holistic Parking Slot Detection with Polygon-Shaped Representations Lihao Wang et.al. 2310.11629 null
2023-10-17 Diver Interest via Pointing in Three Dimensions: 3D Pointing Reconstruction for Diver-AUV Communication Chelsey Edge et.al. 2310.11536 null
2023-10-18 AP $n$P: A Less-constrained P$n$ P Solver for Pose Estimation with Unknown Anisotropic Scaling or Focal Lengths Jiaxin Wei et.al. 2310.09982 link
2023-10-15 Tabletop Transparent Scene Reconstruction via Epipolar-Guided Optical Flow with Monocular Depth Completion Prior Xiaotong Chen et.al. 2310.09956 null
2023-10-15 Socially reactive navigation models for mobile robots in dynamic environments Ricarte Ribeiro et.al. 2310.09916 null
2023-10-15 MoEmo Vision Transformer: Integrating Cross-Attention and Movement Vectors in 3D Pose Estimation for HRI Emotion Detection David C. Jeong et.al. 2310.09757 null
2023-10-16 IMU Preintegration for Multi-Robot Systems in the Presence of Bias and Communication Constraints Mohammed Ayman Shalaby et.al. 2310.08686 null
2023-10-12 Towards Design and Development of an ArUco Markers-Based Quantitative Surface Tactile Sensor Ozdemir Can Kara et.al. 2310.08398 null
2023-10-12 Multimodal Active Measurement for Human Mesh Recovery in Close Proximity Takahiro Maeda et.al. 2310.08116 null
2023-10-12 X-HRNet: Towards Lightweight Human Pose Estimation with Spatially Unidimensional Self-Attention Yixuan Zhou et.al. 2310.08042 link
2023-10-12 PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction Jia-Wang Bian et.al. 2310.07449 null
2023-10-11 SAGE-ICP: Semantic Information-Assisted ICP Jiaming Cui et.al. 2310.07237 null
2023-10-11 DeepSimHO: Stable Pose Estimation for Hand-Object Interaction via Physics Simulation Rong Wang et.al. 2310.07206 null
2023-10-12 FABind: Fast and Accurate Protein-Ligand Binding Qizhi Pei et.al. 2310.06763 link
2023-10-10 EARL: Eye-on-Hand Reinforcement Learner for Dynamic Grasping with Active Pose Estimation Baichuan Huang et.al. 2310.06751 null
2023-10-09 Augmenting Vision-Based Human Pose Estimation with Rotation Matrix Milad Vazan et.al. 2310.06068 null
2023-10-07 Federated Self-Supervised Learning of Monocular Depth Estimators for Autonomous Vehicles Elton F. de S. Soares et.al. 2310.04837 null
2023-10-10 1st Place Solution of Egocentric 3D Hand Pose Estimation Challenge 2023 Technical Report:A Concise Pipeline for Egocentric Hand Pose Reconstruction Zhishan Zhou et.al. 2310.04769 null
2023-10-06 SwimXYZ: A large-scale dataset of synthetic swimming motions and videos Fiche Guénolé et.al. 2310.04360 null
2023-10-05 BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields Ágoston István Csehi et.al. 2310.03563 null
2023-10-05 3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation Chen Zhao et.al. 2310.03534 null
2023-10-05 RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation Boshi An et.al. 2310.03478 null
2023-10-05 Cyber Physical System Information Collection: Robot Location and Navigation Method Based on QR Code Hongwei Li et.al. 2310.03470 null
2023-10-04 Condition numbers in multiview geometry, instability in relative pose estimation, and RANSAC Hongyi Fan et.al. 2310.02719 null
2023-10-05 USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields Moyang Li et.al. 2310.02687 null
2023-10-03 Beyond the Benchmark: Detecting Diverse Anomalies in Videos Yoav Arad et.al. 2310.01904 link
2023-10-03 MFOS: Model-Free & One-Shot Object Pose Estimation JongMin Lee et.al. 2310.01897 null
2023-10-02 LEAP: Liberate Sparse-view 3D Modeling from Camera Poses Hanwen Jiang et.al. 2310.01410 null
2023-10-02 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 null
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-09-30 Diff-DOPE: Differentiable Deep Object Pose Estimation Jonathan Tremblay et.al. 2310.00463 null
2023-09-29 Diver Identification Using Anthropometric Data Ratios for Underwater Multi-Human-Robot Collaboration Jungseok Hong et.al. 2310.00146 null
2023-09-29 Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation Zhuoran Yu et.al. 2310.00099 null
2023-09-29 Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head Qian Wu et.al. 2309.17143 link
2023-09-29 AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi Yunjiao Zhou et.al. 2309.16964 null

Keypoint Detection

Publish Date Title Authors PDF Code
2023-11-27 A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor Jialin Liu et.al. 2311.15609 null
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291 null
2023-11-20 CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement Boni Hu et.al. 2311.11604 link
2023-11-17 Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration Paul J. Claasen et.al. 2311.10361 null
2023-11-13 Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning Tomáš Kunzo et.al. 2311.07398 null
2023-11-11 CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer Haoyu Ma et.al. 2311.06443 null
2023-11-08 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud Jianchao Ci et.al. 2311.04699 null
2023-11-06 TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains Alexander Naumann et.al. 2311.03124 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842 null
2023-10-20 Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification Mateus Roder et.al. 2310.13490 null
2023-10-12 UniPose: Detecting Any Keypoints Jie Yang et.al. 2310.08530 link
2023-10-10 l-dyno: framework to learn consistent visual features using robot’s motion Kartikeya Singh et.al. 2310.06249 null
2023-10-10 Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face Hao Zhang et.al. 2310.05056 null
2023-10-13 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 link
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563 null
2023-09-17 CryoAlign: feature-based method for global and local 3D alignment of EM density maps Bintao He et.al. 2309.09217 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null

2023-11

Pose Estimation

Publish Date Title Authors PDF Code
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141 null
2023-12-04 SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM Nikhil Keetha et.al. 2312.02126 null
2023-12-04 Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection Xubin Zhong et.al. 2312.01713 null
2023-12-04 Hulk: A Universal Knowledge Translator for Human-Centric Tasks Yizhou Wang et.al. 2312.01697 link
2023-12-04 Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks Yan Xu et.al. 2312.01561 null
2023-12-01 Object 6D pose estimation meets zero-shot learning Andrea Caraffa et.al. 2312.00947 null
2023-12-01 Open-vocabulary object 6D pose estimation Jaime Corsetti et.al. 2312.00690 null
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500 null
2023-12-01 Learning Unorthogonalized Matrices for Rotation Estimation Kerui Gu et.al. 2312.00462 null
2023-11-30 PoseGPT: Chatting about 3D Human Pose Yao Feng et.al. 2311.18836 null
2023-11-30 FoundPose: Unseen Object Pose Estimation with Foundation Features Evin Pınar Örnek et.al. 2311.18809 null
2023-11-30 Pose Estimation and Tracking for ASIST Ari Goodman et.al. 2311.18665 null
2023-11-29 A Stochastic-Geometrical Framework for Object Pose Estimation based on Mixture Models Avoiding the Correspondence Problem Wolfgang Hoegele et.al. 2311.18107 null
2023-11-29 Pose Anything: A Graph-Based Approach for Category-Agnostic Pose Estimation Or Hirschorn et.al. 2311.17891 link
2023-11-29 Cinematic Behavior Transfer via NeRF-based Differentiable Filming Xuekun Jiang et.al. 2311.17754 null
2023-11-29 PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens Sebastian Stapf et.al. 2311.17504 null
2023-11-28 On the Calibration of Human Pose Estimation Kerui Gu et.al. 2311.17105 null
2023-11-28 Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence Junyi Zhang et.al. 2311.17034 null
2023-11-28 HandyPriors: Physically Consistent Perception of Hand-Object Interactions with Differentiable Priors Shutong Zhang et.al. 2311.16552 null
2023-11-28 Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement Jian Wang et.al. 2311.16495 null
2023-11-24 UniHPE: Towards Unified Human Pose Estimation via Contrastive Learning Zhongyu Jiang et.al. 2311.16477 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060 link
2023-11-27 Uncertainty Quantification of Set-Membership Estimation in Control and Perception: Revisiting the Minimum Enclosing Ellipsoid Yukai Tang et.al. 2311.15962 null
2023-11-27 Computer Vision for Carriers: PATRIOT Ari Goodman et.al. 2311.15914 null
2023-11-27 SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation Jiehong Lin et.al. 2311.15707 link
2023-11-24 RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling Xiaoyue Wan et.al. 2311.14242 null
2023-11-23 Appearance-based gaze estimation enhanced with synthetic images using deep neural networks Dmytro Herashchenko et.al. 2311.14175 link
2023-11-23 GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence Van Nguyen Nguyen et.al. 2311.14155 link
2023-11-23 GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence Pengyuan Wang et.al. 2311.13777 null
2023-11-22 HEViTPose: High-Efficiency Vision Transformer for Human Pose Estimation Chengpeng Wu et.al. 2311.13615 link
2023-11-24 Calibration System and Algorithm Design for a Soft Hinged Micro Scanning Mirror with a Triaxial Hall Effect Sensor Di Wang et.al. 2311.12778 null
2023-11-21 HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation Yongliang Lin et.al. 2311.12588 null
2023-11-21 CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems Young-Hee Lee et.al. 2311.12580 null
2023-11-21 HCA-Net: Hierarchical Context Attention Network for Intervertebral Disc Semantic Labeling Afshin Bozorgpour et.al. 2311.12486 link
2023-11-21 Two Views Are Better than One: Monocular 3D Pose Estimation with Multiview Consistency Christian Keilstrup Ingwersen et.al. 2311.12421 null
2023-11-20 Fingerspelling PoseNet: Enhancing Fingerspelling Translation with Pose-Based Transformer Models Pooya Fayyazsanavi et.al. 2311.12128 link
2023-11-20 Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation Wenhao Li et.al. 2311.12028 null
2023-11-20 SniffyArt: The Dataset of Smelling Persons Mathias Zinnen et.al. 2311.11888 null
2023-11-21 Robot Hand-Eye Calibration using Structure-from-Motion Nicolas Andreff et.al. 2311.11808 null
2023-11-18 SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation Yamei Chen et.al. 2311.11125 null
2023-11-18 Synthetic Data Generation for Bridging Sim2Real Gap in a Production Environment Parth Rawal et.al. 2311.11039 null
2023-11-18 Multiple View Geometry Transformers for 3D Human Pose Estimation Ziwei Liao et.al. 2311.10983 null
2023-11-18 Jenga Stacking Based on 6D Pose Estimation for Architectural Form Finding Process Zixun Huang et.al. 2311.10918 null
2023-11-17 BiHRNet: A Binary high-resolution network for Human Pose Estimation Zhicheng Zhang et.al. 2311.10296 null
2023-11-16 Match and Locate: low-frequency monocular odometry based on deep feature matching Stepan Konev et.al. 2311.10034 null
2023-11-16 LIO-EKF: High Frequency LiDAR-Inertial Odometry using Extended Kalman Filters Yibin Wu et.al. 2311.09887 null
2023-11-16 Improved TokenPose with Sparsity Anning Li et.al. 2311.09653 null
2023-11-16 Pseudo-keypoints RKHS Learning for Self-supervised 6DoF Pose Estimation Yangzheng Wu et.al. 2311.09500 null
2023-11-15 NormNet: Scale Normalization for 6D Pose Estimation in Stacked Scenarios En-Te Lin et.al. 2311.09269 link
2023-11-15 Range-Visual-Inertial Sensor Fusion for Micro Aerial Vehicle Localization and Navigation Abhishek Goudar et.al. 2311.09056 link
2023-11-14 LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot grasping Sujal Vijayaraghavan et.al. 2311.08438 null
2023-11-13 SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models Ziyi Lin et.al. 2311.07575 link
2023-11-13 Bio-Inspired Grasping Controller for Sensorized 2-DoF Grippers Luca Lach et.al. 2311.07257 link
2023-11-10 CESPED: a new benchmark for supervised particle pose estimation in Cryo-EM Ruben Sanchez-Garcia et.al. 2311.06194 null
2023-11-10 2D Image head pose estimation via latent space regression under occlusion settings José Celestino et.al. 2311.06038 link
2023-11-10 Robust Adversarial Attacks Detection for Deep Learning based Relative Pose Estimation for Space Rendezvous Ziwei Wang et.al. 2311.05992 null
2023-11-10 A Practical Guide to Implementing Off-Axis Stereo Projection Using Existing Ray Tracing Libraries Stefan Zellmann et.al. 2311.05887 null
2023-11-09 Visually Guided Model Predictive Robot Control via 6D Object Pose Localization and Tracking Mederic Fourmy et.al. 2311.05344 null
2023-11-09 Spatial Attention-based Distribution Integration Network for Human Pose Estimation Sihan Gao et.al. 2311.05323 null
2023-11-09 SPADES: A Realistic Spacecraft Pose Estimation Dataset using Event Sensing Arunkumar Rathinam et.al. 2311.05310 null
2023-11-09 Differentiable Cloth Parameter Identification and State Estimation in Manipulation Dongzhe Zheng et.al. 2311.05141 null
2023-11-09 POISE: Pose Guided Human Silhouette Extraction under Occlusions Arindam Dutta et.al. 2311.05077 link
2023-11-08 Active Transfer Learning for Efficient Video-Specific Human Pose Estimation Hiromu Taketsugu et.al. 2311.05041 link
2023-11-08 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud Jianchao Ci et.al. 2311.04699 null
2023-11-09 Rethinking Human Pose Estimation for Autonomous Driving with 3D Event Representations Xiaoting Yin et.al. 2311.04591 link
2023-11-08 Learning Robust Multi-Scale Representation for Neural Radiance Fields from Unposed Images Nishant Jain et.al. 2311.04521 null
2023-11-08 PLV-IEKF: Consistent Visual-Inertial Odometry using Points, Lines, and Vanishing Points Tong Hua et.al. 2311.04477 null
2023-11-08 UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields Injae Kim et.al. 2311.03784 null
2023-11-06 A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation Qitao Zhao et.al. 2311.03312 null
2023-11-06 Enabling In-Situ Resources Utilisation by leveraging collaborative robotics and astronaut-robot interaction Silvia Romero-Azpitarte et.al. 2311.03146 null
2023-11-06 Simultaneous Time Synchronization and Mutual Localization for Multi-robot System Xiangyong Wen et.al. 2311.02948 null
2023-11-06 Initialisation of Autonomous Aircraft Visual Inspection Systems via CNN-Based Camera Pose Estimation Xueyan Oh et.al. 2311.02900 null
2023-11-06 Efficient, Self-Supervised Human Pose Estimation with Inductive Prior Tuning Nobline Yoo et.al. 2311.02815 link
2023-11-03 Generating Unbiased Pseudo-labels via a Theoretically Guaranteed Chebyshev Constraint to Unify Semi-supervised Classification and Regression Jiaqi Wu et.al. 2311.01782 link
2023-11-03 Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation Jiaqi Wu et.al. 2311.01770 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380 link
2023-11-01 A Spatial-Temporal Transformer based Framework For Human Pose Assessment And Correction in Education Scenarios Wenyang Hu et.al. 2311.00401 null
2023-10-31 HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception Junkun Yuan et.al. 2310.20695 link
2023-10-31 Pose-to-Motion: Cross-Domain Motion Retargeting with Pose Prior Qingqing Zhao et.al. 2310.20249 null
2023-10-30 FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound Chaoyu Chen et.al. 2310.19293 null
2023-10-29 Distributed Nonlinear Filtering using Triangular Transport Maps Daniel Grange et.al. 2310.19000 null
2023-10-29 TIC-TAC: A Framework To Learn And Evaluate Your Covariance Megh Shukla et.al. 2310.18953 link
2023-10-29 Improving Multi-Person Pose Tracking with A Confidence Network Zehua Fu et.al. 2310.18920 null
2023-10-29 HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration Weiyi Xue et.al. 2310.18874 null
2023-10-27 ProcNet: Deep Predictive Coding Model for Robust-to-occlusion Visual Segmentation and Pose Estimation Michael Zechmair et.al. 2310.18009 null
2023-10-26 Learning Extrinsic Dexterity with Parameterized Manipulation Primitives Shih-Min Yang et.al. 2310.17785 null

Visual Localization

Publish Date Title Authors PDF Code
2023-12-11 Dynamic Weighted Combiner for Mixed-Modal Image Retrieval Fuxiang Huang et.al. 2312.06179 null
2023-12-06 Lite-Mind: Towards Efficient and Versatile Brain Representation Network Zixuan Gong et.al. 2312.03781 null
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428 link
2023-12-04 Implicit Learning of Scene Geometry from Poses for Global Localization Mohammad Altillawi et.al. 2312.02029 null
2023-12-04 Language-only Efficient Training of Zero-shot Composed Image Retrieval Geonmo Gu et.al. 2312.01998 link
2023-12-03 G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu et.al. 2312.01522 null
2023-12-01 Improve Supervised Representation Learning with Masked Image Modeling Kaifeng Chen et.al. 2312.00950 null
2023-12-05 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers Walid Bousselham et.al. 2312.00878 link
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500 null
2023-11-30 HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance Zhuohao Yin et.al. 2311.18273 link
2023-11-30 Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models Raviteja Vemulapalli et.al. 2311.18237 null
2023-11-29 Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce Chang Liu et.al. 2311.17954 null
2023-11-28 Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames Chao Chen et.al. 2311.17940 null
2023-11-29 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries Huajian Huang et.al. 2311.17389 null
2023-11-27 Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation Samuele Poppi et.al. 2311.16254 link
2023-11-27 Optimal Transport Aggregation for Visual Place Recognition Sergio Izquierdo et.al. 2311.15937 link
2023-11-27 AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval Shicheng Xu et.al. 2311.14084 null
2023-11-23 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology Asma Ben Abacha et.al. 2311.13752 null
2023-11-22 Medical Image Retrieval Using Pretrained Embeddings Farnaz Khun Jush et.al. 2311.13547 null
2023-11-22 Applications of Spiking Neural Networks in Visual Place Recognition Somayeh Hussaini et.al. 2311.13186 link
2023-11-21 Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval Xiu-Shen Wei et.al. 2311.12894 null
2023-11-19 From Categories to Classifier: Name-Only Continual Learning by Exploring the Web Ameya Prabhu et.al. 2311.11293 null
2023-11-18 Lesion Search with Self-supervised Learning Kristin Qi et.al. 2311.11014 null
2023-11-15 Flow reconstruction and particle characterization from inertial Lagrangian tracks Ke Zhou et.al. 2311.09076 null
2023-11-15 Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval Junyang Chen et.al. 2311.07622 null
2023-11-13 VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search Shuting He et.al. 2311.07514 null
2023-11-10 Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval Xin Lu et.al. 2311.06067 null
2023-11-08 Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model Junya Shiraishi et.al. 2311.04788 null
2023-11-08 Training CLIP models on Data from Scientific Papers Calvin Metzger et.al. 2311.04711 link
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098 link
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345 null
2023-11-06 FocusTune: Tuning Visual Localization through Focus-Guided Sampling Son Tung Nguyen et.al. 2311.02872 link
2023-11-01 DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing Gaoshuang Huang et.al. 2311.00230 null
2023-10-29 Identifiable Contrastive Learning with Automatic Feature Importance Discovery Qi Zhang et.al. 2310.18904 link
2023-10-27 LipSim: A Provably Robust Perceptual Similarity Metric Sara Ghazanfari et.al. 2310.18274 link
2023-10-27 Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation Susu Fang et.al. 2310.17879 null
2023-10-25 FoundLoc: Vision-based Onboard Aerial Localization in the Wild Yao He et.al. 2310.16299 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504 null
2023-10-23 Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval Xu Yuan et.al. 2310.14637 link
2023-10-21 Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation Anastasia Kritharoula et.al. 2310.14025 link
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-20 CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants Shaoan Wang et.al. 2310.13320 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-01-02 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793 null
2023-12-11 VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Jian Shi et.al. 2312.08871 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 null
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592 null
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281 null
2023-11-29 Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features Thomas Wimmer et.al. 2311.18113 null
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024 null
2023-11-28 Riemannian Self-Attention Mechanism for SPD Networks Rui Wang et.al. 2311.16738 null
2023-11-27 A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor Jialin Liu et.al. 2311.15609 null
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291 null
2023-11-20 CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement Boni Hu et.al. 2311.11604 link
2023-11-17 Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration Paul J. Claasen et.al. 2311.10361 null
2023-11-13 Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning Tomáš Kunzo et.al. 2311.07398 null
2023-11-11 CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer Haoyu Ma et.al. 2311.06443 null
2023-11-08 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud Jianchao Ci et.al. 2311.04699 null
2023-11-06 TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains Alexander Naumann et.al. 2311.03124 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842 null
2023-10-20 Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification Mateus Roder et.al. 2310.13490 null
2023-10-12 UniPose: Detecting Any Keypoints Jie Yang et.al. 2310.08530 link
2023-10-10 l-dyno: framework to learn consistent visual features using robot’s motion Kartikeya Singh et.al. 2310.06249 null
2023-10-10 Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face Hao Zhang et.al. 2310.05056 null
2023-10-13 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 link
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link

2023-12

Pose Estimation

Publish Date Title Authors PDF Code
2024-01-07 RHOBIN Challenge: Reconstruction of Human Object Interaction Xianghui Xie et.al. 2401.04143 null
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914 null
2024-01-07 Big Data and Deep Learning in Smart Cities: A Comprehensive Dataset for AI-Driven Traffic Accident Detection and Computer Vision Systems Victor Adewopo et.al. 2401.03587 null
2024-01-04 Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications Darshan Venkatrayappa et.al. 2401.02383 null
2024-01-04 Fit-NGP: Fitting Object Models to Neural Graphics Primitives Marwan Taher et.al. 2401.02357 null
2024-01-04 PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DOF Object Pose Dataset Generation Lukas Meyer et.al. 2401.02281 null
2024-01-03 Real-Time Human Fall Detection using a Lightweight Pose Estimation Technique Ekram Alam et.al. 2401.01587 null
2024-01-05 PLE-SLAM: A Visual-Inertial SLAM Based on Point-Line Features and Efficient IMU Initialization Jiaming He et.al. 2401.01081 link
2023-12-30 3D Human Pose Perception from Egocentric Stereo Videos Hiroyasu Akada et.al. 2401.00889 null
2024-01-01 Geometry Depth Consistency in RGBD Relative Pose Estimation Sourav Kumar et.al. 2401.00639 null
2023-12-30 A comprehensive framework for occluded human pose estimation Linhao Xu et.al. 2401.00155 null
2024-01-02 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-29 MURP: Multi-Agent Ultra-Wideband Relative Pose Estimation with Constrained Communications in 3D Environments Andrew Fishberg et.al. 2312.17731 null
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250 link
2023-12-28 EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion Jianping Jiang et.al. 2312.16933 null
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800 link
2023-12-28 L-LO: Enhancing Pose Estimation Precision via a Landmark-Based LiDAR Odometry Feiya Li et.al. 2312.16787 null
2023-12-27 HMP: Hand Motion Priors for Pose and Shape Estimation from Video Enes Duran et.al. 2312.16737 null
2023-12-27 Camera calibration for the surround-view system: a benchmark and dataset L Qin et.al. 2312.16499 null
2023-12-24 TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions Rohit Lal et.al. 2312.16221 null
2023-12-26 Graph Context Transformation Learning for Progressive Correspondence Pruning Junwen Guo et.al. 2312.15971 null
2023-12-25 Lifting by Image – Leveraging Image Cues for Accurate 3D Human Pose Estimation Feng Zhou et.al. 2312.15636 null
2023-12-25 APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond Yuxiang Yang et.al. 2312.15612 null
2023-12-23 PACE: Pose Annotations in Cluttered Environments Yang You et.al. 2312.15130 link
2023-12-22 PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF Mohsen Gholami et.al. 2312.14915 link
2023-12-22 Harnessing Diffusion Models for Visual Perception with Meta Prompts Qiang Wan et.al. 2312.14733 link
2023-12-22 Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization Joaquin Rodriguez et.al. 2312.14697 null
2023-12-22 PoseViNet: Distracted Driver Action Recognition Framework Using Multi-View Pose Estimation and Vision Transformer Neha Sengar et.al. 2312.14577 null
2023-12-22 Scalable 3D Reconstruction From Single Particle X-Ray Diffraction Images Based on Online Machine Learning Jay Shenoy et.al. 2312.14432 null
2023-12-21 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera Christen Millerdurai et.al. 2312.14157 null
2023-12-21 DUSt3R: Geometric 3D Vision Made Easy Shuzhe Wang et.al. 2312.14132 null
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-20 Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach Habib Boloorchi Tabrizi et.al. 2312.13162 null
2023-12-18 Unified framework for diffusion generative models in SO(3): applications in computer vision and astrophysics Yesukhei Jagvaral et.al. 2312.11707 null
2023-12-18 Underwater Robot Pose Estimation Using Acoustic Methods and Intermittent Position Measurements at the Surface Vicu-Mihalis Maer et.al. 2312.11401 null
2023-12-17 SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation Xiaoqi An et.al. 2312.10758 link
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649 null
2023-12-15 SoloPose: One-Shot Kinematic 3D Human Pose Estimation with Video Data Augmentation David C. Jeong et.al. 2312.10195 null
2023-12-14 iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching Yuan Sun et.al. 2312.09031 null
2023-12-14 Scene 3-D Reconstruction System in Scattering Medium Zhuoyifan Zhang et.al. 2312.09005 null
2023-12-14 CattleEyeView: A Multi-task Top-down View Cattle Dataset for Smarter Precision Livestock Farming Kian Eng Ong et.al. 2312.08764 link
2023-12-20 PnP for Two-Dimensional Pose Estimation Joshua Wang et.al. 2312.08488 null
2023-12-13 Pose and shear-based tactile servoing John Lloyd et.al. 2312.08411 null
2023-12-13 FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects Bowen Wen et.al. 2312.08344 null
2023-12-13 Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation Arul Selvam Periyasamy et.al. 2312.08268 null
2023-12-13 CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation Eugenio Chisari et.al. 2312.08240 null
2023-12-13 C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation Florian Fervers et.al. 2312.08060 null
2023-12-13 Three-Filters-to-Normal+: Revisiting Discontinuity Discrimination in Depth-to-Normal Translation Jingwei Yang et.al. 2312.07964 null
2023-12-13 Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users Tianxun Zhou et.al. 2312.07854 null
2023-12-12 RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation Peng Lu et.al. 2312.07526 link
2023-12-12 COLMAP-Free 3D Gaussian Splatting Yang Fu et.al. 2312.07504 null
2023-12-12 RMS: Redundancy-Minimizing Point Cloud Sampling for Real-Time Pose Estimation in Degenerated Environments Pavel Petracek et.al. 2312.07337 link
2023-12-12 Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs Sunghwan Hong et.al. 2312.07246 link
2023-12-12 Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation Yuchen Yang et.al. 2312.07051 null
2023-12-12 Towards Enhanced Human Activity Recognition through Natural Language Generation and Pose Estimation Nikhil Kashyap et.al. 2312.06965 null
2023-12-12 Exploring Novel Object Recognition and Spontaneous Location Recognition Machine Learning Analysis Techniques in Alzheimer’s Mice Soham Bafana et.al. 2312.06914 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 null
2023-12-11 Improving the Robustness of 3D Human Pose Estimation: A Benchmark and Learning from Noisy Input Trung-Hieu Hoang et.al. 2312.06797 null
2023-12-11 3D Hand Pose Estimation in Egocentric Images in the Wild Aditya Prakash et.al. 2312.06583 null
2023-12-11 PointVoxel: A Simple and Effective Pipeline for Multi-View Multi-Modal 3D Human Pose Estimation Zhiyu Pan et.al. 2312.06409 null
2023-12-11 ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation Cédric Rommel et.al. 2312.06386 null
2023-12-10 From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation Javier Tirado-Garín et.al. 2312.05995 link
2023-12-09 You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception Sheng Jin et.al. 2312.05525 null
2023-12-07 Image and AIS Data Fusion Technique for Maritime Computer Vision Applications Emre Gülsoylu et.al. 2312.05270 null
2023-12-07 Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection Kohei Yamashita et.al. 2312.04527 null
2023-12-07 Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images Yiqun Zhang et.al. 2312.04236 null
2023-12-06 Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning Xinshun Wang et.al. 2312.03703 link
2023-12-06 Cooperative Probabilistic Trajectory Forecasting under Occlusion Anshul Nayak et.al. 2312.03296 null
2023-12-05 A Unified Simulation Framework for Visual and Behavioral Fidelity in Crowd Analysis Niccolò Bisagno et.al. 2312.02613 null
2023-12-05 6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation K. Samarawickrama et.al. 2312.02593 link
2023-12-05 PolyFit: A Peg-in-hole Assembly Framework for Unseen Polygon Shapes via Sim-to-real Adaptation Geonhyup Lee et.al. 2312.02531 null
2023-12-04 GenEM: Physics-Informed Generative Cryo-Electron Microscopy Jiakai Zhang et.al. 2312.02235 null
2023-12-02 Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors Yu Zhang et.al. 2312.02196 null
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141 null
2023-12-04 SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM Nikhil Keetha et.al. 2312.02126 null
2023-12-04 Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection Xubin Zhong et.al. 2312.01713 null
2023-12-05 Hulk: A Universal Knowledge Translator for Human-Centric Tasks Yizhou Wang et.al. 2312.01697 link
2023-12-04 Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks Yan Xu et.al. 2312.01561 null
2023-12-01 Object 6D pose estimation meets zero-shot learning Andrea Caraffa et.al. 2312.00947 null
2023-12-01 Open-vocabulary object 6D pose estimation Jaime Corsetti et.al. 2312.00690 null
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500 null
2023-12-01 Learning Unorthogonalized Matrices for Rotation Estimation Kerui Gu et.al. 2312.00462 null
2023-11-30 PoseGPT: Chatting about 3D Human Pose Yao Feng et.al. 2311.18836 null
2023-11-30 FoundPose: Unseen Object Pose Estimation with Foundation Features Evin Pınar Örnek et.al. 2311.18809 null
2023-11-30 Pose Estimation and Tracking for ASIST Ari Goodman et.al. 2311.18665 null
2023-11-29 A Stochastic-Geometrical Framework for Object Pose Estimation based on Mixture Models Avoiding the Correspondence Problem Wolfgang Hoegele et.al. 2311.18107 null
2023-11-29 Pose Anything: A Graph-Based Approach for Category-Agnostic Pose Estimation Or Hirschorn et.al. 2311.17891 link
2023-11-29 Cinematic Behavior Transfer via NeRF-based Differentiable Filming Xuekun Jiang et.al. 2311.17754 null
2023-11-29 PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens Sebastian Stapf et.al. 2311.17504 null

Visual Localization

Publish Date Title Authors PDF Code
2024-01-18 Cross-Modality Perturbation Synergy Attack for Person Re-identification Yunpeng Gong et.al. 2401.10090 null
2024-01-16 Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging Zahra Tabatabaei et.al. 2401.08272 null
2024-01-16 Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2401.08263 null
2024-01-15 Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing Jakob Hackstein et.al. 2401.07782 link
2024-01-14 HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval Zexuan Qiu et.al. 2401.07212 null
2024-01-11 UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization Rouwan Wu et.al. 2401.05971 null
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860 null
2024-01-05 Benchmarking PathCLIP for Pathology Image Analysis Sunyi Zheng et.al. 2401.02651 null
2024-01-02 BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving Dafeng Wei et.al. 2401.01065 null
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371 link
2023-12-29 Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering Long-Kun Du et.al. 2401.00032 null
2023-12-27 LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization Sai Shubodh Puligilla et.al. 2312.16648 null
2023-12-26 Recursive Distillation for Open-Set Distributed Robot Localization Kenta Tsukahara et.al. 2312.15897 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242 null
2023-12-20 Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2312.12995 null
2023-12-19 VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Chun-Mei Feng et.al. 2312.12273 link
2023-12-18 Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback Boaz Lerner et.al. 2312.11078 link
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649 null
2023-12-17 DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition Sijie Wang et.al. 2312.10616 link
2023-12-16 Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval Decheng Liu et.al. 2312.10320 link
2023-12-15 Data-Efficient Multimodal Fusion on a Single GPU Noël Vouitsis et.al. 2312.10144 link
2023-12-13 Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques Hamed Qazanfari et.al. 2312.10089 null
2023-12-15 Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval Zhe Ma et.al. 2312.09716 link
2023-12-14 Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition Oliver Grainge et.al. 2312.09028 null
2023-12-14 Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking Shitong Sun et.al. 2312.08924 null
2023-12-13 C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation Florian Fervers et.al. 2312.08060 null
2023-12-12 Contextually Affinitive Neighborhood Refinery for Deep Clustering Chunlin Yu et.al. 2312.07806 link
2023-12-12 Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval Qiwei Tian et.al. 2312.07364 null
2023-12-11 Dynamic Weighted Combiner for Mixed-Modal Image Retrieval Fuxiang Huang et.al. 2312.06179 null
2023-12-06 Lite-Mind: Towards Efficient and Versatile Brain Representation Network Zixuan Gong et.al. 2312.03781 null
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428 link
2023-12-04 Implicit Learning of Scene Geometry from Poses for Global Localization Mohammad Altillawi et.al. 2312.02029 null
2023-12-04 Language-only Efficient Training of Zero-shot Composed Image Retrieval Geonmo Gu et.al. 2312.01998 link
2023-12-03 G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu et.al. 2312.01522 null
2023-12-01 Improve Supervised Representation Learning with Masked Image Modeling Kaifeng Chen et.al. 2312.00950 null
2023-12-05 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers Walid Bousselham et.al. 2312.00878 link
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500 null
2023-11-30 HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance Zhuohao Yin et.al. 2311.18273 link
2023-11-30 Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models Raviteja Vemulapalli et.al. 2311.18237 null
2023-11-29 Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce Chang Liu et.al. 2311.17954 null
2023-11-28 Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames Chao Chen et.al. 2311.17940 null
2023-11-29 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries Huajian Huang et.al. 2311.17389 null
2023-11-27 Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation Samuele Poppi et.al. 2311.16254 link
2023-11-27 Optimal Transport Aggregation for Visual Place Recognition Sergio Izquierdo et.al. 2311.15937 link
2023-11-27 AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval Shicheng Xu et.al. 2311.14084 null
2023-11-23 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology Asma Ben Abacha et.al. 2311.13752 null

Keypoint Detection

Publish Date Title Authors PDF Code
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742 null
2024-01-02 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793 null
2023-12-11 VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Jian Shi et.al. 2312.08871 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 null
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592 null
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281 null
2023-11-29 Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features Thomas Wimmer et.al. 2311.18113 null
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024 null
2023-11-28 Riemannian Self-Attention Mechanism for SPD Networks Rui Wang et.al. 2311.16738 null
2023-11-27 A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor Jialin Liu et.al. 2311.15609 null
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291 null
2023-11-20 CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement Boni Hu et.al. 2311.11604 link
2023-11-17 Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration Paul J. Claasen et.al. 2311.10361 null
2023-11-13 Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning Tomáš Kunzo et.al. 2311.07398 null

2024-1

Pose Estimation

Publish Date Title Authors PDF Code
2024-02-05 A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model Murad Hasan et.al. 2402.03417 null
2024-02-05 SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM Mingrui Li et.al. 2402.03246 null
2024-02-05 Extreme Two-View Geometry From Object Poses with Diffusion Models Yujing Sun et.al. 2402.02800 link
2024-02-04 Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation Ti Wang et.al. 2402.02339 null
2024-02-01 mmID: High-Resolution mmWave Imaging for Human Identification Sakila S. Jayaweera et.al. 2402.00996 null
2024-02-01 In-Bed Pose Estimation: A Review Ziya Ata Yazıcı et.al. 2402.00700 null
2024-02-01 WayFASTER: a Self-Supervised Traversability Prediction for Increased Navigation Awareness Mateus Valverde Gasparino et.al. 2402.00683 null
2024-02-02 CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration Daniele Cattaneo et.al. 2402.00129 null
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-01-30 Navigating the Unknown: Uncertainty-Aware Compute-in-Memory Autonomy of Edge Robotics Nastaran Darabi et.al. 2401.17481 null
2024-01-30 MESA: Matching Everything by Segmenting Anything Yesheng Zhang et.al. 2401.16741 null
2024-01-30 Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers Jianbin Jiao et.al. 2401.16700 null
2024-01-29 Leveraging Positional Encoding for Robust Multi-Reference-Based Object 6D Pose Estimation Jaewoo Park et.al. 2401.16284 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-28 Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras Yu-Jhe Li et.al. 2401.15616 null
2024-01-30 Multi-Robot Relative Pose Estimation in SE(2) with Observability Analysis: A Comparison of Extended Kalman Filtering and Robust Pose Graph Optimization Kihoon Shin et.al. 2401.15313 null
2024-01-26 Adaptive Deep Learning for Efficient Visual Pose Estimation aboard Ultra-low-power Nano-drones Beatrice Alessandra Motetti et.al. 2401.15236 null
2024-01-26 SimpleEgo: Predicting Probabilistic Body Pose from Egocentric Cameras Hanz Cuevas-Velasquez et.al. 2401.14785 null
2024-01-24 Synthetic data enables faster annotation and robust segmentation for multi-object grasping in clutter Dongmyoung Lee et.al. 2401.13405 null
2024-01-24 Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry Qi Cai et.al. 2401.13357 null
2024-01-23 SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization Mingyang Li et.al. 2401.13076 link
2024-01-24 RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos Hongchi Xia et.al. 2401.12592 null
2024-01-26 MobileARLoc: On-device Robust Absolute Localisation for Pervasive Markerless Mobile AR Changkun Liu et.al. 2401.11511 null
2024-01-19 SCENES: Subpixel Correspondence Estimation With Epipolar Supervision Dominik A. Kloepfer et.al. 2401.10886 null
2024-01-19 Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation Prakhar Kaushik et.al. 2401.10848 null
2024-01-22 TEXterity: Tactile Extrinsic deXterity Antonia Bronars et.al. 2401.10230 null
2024-01-18 Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework Junkun Jiang et.al. 2401.09836 null
2024-01-17 DK-SLAM: Monocular Visual SLAM with Deep Keypoints Adaptive Learning, Tracking and Loop-Closing Hao Qu et.al. 2401.09160 null
2024-01-17 PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency Yue Pan et.al. 2401.09101 link
2024-01-16 AdaSem: Adaptive Goal-Oriented Semantic Communications for End-to-End Camera Relocalization Qi Liao et.al. 2401.08360 null
2024-01-16 S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera Thanh Nguyen Canh et.al. 2401.08134 null
2024-01-15 Collaboratively Self-supervised Video Representation Learning for Action Recognition Jie Zhang et.al. 2401.07584 null
2024-01-14 3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework Fan Zhang et.al. 2401.07251 null
2024-01-11 On the representation and methodology for wide and short range head pose estimation Alejandro Cobo et.al. 2401.05807 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236 link
2024-01-10 Video-based Automatic Lameness Detection of Dairy Cows using Pose Estimation and Multiple Locomotion Traits Helena Russello et.al. 2401.05202 null
2024-01-10 Diffusion-based Pose Refinement and Muti-hypothesis Generation for 3D Human Pose Estimaiton Hongbo Kang et.al. 2401.04921 null
2024-01-07 RHOBIN Challenge: Reconstruction of Human Object Interaction Xianghui Xie et.al. 2401.04143 null
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914 null
2024-01-07 Big Data and Deep Learning in Smart Cities: A Comprehensive Dataset for AI-Driven Traffic Accident Detection and Computer Vision Systems Victor Adewopo et.al. 2401.03587 null
2024-01-04 Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications Darshan Venkatrayappa et.al. 2401.02383 null
2024-01-04 Fit-NGP: Fitting Object Models to Neural Graphics Primitives Marwan Taher et.al. 2401.02357 null
2024-01-04 PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DOF Object Pose Dataset Generation Lukas Meyer et.al. 2401.02281 null
2024-01-03 Real-Time Human Fall Detection using a Lightweight Pose Estimation Technique Ekram Alam et.al. 2401.01587 null
2024-01-05 PLE-SLAM: A Visual-Inertial SLAM Based on Point-Line Features and Efficient IMU Initialization Jiaming He et.al. 2401.01081 link
2023-12-30 3D Human Pose Perception from Egocentric Stereo Videos Hiroyasu Akada et.al. 2401.00889 null
2024-01-01 Geometry Depth Consistency in RGBD Relative Pose Estimation Sourav Kumar et.al. 2401.00639 null
2023-12-30 A comprehensive framework for occluded human pose estimation Linhao Xu et.al. 2401.00155 null
2024-01-02 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-29 MURP: Multi-Agent Ultra-Wideband Relative Pose Estimation with Constrained Communications in 3D Environments Andrew Fishberg et.al. 2312.17731 null
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250 link
2023-12-28 EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion Jianping Jiang et.al. 2312.16933 null
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800 link
2023-12-28 L-LO: Enhancing Pose Estimation Precision via a Landmark-Based LiDAR Odometry Feiya Li et.al. 2312.16787 null

Visual Localization

Publish Date Title Authors PDF Code
2024-02-14 Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency Yannis Kalantidis et.al. 2402.09237 null
2024-02-13 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Xiangming Gu et.al. 2402.08567 link
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359 link
2024-02-10 Semantic Object-level Modeling for Robust Visual Camera Relocalization Yifan Zhu et.al. 2402.06951 null
2024-02-09 Large Language Models for Captioning and Retrieving Remote Sensing Images João Daniel Silva et.al. 2402.06475 null
2024-02-09 PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes Xinggang Hu et.al. 2402.06131 null
2024-02-04 Region-Based Representations Revisited Michal Shlapentokh-Rothman et.al. 2402.02352 null
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141 null
2024-02-01 Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering Tianxiao Gao et.al. 2402.00330 link
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459 null
2024-01-29 Cross-Modal Coordination Across a Diverse Set of Input Modalities Jorge Sánchez et.al. 2401.16347 null
2024-01-29 Regressing Transformers for Data-efficient Visual Place Recognition María Leyva-Vallina et.al. 2401.16304 null
2024-01-27 Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval Ayush Dubey et.al. 2401.15362 null
2024-01-24 Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613 null
2024-01-23 PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion Shyam Sundar Kannan et.al. 2401.13082 null
2024-01-23 SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization Mingyang Li et.al. 2401.13076 link
2024-01-25 CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios Xiangshuo Qiao et.al. 2401.10475 link
2024-01-19 PhotoScout: Synthesis-Powered Multi-Modal Image Search Celeste Barnaby et.al. 2401.10464 null
2024-01-19 Cross-Modality Perturbation Synergy Attack for Person Re-identification Yunpeng Gong et.al. 2401.10090 null
2024-01-16 Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging Zahra Tabatabaei et.al. 2401.08272 null
2024-01-16 Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2401.08263 null
2024-01-15 Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing Jakob Hackstein et.al. 2401.07782 link
2024-01-14 HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval Zexuan Qiu et.al. 2401.07212 null
2024-01-11 UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization Rouwan Wu et.al. 2401.05971 null
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860 null
2024-01-05 Benchmarking PathCLIP for Pathology Image Analysis Sunyi Zheng et.al. 2401.02651 null
2024-01-02 BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving Dafeng Wei et.al. 2401.01065 null
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371 link
2023-12-29 Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering Long-Kun Du et.al. 2401.00032 null
2023-12-27 LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization Sai Shubodh Puligilla et.al. 2312.16648 null
2023-12-26 Recursive Distillation for Open-Set Distributed Robot Localization Kenta Tsukahara et.al. 2312.15897 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242 null
2023-12-20 Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2312.12995 null
2023-12-19 VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Chun-Mei Feng et.al. 2312.12273 link
2023-12-18 Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback Boaz Lerner et.al. 2312.11078 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742 null
2024-03-22 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793 null
2023-12-11 VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Jian Shi et.al. 2312.08871 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 null
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592 null
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281 null

2024-2

Pose Estimation

Publish Date Title Authors PDF Code
2024-03-02 Single-image camera calibration with model-free distortion correction Katia Genovese et.al. 2403.01263 null
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110 null
2024-03-01 Optimal Robot Formations: Balancing Range-Based Observability and User-Defined Configurations Syed Shabbir Ahmed et.al. 2403.00988 null
2024-03-04 TEXterity – Tactile Extrinsic deXterity: Simultaneous Tactile Estimation and Control for Extrinsic Dexterity Sangwoon Kim et.al. 2403.00049 null
2024-03-01 Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach Sarina Thomas et.al. 2402.19062 null
2024-02-29 Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey Yang Liu et.al. 2402.18844 link
2024-02-28 Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting Taeho Kang et.al. 2402.18330 link
2024-02-28 Location-guided Head Pose Estimation for Fisheye Image Bing Li et.al. 2402.18320 null
2024-02-28 NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images Jingrui Yu et.al. 2402.18196 null
2024-02-28 Six-Point Method for Multi-Camera Systems with Reduced Solution Space Banglei Guan et.al. 2402.18066 null
2024-02-27 Real-Time Estimation of Relative Pose for UAVs Using a Dual-Channel Feature Association Zhaoying Wang et.al. 2402.17504 null
2024-02-26 HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields Haozhe Qi et.al. 2402.17062 link
2024-02-26 DRSI-Net: Dual-Residual Spatial Interaction Network for Multi-Person Pose Estimation Shang Wu et.al. 2402.16640 null
2024-02-26 GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video Xinqi Liu et.al. 2402.16607 null
2024-02-26 DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer Yizhe Wu et.al. 2402.16308 null
2024-02-25 XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras Arnav Mishra et.al. 2402.16175 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-24 CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge Xiao Lin et.al. 2402.15726 null
2024-02-23 Optimized Deployment of Deep Neural Networks for Visual Pose Estimation on Nano-drones Matteo Risso et.al. 2402.15273 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817 null
2024-02-22 S^2Former-OR: Single-Stage Bimodal Transformer for Scene Graph Generation in OR Jialun Pei et.al. 2402.14461 null
2024-02-22 VLPose: Bridging the Domain Gap in Pose Estimation with Language-Vision Tuning Jingyao Li et.al. 2402.14456 null
2024-02-22 Modeling 3D Infant Kinetics Using Adaptive Graph Convolutional Networks Daniel Holmberg et.al. 2402.14400 link
2024-02-22 Secure Navigation using Landmark-based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2402.14280 null
2024-02-21 SecurePose: Automated Face Blurring and Human Movement Kinematics Extraction from Videos Recorded in Clinical Settings Rishabh Bajpai et.al. 2402.14143 null
2024-02-21 High-throughput Visual Nano-drone to Nano-drone Relative Localization using Onboard Fully Convolutional Networks Luca Crupi et.al. 2402.13756 null
2024-02-21 EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization Zhendong Xiao et.al. 2402.13537 null
2024-02-20 DiffusionNOCS: Managing Symmetry and Uncertainty in Sim2Real Multi-Modal Category-level Pose Estimation Takuya Ikeda et.al. 2402.12647 null
2024-02-19 Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment Ganesh Sapkota et.al. 2402.12551 null
2024-02-18 Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency Training Huayi Zhou et.al. 2402.11566 link
2024-02-17 Enhancing Surgical Performance in Cardiothoracic Surgery with Innovations from Computer Vision and Artificial Intelligence: A Narrative Review Merryn D. Constable et.al. 2402.11288 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287 null
2024-02-16 Occlusion Resilient 3D Human Pose Estimation Soumava Kumar Roy et.al. 2402.11036 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885 null
2024-02-15 Lester: rotoscope animation through video object segmentation and tracking Ruben Tous et.al. 2402.09883 link
2024-02-15 Foul prediction with estimated poses from soccer broadcast video Jiale Fang et.al. 2402.09650 null
2024-02-16 IMUOptimize: A Data-Driven Approach to Optimal IMU Placement for Human Pose Estimation with Transformer Architecture Varun Ramani et.al. 2402.08923 null
2024-02-13 Are Semi-Dense Detector-Free Methods Good at Matching Local Features? Matthieu Vilain et.al. 2402.08671 null
2024-02-13 Gaussian-Sum Filter for Range-based 3D Relative Pose Estimation in the Presence of Ambiguities Syed S. Ahmed et.al. 2402.08566 null
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359 link
2024-02-12 Extending 3D body pose estimation for robotic-assistive therapies of autistic children Laura Santos et.al. 2402.08006 null
2024-02-12 GBOT: Graph-Based 3D Object Tracking for Augmented Reality-Assisted Assembly Guidance Shiyu Li et.al. 2402.07677 null
2024-02-12 UAV-assisted Visual SLAM Generating Reconstructed 3D Scene Graphs in GPS-denied Environments Ahmed Radwan et.al. 2402.07537 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436 null
2024-02-08 Real-time Holistic Robot Pose Estimation with Unknown States Shikun Ban et.al. 2402.05655 link
2024-02-08 Extending 6D Object Pose Estimators for Stereo Vision Thomas Pöllabauer et.al. 2402.05610 null
2024-02-09 NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction Zhongqun Zhang et.al. 2402.05532 null
2024-02-07 Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training Thomas Pöllabauer et.al. 2402.04979 null
2024-02-07 4-Dimensional deformation part model for pose estimation using Kalman filter constraints Enrique Martinez-Berti et.al. 2402.04953 null
2024-02-07 STAR: Shape-focused Texture Agnostic Representations for Improved Object Detection and 6D Pose Estimation Peter Hönig et.al. 2402.04878 null
2024-02-05 A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model Murad Hasan et.al. 2402.03417 null
2024-02-05 SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM Mingrui Li et.al. 2402.03246 null
2024-02-05 Extreme Two-View Geometry From Object Poses with Diffusion Models Yujing Sun et.al. 2402.02800 link
2024-02-04 Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation Ti Wang et.al. 2402.02339 null
2024-02-01 mmID: High-Resolution mmWave Imaging for Human Identification Sakila S. Jayaweera et.al. 2402.00996 null
2024-02-01 In-Bed Pose Estimation: A Review Ziya Ata Yazıcı et.al. 2402.00700 null
2024-02-01 WayFASTER: a Self-Supervised Traversability Prediction for Increased Navigation Awareness Mateus Valverde Gasparino et.al. 2402.00683 null
2024-02-02 CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration Daniele Cattaneo et.al. 2402.00129 null
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-01-30 Navigating the Unknown: Uncertainty-Aware Compute-in-Memory Autonomy of Edge Robotics Nastaran Darabi et.al. 2401.17481 null
2024-01-30 MESA: Matching Everything by Segmenting Anything Yesheng Zhang et.al. 2401.16741 null
2024-01-30 Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers Jianbin Jiao et.al. 2401.16700 null
2024-01-29 Leveraging Positional Encoding for Robust Multi-Reference-Based Object 6D Pose Estimation Jaewoo Park et.al. 2401.16284 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-28 Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras Yu-Jhe Li et.al. 2401.15616 null

Visual Localization

Publish Date Title Authors PDF Code
2024-03-08 LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map Xinrui Wu et.al. 2403.05002 null
2024-03-07 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765 null
2024-03-06 Self-supervised Photographic Image Layout Representation Learning Zhaoran Zhao et.al. 2403.03740 link
2024-03-04 Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models Benedikt Blumenstiel et.al. 2403.02059 link
2024-03-03 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval Yongchao Du et.al. 2403.01431 null
2024-03-01 Asymmetric Feature Fusion for Image Retrieval Hui Wu et.al. 2403.00671 null
2024-03-01 Structure Similarity Preservation Learning for Asymmetric Image Retrieval Hui Wu et.al. 2403.00648 link
2024-02-29 CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition Feng Lu et.al. 2402.19231 link
2024-02-28 Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport Bin Li et.al. 2402.18411 link
2024-02-28 Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning Hanyao Wang et.al. 2402.18400 null
2024-02-28 Representing 3D sparse map points and lines for camera relocalization Bach-Thuan Bui et.al. 2402.18011 link
2024-02-27 Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control Thong Nguyen et.al. 2402.17535 link
2024-02-29 Active propulsion noise shaping for multi-rotor aircraft localization Gabriele Serussi et.al. 2402.17289 link
2024-02-27 NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer Bingxi Liu et.al. 2402.17159 null
2024-02-25 Deep Homography Estimation for Visual Place Recognition Feng Lu et.al. 2402.16086 link
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-28 Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries Zijun Long et.al. 2402.15276 null
2024-02-23 Fine-tuning CLIP Text Encoders with Two-step Paraphrasing Hyunjae Kim et.al. 2402.15120 null
2024-02-22 Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition Feng Lu et.al. 2402.14505 link
2024-02-16 Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition Chenming Hu et.al. 2402.10476 null
2024-02-15 Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task Mirko Nava et.al. 2402.09886 link
2024-02-14 Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency Yannis Kalantidis et.al. 2402.09237 null
2024-02-13 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Xiangming Gu et.al. 2402.08567 link
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359 link
2024-02-10 Semantic Object-level Modeling for Robust Visual Camera Relocalization Yifan Zhu et.al. 2402.06951 null
2024-02-09 Large Language Models for Captioning and Retrieving Remote Sensing Images João Daniel Silva et.al. 2402.06475 null
2024-02-09 PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes Xinggang Hu et.al. 2402.06131 null
2024-02-04 Region-Based Representations Revisited Michal Shlapentokh-Rothman et.al. 2402.02352 null
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141 null
2024-02-01 Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering Tianxiao Gao et.al. 2402.00330 link
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459 null
2024-01-29 Cross-Modal Coordination Across a Diverse Set of Input Modalities Jorge Sánchez et.al. 2401.16347 null
2024-01-29 Regressing Transformers for Data-efficient Visual Place Recognition María Leyva-Vallina et.al. 2401.16304 null
2024-01-27 Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval Ayush Dubey et.al. 2401.15362 null
2024-01-24 Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613 null
2024-01-23 PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion Shyam Sundar Kannan et.al. 2401.13082 null
2024-01-23 SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization Mingyang Li et.al. 2401.13076 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742 null
2024-03-22 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793 null

2024-3

Pose Estimation

Publish Date Title Authors PDF Code
2024-04-05 ToolEENet: Tool Affordance 6D Pose Estimation Yunlong Wang et.al. 2404.04193 null
2024-04-04 SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation Sichen Chen et.al. 2404.03518 link
2024-04-04 Multi Positive Contrastive Learning with Pose-Consistent Generated Images Sho Inayoshi et.al. 2404.03256 null
2024-04-04 HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud Wencan Cheng et.al. 2404.03159 link
2024-04-03 Fusing Multi-sensor Input with State Information on TinyML Brains for Autonomous Nano-drones Luca Crupi et.al. 2404.02567 null
2024-04-03 Semi-Supervised Unconstrained Head Pose Estimation in the Wild Huayi Zhou et.al. 2404.02544 link
2024-04-02 3D Congealing: 3D-Aware Image Alignment in the Wild Yunzhi Zhang et.al. 2404.02125 null
2024-04-02 SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation Vinkle Srivastav et.al. 2404.02041 null
2024-04-01 Marrying NeRF with Feature Matching for One-step Pose Estimation Ronghan Chen et.al. 2404.00891 null
2024-03-31 Graph-Based vs. Error State Kalman Filter-Based Fusion Of 5G And Inertial Data For MAV Indoor Pose Estimation Meisam Kabiri et.al. 2404.00691 null
2024-03-31 OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos Dongyoung Choi et.al. 2404.00676 null
2024-04-02 KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation Jihua Peng et.al. 2404.00658 link
2024-03-29 FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model Molin Zhang et.al. 2404.00132 null
2024-03-29 Latent Embedding Clustering for Occlusion Robust Head Pose Estimation José Celestino et.al. 2403.20251 null
2024-03-29 A Unified Framework for Human-centric Point Cloud Video Understanding Yiteng Xu et.al. 2403.20031 null
2024-04-01 Video-Based Human Pose Regression via Decoupled Space-Time Aggregation Jijie He et.al. 2403.19926 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-26 Mathematical Foundation and Corrections for Full Range Head Pose Estimation Huei-Chung Hu et.al. 2403.18104 null
2024-03-26 EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation Chenhongyi Yang et.al. 2403.18080 null
2024-03-26 A Survey on 3D Egocentric Human Pose Estimation Md Mushfiqur Azam et.al. 2403.17893 null
2024-03-26 GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction Hrishav Bakul Barua et.al. 2403.17837 link
2024-03-26 DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions Sammy Christen et.al. 2403.17827 null
2024-03-26 System Calibration of a Field Phenotyping Robot with Multiple High-Precision Profile Laser Scanners Felix Esser et.al. 2403.17788 null
2024-03-25 Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos Remy Sabathier et.al. 2403.17103 null
2024-03-25 Characterisation of the Intel RealSense D415 Stereo Depth Camera for Motion-Corrected CT Perfusion Imaging Mahdieh Dashtbani Moghari et.al. 2403.16490 null
2024-03-25 Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects Zicong Fan et.al. 2403.16428 null
2024-03-25 A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups Yixiao Ge et.al. 2403.16411 null
2024-03-25 ASDF: Assembly State Detection Utilizing Late Fusion by Integrating 6D Pose Estimation Hannah Schieber et.al. 2403.16400 null
2024-03-24 KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments Abdelrahman Younes et.al. 2403.16238 null
2024-03-24 Diffusion Model is a Good Pose Estimator from 3D RF-Vision Junqiao Fan et.al. 2403.16198 null
2024-03-23 UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation Yuliang Guo et.al. 2403.15705 null
2024-03-22 InterFusion: Text-Driven Generation of 3D Human-Object Interaction Sisi Dai et.al. 2403.15612 null
2024-03-22 Augmented Reality Warnings in Roadway Work Zones: Evaluating the Effect of Modality on Worker Reaction Times Sepehr Sabeti et.al. 2403.15571 null
2024-03-22 Gesture-Controlled Aerial Robot Formation for Human-Swarm Interaction in Safety Monitoring Applications Vít Krátký et.al. 2403.15333 null
2024-03-22 WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization Jialu Wang et.al. 2403.15272 null
2024-03-22 DITTO: Demonstration Imitation by Trajectory Transformation Nick Heppert et.al. 2403.15203 null
2024-03-22 Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning Bumsoo Kim et.al. 2403.15048 null
2024-03-22 Trajectory Regularization Enhances Self-Supervised Geometric Representation Jiayun Wang et.al. 2403.14973 null
2024-03-21 VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding Ahmad Mahmood et.al. 2403.14743 null
2024-03-21 Visibility-Aware Keypoint Localization for 6DoF Object Pose Estimation Ruyi Lian et.al. 2403.14559 null
2024-03-23 Exploring 3D Human Pose Estimation and Forecasting from the Robot’s Perspective: The HARPER Dataset Andrea Avogaro et.al. 2403.14447 null
2024-03-21 Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests Haedam Oh et.al. 2403.14326 null
2024-03-21 Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation Francesco Di Felice et.al. 2403.14279 null
2024-03-20 DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses Chen Zhao et.al. 2403.13683 link
2024-03-20 Meta-Point Learning and Refining for Category-Agnostic Pose Estimation Junjie Chen et.al. 2403.13647 link
2024-03-20 Advancing 6D Pose Estimation in Augmented Reality – Overcoming Projection Ambiguity with Uncontrolled Imagery Mayura Manawadu et.al. 2403.13434 null
2024-03-20 DOR3D-Net: Dense Ordinal Regression Network for 3D Hand Pose Estimation Yamin Mao et.al. 2403.13405 null
2024-03-20 ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics Qiaojun Yu et.al. 2403.13365 null
2024-03-20 MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination Weiying Wang et.al. 2403.13348 null
2024-03-19 FaceXFormer: A Unified Transformer for Facial Analysis Kartik Narayan et.al. 2403.12960 null
2024-03-19 WHAC: World-grounded Humans and Cameras Wanqi Yin et.al. 2403.12959 null
2024-03-19 Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation Jingtao Sun et.al. 2403.12728 link
2024-03-19 IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model Matteo Bortolon et.al. 2403.12682 null
2024-03-19 In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing Mingrui Yu et.al. 2403.12676 null
2024-03-19 Self-learning Canonical Space for Multi-view 3D Human Pose Estimation Xiaoben Li et.al. 2403.12440 null
2024-03-20 Human Mesh Recovery from Arbitrary Multi-view Images Xiaoben Li et.al. 2403.12434 null
2024-03-19 XPose: eXplainable Human Pose Estimation Luyu Qiu et.al. 2403.12370 null
2024-03-18 HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data Mengqi Zhang et.al. 2403.12011 null
2024-03-18 Normalized Validity Scores for DNNs in Regression based Eye Feature Extraction Wolfgang Fuhl et.al. 2403.11665 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627 link
2024-03-18 GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects Sungphill Moon et.al. 2403.11510 null
2024-03-17 A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation Qucheng Peng et.al. 2403.11310 null
2024-03-17 Compact 3D Gaussian Splatting For Dense Visual SLAM Tianchen Deng et.al. 2403.11247 null
2024-03-16 Robotic Task Success Evaluation Under Multi-modal Non-Parametric Object Pose Uncertainty Lakshadeep Naik et.al. 2403.10874 null
2024-03-16 DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation Christopher Kolios et.al. 2403.10773 null
2024-03-15 GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation Dingding Cai et.al. 2403.10683 null
2024-03-15 CLOSURE: Fast Quantification of Pose Uncertainty Sets Yihuai Gao et.al. 2403.09990 null
2024-03-14 ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Image Fangqiang Ding et.al. 2403.09871 null
2024-03-14 BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects Tomas Hodan et.al. 2403.09799 null
2024-03-14 Scalable Autonomous Drone Flight in the Forest with Visual-Inertial SLAM and Dense Submaps Built without LiDAR Sebastián Barbas Laina et.al. 2403.09596 null
2024-03-14 Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting Pawel Knap et.al. 2403.09437 null
2024-03-14 LM2D: Lyrics- and Music-Driven Dance Synthesis Wenjie Yin et.al. 2403.09407 null
2024-03-14 SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios Ding-Tao Huang et.al. 2403.09317 link
2024-03-14 MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion Arul Selvam Periyasamy et.al. 2403.09309 null
2024-03-13 Data Augmentation in Human-Centric Vision Wentao Jiang et.al. 2403.08650 null
2024-03-15 PRAGO: Differentiable Multi-View Pose Optimization From Objectness Detections Matteo Taiana et.al. 2403.08586 null
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 null
2024-03-12 Q-SLAM: Quadric Representations for Monocular SLAM Chensheng Peng et.al. 2403.08125 null
2024-03-12 MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation Yuelong Li et.al. 2403.08019 null
2024-03-12 Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation Kira Wursthorn et.al. 2403.07741 null
2024-03-12 Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving JunDa Cheng et.al. 2403.07535 null
2024-03-12 Category-Agnostic Pose Estimation for Point Clouds Bowen Liu et.al. 2403.07437 null
2024-03-12 Monocular Microscope to CT Registration using Pose Estimation of the Incus for Augmented Reality Cochlear Implant Surgery Yike Zhang et.al. 2403.07219 null
2024-03-11 Real-Time Simulated Avatar from Head-Mounted Sensors Zhengyi Luo et.al. 2403.06862 null
2024-03-11 Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition Erkut Akdag et.al. 2403.06577 null
2024-03-10 Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation Paweł A. Pierzchlewicz et.al. 2403.06164 link
2024-03-10 Diffusion Models Trained with Large Data Are Transferable Visual Models Guangkai Xu et.al. 2403.06090 null
2024-03-08 Prepared for the Worst: A Learning-Based Adversarial Attack for Resilience Analysis of the ICP Algorithm Ziyu Zhang et.al. 2403.05666 null
2024-03-11 Exploiting polar symmetry in designing equivariant observers for vision-based motion estimation Tarek Bouazza et.al. 2403.05450 null
2024-03-07 Real-Time Planning Under Uncertainty for AUVs Using Virtual Maps Ivana Collado-Gonzalez et.al. 2403.04936 null
2024-03-07 That’s My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation Georgi Pramatarov et.al. 2403.04755 null
2024-03-07 Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser Qingyuan Cai et.al. 2403.04444 null
2024-03-09 Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation Ruicong Liu et.al. 2403.04381 null
2024-03-05 FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation Chris Rockwell et.al. 2403.03221 null
2024-03-05 NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors Yannan He et.al. 2403.03122 null
2024-03-05 Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection Mohamed Afifi et.al. 2403.03111 null
2024-03-05 Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps Timothy Chen et.al. 2403.02751 null
2024-03-04 PowerSkel: A Device-Free Framework Using CSI Signal for Human Skeleton Estimation in Power Station Cunyi Yin et.al. 2403.01913 link
2024-03-04 A Simple Baseline for Efficient Hand Mesh Reconstruction Zhishan Zhou et.al. 2403.01813 null
2024-03-03 MatchU: Matching Unseen Objects for 6D Pose Estimation from RGB-D Images Junwen Huang et.al. 2403.01517 null
2024-03-02 Single-image camera calibration with model-free distortion correction Katia Genovese et.al. 2403.01263 null
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110 null
2024-03-01 Optimal Robot Formations: Balancing Range-Based Observability and User-Defined Configurations Syed Shabbir Ahmed et.al. 2403.00988 null
2024-03-04 TEXterity – Tactile Extrinsic deXterity: Simultaneous Tactile Estimation and Control for Extrinsic Dexterity Sangwoon Kim et.al. 2403.00049 null
2024-03-01 Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach Sarina Thomas et.al. 2402.19062 null
2024-02-29 Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey Yang Liu et.al. 2402.18844 link
2024-02-28 Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting Taeho Kang et.al. 2402.18330 link
2024-02-28 Location-guided Head Pose Estimation for Fisheye Image Bing Li et.al. 2402.18320 null
2024-02-28 NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images Jingrui Yu et.al. 2402.18196 null
2024-02-28 Six-Point Method for Multi-Camera Systems with Reduced Solution Space Banglei Guan et.al. 2402.18066 null
2024-02-27 Real-Time Estimation of Relative Pose for UAVs Using a Dual-Channel Feature Association Zhaoying Wang et.al. 2402.17504 null
2024-02-26 HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields Haozhe Qi et.al. 2402.17062 link
2024-02-26 DRSI-Net: Dual-Residual Spatial Interaction Network for Multi-Person Pose Estimation Shang Wu et.al. 2402.16640 null
2024-02-26 GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video Xinqi Liu et.al. 2402.16607 null

Visual Localization

Publish Date Title Authors PDF Code
2024-04-11 PRAM: Place Recognition Anywhere Model for Efficient Visual Localization Fei Xue et.al. 2404.07785 null
2024-04-11 Semantically-correlated memories in a dense associative model Thomas F Burns et.al. 2404.07123 link
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277 null
2024-04-07 Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval Jinpeng Wang et.al. 2404.04998 link
2024-04-06 Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning Juncheng Yang et.al. 2404.04538 null
2024-04-02 TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation Yehui Shen et.al. 2404.01587 link
2024-04-01 On Train-Test Class Overlap and Detection for Image Retrieval Chull Hwan Song et.al. 2404.01524 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546 null
2024-03-31 NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation Diwei Sheng et.al. 2404.00504 null
2024-03-30 SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs Yang Miao et.al. 2404.00469 null
2024-03-30 Do Vision-Language Models Understand Compound Nouns? Sonal Kumar et.al. 2404.00419 null
2024-04-05 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964 null
2024-03-28 JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition Gabriele Berton et.al. 2403.19787 link
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651 null
2024-03-27 AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation Changkun Liu et.al. 2403.18281 null
2024-03-26 Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge Dongjin Kim et.al. 2403.17420 link
2024-03-25 Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras Gokul B. Nair et.al. 2403.16425 null
2024-03-24 Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval Yucheng Suo et.al. 2403.16005 null
2024-03-24 BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Yinda Chen et.al. 2403.15992 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 link
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152 null
2024-03-22 Piecewise-Linear Manifolds for Deep Metric Learning Shubhang Bhatnagar et.al. 2403.14977 null
2024-03-21 Enhancing Historical Image Retrieval with Compositional Cues Tingyu Lin et.al. 2403.14287 link
2024-03-20 Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval Aymene Berriche et.al. 2403.13747 null
2024-03-20 Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval Haoyu Liu et.al. 2403.13317 null
2024-03-19 Learning Neural Volumetric Pose Features for Camera Localization Jingyu Lin et.al. 2403.12800 null
2024-03-19 Quantixar: High-performance Vector Data Management System Gulshan Yadav et.al. 2403.12583 null
2024-03-17 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization Peng Jiang et.al. 2403.11367 null
2024-03-17 MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paul S. Scotti et.al. 2403.11207 link
2024-03-16 Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval Shunsuke Tsubaki et.al. 2403.10756 null
2024-03-16 Vector search with small radiuses Gergely Szilvasy et.al. 2403.10746 null
2024-03-13 Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer Kenta Tsukahara et.al. 2403.10552 null
2024-03-20 Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression Huy-Hoang Bui et.al. 2403.10297 link
2024-03-15 Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline Fangming Yuan et.al. 2403.10283 null
2024-03-14 The NeRFect Match: Exploring NeRF Features for Visual Localization Qunjie Zhou et.al. 2403.09577 null
2024-03-14 VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition Benjamin Ramtoula et.al. 2403.09025 null
2024-03-13 PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models Siddharth Mishra-Sharma et.al. 2403.08851 link
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 null
2024-03-12 It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234 link
2024-03-12 You’ll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval Subhadeep Koley et.al. 2403.07222 null
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214 null
2024-03-11 How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? Subhadeep Koley et.al. 2403.07203 null
2024-03-11 EarthLoc: Astronaut Photography Localization by Indexing Earth from Space Gabriele Berton et.al. 2403.06758 link
2024-03-11 BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues Fudong Ge et.al. 2403.06600 null
2024-03-11 Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology Stefan Denner et.al. 2403.06567 null
2024-03-10 Texture image retrieval using a classification and contourlet-based features Asal Rouhafzay et.al. 2403.06048 null
2024-03-11 LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map Xinrui Wu et.al. 2403.05002 link
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765 null
2024-03-06 Self-supervised Photographic Image Layout Representation Learning Zhaoran Zhao et.al. 2403.03740 link
2024-03-04 Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models Benedikt Blumenstiel et.al. 2403.02059 link
2024-03-03 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval Yongchao Du et.al. 2403.01431 null
2024-03-01 Asymmetric Feature Fusion for Image Retrieval Hui Wu et.al. 2403.00671 null
2024-03-01 Structure Similarity Preservation Learning for Asymmetric Image Retrieval Hui Wu et.al. 2403.00648 link
2024-02-29 CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition Feng Lu et.al. 2402.19231 link
2024-02-28 Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport Bin Li et.al. 2402.18411 link
2024-02-28 Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning Hanyao Wang et.al. 2402.18400 null
2024-02-28 Representing 3D sparse map points and lines for camera relocalization Bach-Thuan Bui et.al. 2402.18011 link
2024-02-27 Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control Thong Nguyen et.al. 2402.17535 link
2024-02-29 Active propulsion noise shaping for multi-rotor aircraft localization Gabriele Serussi et.al. 2402.17289 link
2024-02-27 NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer Bingxi Liu et.al. 2402.17159 null
2024-02-25 Deep Homography Estimation for Visual Place Recognition Feng Lu et.al. 2402.16086 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742 null
2024-03-22 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null

2024-4

Pose Estimation

Publish Date Title Authors PDF Code
2024-05-02 IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning Ryan Hoque et.al. 2405.01472 null
2024-05-02 Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning Liu Qiyuan et.al. 2405.01284 null
2024-05-02 Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors Wenxuan Guo et.al. 2405.01112 null
2024-05-02 CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications Jan Blumenkamp et.al. 2405.01107 null
2024-05-02 HandSSCA: 3D Hand Mesh Reconstruction with State Space Channel Attention from RGB images Zixun Jiao et.al. 2405.01066 null
2024-05-01 Radar-Based Localization For Autonomous Ground Vehicles In Suburban Neighborhoods Andrew J. Kramer et.al. 2405.00600 null
2024-04-30 Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging Rayan Armani et.al. 2404.19541 link
2024-04-30 UniFS: Universal Few-shot Instance Perception with Point Representations Sheng Jin et.al. 2404.19401 null
2024-04-30 Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and Semi-supervised Training Xingyu Song et.al. 2404.19279 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-04-29 Self-Avatar Animation in Virtual Reality: Impact of Motion Signals Artifacts on the Full-Body Pose Reconstruction Antoine Maiorca et.al. 2404.18628 null
2024-04-29 Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle Jungwoo Lee et.al. 2404.18395 null
2024-04-29 Reconstructing Satellites in 3D from Amateur Telescope Images Zhiming Chang et.al. 2404.18394 null
2024-04-27 Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs Yiming Bao et.al. 2404.17837 null
2024-04-26 Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses Yi Shen et.al. 2404.17685 null
2024-04-26 SLAM for Indoor Mapping of Wide Area Construction Environments Vincent Ress et.al. 2404.17215 null
2024-04-25 WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users William Huang et.al. 2404.17063 link
2024-04-25 Transformer-Based Local Feature Matching for Multimodal Image Registration Remi Delaunay et.al. 2404.16802 null
2024-04-25 DeepKalPose: An Enhanced Deep-Learning Kalman Filter for Temporally Consistent Monocular Vehicle Pose Estimation Leandro Di Bella et.al. 2404.16558 null
2024-04-25 Efficient Solution of Point-Line Absolute Pose Petr Hruby et.al. 2404.16552 link
2024-04-25 COBRA – COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images Panagiotis Sapoutzoglou et.al. 2404.16471 link
2024-04-25 MegaParticles: Range-based 6-DoF Monte Carlo Localization with GPU-Accelerated Stein Particle Filter Kenji Koide et.al. 2404.16370 null
2024-04-24 3D Human Pose Estimation with Occlusions: Introducing BlendMimic3D Dataset and GCN Refinement Filipa Lino et.al. 2404.16136 null
2024-04-23 SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation Xiangyu Xu et.al. 2404.15276 link
2024-04-25 Domain adaptive pose estimation via multi-level alignment Yugan Chen et.al. 2404.14885 link
2024-04-23 Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking Kexin Meng et.al. 2404.14835 null
2024-04-23 UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues Vandad Davoodnia et.al. 2404.14634 null
2024-04-22 DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation Yonghao Dang et.al. 2404.14025 null
2024-04-23 CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory Yunlong Ran et.al. 2404.13896 null
2024-04-21 Resampling-free Particle Filters in High-dimensions Akhilan Boopathy et.al. 2404.13698 null
2024-04-20 EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment Guanghao Li et.al. 2404.13346 link
2024-04-18 Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds Oliver Lemke et.al. 2404.12440 null
2024-04-18 Gait Recognition from Highly Compressed Videos Andrei Niculae et.al. 2404.12183 null
2024-04-17 Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding George Retsinas et.al. 2404.12144 link
2024-04-17 Kathakali Hand Gesture Recognition With Minimal Data Kavitha Raju et.al. 2404.11205 null
2024-04-17 GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement Linfang Zheng et.al. 2404.11139 null
2024-04-17 CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation Lianyu Hu et.al. 2404.11111 link
2024-04-16 HumMUSS: Human Motion Understanding using State Space Models Arnab Kumar Mondal et.al. 2404.10880 null
2024-04-16 Invariant Kalman Filtering with Noise-Free Pseudo-Measurements Sven Goffin et.al. 2404.10687 null
2024-04-16 The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement Gabriele Trivigno et.al. 2404.10438 null
2024-04-16 GaitPoint+: A Gait Recognition Network Incorporating Point Cloud Analysis and Recycling Huantao Ren et.al. 2404.10213 null
2024-04-16 LWIRPOSE: A novel LWIR Thermal Image Dataset and Benchmark Avinash Upadhyay et.al. 2404.10212 link
2024-04-15 LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives Jiadi Cui et.al. 2404.09748 null
2024-04-14 In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition Wiktor Mucha et.al. 2404.09308 null
2024-04-13 DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector Johan Edstedt et.al. 2404.08928 link
2024-04-16 3D Human Scan With A Moving Event Camera Kai Kohyama et.al. 2404.08504 null
2024-04-11 Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method Tashmoy Ghosh et.al. 2404.07649 null
2024-04-11 GLID: Pre-training a Generalist Encoder-Decoder Vision Model Jihao Liu et.al. 2404.07603 null
2024-04-10 Measuring proximity to standard planes during fetal brain ultrasound scanning Chiara Di Vece et.al. 2404.07124 null
2024-04-10 MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints Bedirhan Uguz et.al. 2404.07094 null
2024-04-10 Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting Xiaolei Lang et.al. 2404.06926 null
2024-04-09 Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences Axel Barroso-Laguna et.al. 2404.06337 link
2024-04-09 Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes Tianchen Deng et.al. 2404.06050 null
2024-04-08 Learning 3D-Aware GANs from Unposed Images with Template Feature Field Xinya Chen et.al. 2404.05705 null
2024-04-08 Learning a Category-level Object Pose Estimator without Pose Annotations Fengrui Tian et.al. 2404.05626 null
2024-04-08 DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker Jiapeng Wu et.al. 2404.05518 link
2024-04-08 Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks Maksym Ivashechkin et.al. 2404.05414 null
2024-04-08 STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs Kush Hari et.al. 2404.05151 null
2024-04-05 ToolEENet: Tool Affordance 6D Pose Estimation Yunlong Wang et.al. 2404.04193 null
2024-04-04 SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation Sichen Chen et.al. 2404.03518 link
2024-04-04 Multi Positive Contrastive Learning with Pose-Consistent Generated Images Sho Inayoshi et.al. 2404.03256 null
2024-04-04 HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud Wencan Cheng et.al. 2404.03159 link
2024-04-03 Fusing Multi-sensor Input with State Information on TinyML Brains for Autonomous Nano-drones Luca Crupi et.al. 2404.02567 null
2024-04-03 Semi-Supervised Unconstrained Head Pose Estimation in the Wild Huayi Zhou et.al. 2404.02544 link
2024-04-02 3D Congealing: 3D-Aware Image Alignment in the Wild Yunzhi Zhang et.al. 2404.02125 null
2024-04-02 SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation Vinkle Srivastav et.al. 2404.02041 null
2024-04-01 Marrying NeRF with Feature Matching for One-step Pose Estimation Ronghan Chen et.al. 2404.00891 null
2024-03-31 Graph-Based vs. Error State Kalman Filter-Based Fusion Of 5G And Inertial Data For MAV Indoor Pose Estimation Meisam Kabiri et.al. 2404.00691 null
2024-03-31 OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos Dongyoung Choi et.al. 2404.00676 null
2024-04-02 KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation Jihua Peng et.al. 2404.00658 link
2024-03-29 FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model Molin Zhang et.al. 2404.00132 null
2024-03-29 Latent Embedding Clustering for Occlusion Robust Head Pose Estimation José Celestino et.al. 2403.20251 null
2024-03-29 A Unified Framework for Human-centric Point Cloud Video Understanding Yiteng Xu et.al. 2403.20031 null
2024-04-01 Video-Based Human Pose Regression via Decoupled Space-Time Aggregation Jijie He et.al. 2403.19926 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-26 Mathematical Foundation and Corrections for Full Range Head Pose Estimation Huei-Chung Hu et.al. 2403.18104 null
2024-03-26 EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation Chenhongyi Yang et.al. 2403.18080 null
2024-03-26 A Survey on 3D Egocentric Human Pose Estimation Md Mushfiqur Azam et.al. 2403.17893 null
2024-03-26 GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction Hrishav Bakul Barua et.al. 2403.17837 link

Visual Localization

Publish Date Title Authors PDF Code
2024-05-14 HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval Chao He et.al. 2405.07524 link
2024-05-13 JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation Xubo Luo et.al. 2405.07429 null
2024-05-12 BoQ: A Place is Worth a Bag of Learnable Queries Amar Ali-bey et.al. 2405.07364 link
2024-05-07 Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction Nematollah Saeidi et.al. 2405.04211 null
2024-05-06 A New Robust Partial $p$ -Wasserstein-Based Metric for Comparing Distributions Sharath Raghvendra et.al. 2405.03664 null
2024-05-06 Knowledge-aware Text-Image Retrieval for Remote Sensing Images Li Mi et.al. 2405.03373 null
2024-05-06 Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval Jiacheng Cheng et.al. 2405.03190 null
2024-05-05 iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval Lorenzo Agnolucci et.al. 2405.02951 link
2024-05-01 Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval Young Kyun Jang et.al. 2405.00571 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-04-29 Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Hongyi Zhu et.al. 2404.18746 null
2024-04-29 Dual-Modal Prompting for Sketch-Based Image Retrieval Liying Gao et.al. 2404.18695 null
2024-05-01 Semantic Line Combination Detector Jinwon Ko et.al. 2404.18399 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498 null
2024-04-25 CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching Samia Shafique et.al. 2404.16972 null
2024-04-29 Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval Ryoya Nara et.al. 2404.16398 null
2024-04-24 Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval Haokun Wen et.al. 2404.15875 link
2024-04-24 DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines Xin Jiang et.al. 2404.15771 null
2024-04-23 Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval Young Kyun Jang et.al. 2404.15516 null
2024-04-22 EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models Mathias Thorsager et.al. 2404.14236 null
2024-04-22 Hierarchical localization with panoramic views and triplet loss functions Marcos Alfaro et.al. 2404.14117 link
2024-04-20 High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces Baoru Huang et.al. 2404.13437 null
2024-04-20 Collaborative Visual Place Recognition through Federated Learning Mattia Dutto et.al. 2404.13324 null
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives Zhangchi Feng et.al. 2404.11317 null
2024-04-17 Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing Sanggeon Yun et.al. 2404.11025 null
2024-04-16 SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments Niklas Gard et.al. 2404.10527 link
2024-04-20 CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning Haojian Huang et.al. 2404.09640 link
2024-04-11 PRAM: Place Recognition Anywhere Model for Efficient Visual Localization Fei Xue et.al. 2404.07785 null
2024-04-11 Semantically-correlated memories in a dense associative model Thomas F Burns et.al. 2404.07123 link
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277 null
2024-04-07 Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval Jinpeng Wang et.al. 2404.04998 link
2024-04-06 Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning Juncheng Yang et.al. 2404.04538 null
2024-04-02 TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation Yehui Shen et.al. 2404.01587 link
2024-04-01 On Train-Test Class Overlap and Detection for Image Retrieval Chull Hwan Song et.al. 2404.01524 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546 null
2024-03-31 NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation Diwei Sheng et.al. 2404.00504 null
2024-03-30 SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs Yang Miao et.al. 2404.00469 null
2024-03-30 Do Vision-Language Models Understand Compound Nouns? Sonal Kumar et.al. 2404.00419 null
2024-04-05 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964 null
2024-03-28 JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition Gabriele Berton et.al. 2403.19787 link
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651 null
2024-03-27 AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation Changkun Liu et.al. 2403.18281 null
2024-03-26 Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge Dongjin Kim et.al. 2403.17420 link
2024-03-25 Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras Gokul B. Nair et.al. 2403.16425 null
2024-03-24 Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval Yucheng Suo et.al. 2403.16005 null
2024-03-24 BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Yinda Chen et.al. 2403.15992 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 link
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152 null

Keypoint Detection

Publish Date Title Authors PDF Code
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null

2024-5

Visual Localization

Publish Date Title Authors PDF Code
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340 link
2024-06-06 Monocular Localization with Semantics Map for Autonomous Vehicles Jixiang Wan et.al. 2406.03835 null
2024-06-05 Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Saehyung Lee et.al. 2406.03411 link
2024-06-04 MeshVPR: Citywide Visual Place Recognition Using 3D Meshes Gabriele Berton et.al. 2406.02776 null
2024-06-04 Can CLIP help CLIP in learning 3D? Cristian Sbrolli et.al. 2406.02202 null
2024-06-03 Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian et.al. 2406.01583 null
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-06-01 NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization Wugang Meng et.al. 2406.00312 null
2024-05-31 DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models Linli Yao et.al. 2405.20985 null
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333 null
2024-05-29 ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions Honglin Lin et.al. 2405.19226 null
2024-05-30 CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval Xintong Jiang et.al. 2405.19149 null
2024-05-29 SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation Zhenbei Wu et.al. 2405.18801 null
2024-05-29 Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs Jialiang Xu et.al. 2405.18740 link
2024-05-28 EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Issar Tzachor et.al. 2405.18065 null
2024-05-28 AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval Sihe Zhang et.al. 2405.17718 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599 null
2024-05-29 Composed Image Retrieval for Remote Sensing Bill Psomas et.al. 2405.15587 link
2024-05-24 Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval Yiming Wu et.al. 2405.15451 null
2024-05-20 UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization Wenjia Xu et.al. 2405.11936 link
2024-05-19 Register assisted aggregation for Visual Place Recognition Xuan Yu et.al. 2405.11526 null
2024-05-16 FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models Adrian Bulat et.al. 2405.10286 null
2024-05-15 Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study Farnaz Khun Jush et.al. 2405.09334 null
2024-05-14 BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment Lihong Jin et.al. 2405.09001 null
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-14 HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval Chao He et.al. 2405.07524 link
2024-05-13 JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation Xubo Luo et.al. 2405.07429 link
2024-05-12 BoQ: A Place is Worth a Bag of Learnable Queries Amar Ali-bey et.al. 2405.07364 link
2024-05-07 Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction Nematollah Saeidi et.al. 2405.04211 null
2024-05-06 A New Robust Partial $p$ -Wasserstein-Based Metric for Comparing Distributions Sharath Raghvendra et.al. 2405.03664 null
2024-05-06 Knowledge-aware Text-Image Retrieval for Remote Sensing Images Li Mi et.al. 2405.03373 null
2024-05-06 Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval Jiacheng Cheng et.al. 2405.03190 null
2024-05-05 iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval Lorenzo Agnolucci et.al. 2405.02951 link
2024-05-01 Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval Young Kyun Jang et.al. 2405.00571 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-04-29 Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Hongyi Zhu et.al. 2404.18746 null
2024-04-29 Dual-Modal Prompting for Sketch-Based Image Retrieval Liying Gao et.al. 2404.18695 null
2024-05-01 Semantic Line Combination Detector Jinwon Ko et.al. 2404.18399 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498 null
2024-04-25 CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching Samia Shafique et.al. 2404.16972 null
2024-04-29 Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval Ryoya Nara et.al. 2404.16398 null
2024-04-24 Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval Haokun Wen et.al. 2404.15875 link
2024-04-24 DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines Xin Jiang et.al. 2404.15771 null

Pose Estimation

Publish Date Title Authors PDF Code
2024-06-05 Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices Xingjian Yang et.al. 2406.02977 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-06-04 HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model Yu Tian et.al. 2406.01914 null
2024-06-03 A Robust Filter for Marker-less Multi-person Tracking in Human-Robot Interaction Scenarios Enrico Martini et.al. 2406.01832 link
2024-06-01 Equivariant amortized inference of poses for cryo-EM Larissa de Ruijter et.al. 2406.01630 null
2024-06-03 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information Sihan Wen et.al. 2406.01196 null
2024-06-01 CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation Matan Rusanovsky et.al. 2406.00384 link
2024-05-30 Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach Muhammad Saif Ullah Khan et.al. 2405.20084 null
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-05-29 Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives Mingqi Yuan et.al. 2405.19531 null
2024-05-29 Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation Sabrina Cynthia Triess et.al. 2405.19173 null
2024-05-28 World Models for General Surgical Grasping Hongbin Lin et.al. 2405.17940 null
2024-05-27 MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds Jiahui Lei et.al. 2405.17421 null
2024-05-27 Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding Niloofar Azizi et.al. 2405.17397 null
2024-05-27 $\text{Di}^2\text{Pose}$ : Discrete Diffusion Model for Occluded 3D Human Pose Estimation Weiquan Wang et.al. 2405.17016 null
2024-05-27 Clustering-based Learning for UAV Tracking and Pose Estimation Jiaping Xiao et.al. 2405.16867 null
2024-05-26 Multi-Modal UAV Detection, Classification and Tracking Algorithm – Technical Report for CVPR 2024 UG2 Challenge Tianchen Deng et.al. 2405.16464 link
2024-05-25 Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality Hakim Ikebayashi et.al. 2405.16008 null
2024-05-23 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments Yang Zhou et.al. 2405.14731 link
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 null
2024-05-21 Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos Jayroop Ramesh et.al. 2405.13235 null
2024-05-21 Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations Antoine Legrand et.al. 2405.12728 null
2024-05-21 PoseGravity: Pose Estimation from Points and Lines with Axis Prior Akshay Chandrasekhar et.al. 2405.12646 link
2024-05-19 Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation Zejun Gu et.al. 2405.12247 null
2024-05-20 AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements Calvin Yeung et.al. 2405.12070 link
2024-05-19 Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries Christiaan G. A. Viviers et.al. 2405.11677 link
2024-05-19 Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation Zejun Gu et.al. 2405.11448 null
2024-05-18 PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking Yifan Yang et.al. 2405.11257 null
2024-05-18 MotionGS : Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-17 Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation Yongliang Lin et.al. 2405.10557 null
2024-05-16 Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder Mohamed Ilyes Lakhal et.al. 2405.10423 null
2024-05-17 Toon3D: Seeing Cartoons from a New Perspective Ethan Weber et.al. 2405.10320 null
2024-05-15 Task-adaptive Q-Face Haomiao Sun et.al. 2405.09059 null
2024-05-14 RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images Zong-Wei Hong et.al. 2405.08483 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-13 Deep Learning-Based Object Pose Estimation: A Comprehensive Survey Jian Liu et.al. 2405.07801 link
2024-05-13 JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation Xubo Luo et.al. 2405.07429 link
2024-05-11 TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization Zhen Tan et.al. 2405.07027 null
2024-05-11 AHPPEBot: Autonomous Robot for Tomato Harvesting based on Phenotyping and Pose Estimation Xingxu Li et.al. 2405.06959 null
2024-05-10 CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras James Tang et.al. 2405.06845 link
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241 null
2024-05-10 Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera Haixin Shi et.al. 2405.05858 null
2024-05-09 Semi-Autonomous Laparoscopic Robot Docking with Learned Hand-Eye Information Fusion Huanyu Tian et.al. 2405.05817 null
2024-05-09 NeuRSS: Enhancing AUV Localization and Bathymetric Mapping with Neural Rendering for Sidescan SLAM Yiping Xie et.al. 2405.05807 null
2024-05-09 Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview Yuhang Ming et.al. 2405.05526 null
2024-05-08 Adversary-Guided Motion Retargeting for Skeleton Anonymization Thomas Carr et.al. 2405.05428 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216 link
2024-05-08 ProbRadarM3F: mmWave Radar based Human Skeletal Pose Estimation with Probability Map Guided Multi-Format Feature Fusion Bing Zhu et.al. 2405.05164 null
2024-05-08 GISR: Geometric Initialization and Silhouette-based Refinement for Single-View Robot Pose and Configuration Estimation Ivan Bilić et.al. 2405.04890 null
2024-05-07 Learning Distributional Demonstration Spaces for Task-Specific Cross-Pose Estimation Jenny Wang et.al. 2405.04609 null
2024-05-07 Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform Zhijian Qiao et.al. 2405.03969 null
2024-05-07 Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints Xiongjun Guan et.al. 2405.03959 null
2024-05-06 Pose Priors from Language Models Sanjay Subramanian et.al. 2405.03689 null
2024-05-06 Optimizing Hand Region Detection in MediaPipe Holistic Full-Body Pose Estimation to Improve Accuracy and Avoid Downstream Errors Amit Moryossef et.al. 2405.03545 link
2024-05-05 Multi-hop graph transformer network for 3D human pose estimation Zaedul Islam et.al. 2405.03055 null
2024-05-05 Blending Distributed NeRFs with Tri-stage Robust Pose Optimization Baijun Ye et.al. 2405.02880 null
2024-05-03 WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD Xuxin Cheng et.al. 2405.02241 null
2024-05-03 Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation Xianzhou Zeng et.al. 2405.02114 link
2024-05-03 An Onboard Framework for Staircases Modeling Based on Point Clouds Chun Qing et.al. 2405.01918 null
2024-05-06 ShadowNav: Autonomous Global Localization for Lunar Navigation in Darkness Deegan Atha et.al. 2405.01673 null
2024-05-02 IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning Ryan Hoque et.al. 2405.01472 null
2024-05-02 Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning Liu Qiyuan et.al. 2405.01284 null
2024-05-02 Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors Wenxuan Guo et.al. 2405.01112 null
2024-05-02 CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications Jan Blumenkamp et.al. 2405.01107 null
2024-05-04 HandSSCA: 3D Hand Mesh Reconstruction with State Space Channel Attention from RGB images Zixun Jiao et.al. 2405.01066 null
2024-05-01 Radar-Based Localization For Autonomous Ground Vehicles In Suburban Neighborhoods Andrew J. Kramer et.al. 2405.00600 null
2024-04-30 Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging Rayan Armani et.al. 2404.19541 link
2024-04-30 UniFS: Universal Few-shot Instance Perception with Point Representations Sheng Jin et.al. 2404.19401 null
2024-04-30 Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and Semi-supervised Training Xingyu Song et.al. 2404.19279 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-04-29 Self-Avatar Animation in Virtual Reality: Impact of Motion Signals Artifacts on the Full-Body Pose Reconstruction Antoine Maiorca et.al. 2404.18628 null
2024-04-29 Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle Jungwoo Lee et.al. 2404.18395 null
2024-04-29 Reconstructing Satellites in 3D from Amateur Telescope Images Zhiming Chang et.al. 2404.18394 null
2024-04-27 Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs Yiming Bao et.al. 2404.17837 null
2024-04-26 Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses Yi Shen et.al. 2404.17685 null

Keypoint Detection

Publish Date Title Authors PDF Code
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672 null
2024-07-23 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link

2024-6

Keypoint Detection

Publish Date Title Authors PDF Code
2024-08-15 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672 null
2024-07-23 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null

Pose Estimation

Publish Date Title Authors PDF Code
2024-07-03 Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation Mengmeng Cui et.al. 2407.02990 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-02 SUPER: Seated Upper Body Pose Estimation using mmWave Radars Bo Zhang et.al. 2407.02455 null
2024-07-02 ReliaAvatar: A Robust Real-Time Avatar Animator with Integrated Motion Prediction Bo Qian et.al. 2407.02129 null
2024-07-02 Joint-Dataset Learning and Cross-Consistent Regularization for Text-to-Motion Retrieval Nicola Messina et.al. 2407.02104 null
2024-07-01 Active Human Pose Estimation via an Autonomous UAV Agent Jingxi Chen et.al. 2407.01811 null
2024-07-01 RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields Haochen Jiang et.al. 2407.01303 null
2024-07-01 Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization Ruofei Bai et.al. 2407.01013 null
2024-06-30 Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation Adnan Abdullah et.al. 2407.00848 null
2024-06-29 When Robots Get Chatty: Grounding Multimodal Human-Robot Conversation and Collaboration Philipp Allgeuer et.al. 2407.00518 null
2024-06-28 Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review Moseli Mots’oehli et.al. 2407.00252 null
2024-06-28 EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans Nicola Garau et.al. 2406.19726 null
2024-06-28 CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services DongKi Noh et.al. 2406.19634 null
2024-06-27 Multimodal Visual-haptic pose estimation in the presence of transient occlusion Michael Zechmair et.al. 2406.19323 null
2024-06-27 Human Modelling and Pose Estimation Overview Pawel Knap et.al. 2406.19290 null
2024-06-26 Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference Yuan Gao et.al. 2406.18453 link
2024-06-27 Automatic infant 2D pose estimation from videos: comparing seven deep neural network methods Filipe Gama et.al. 2406.17382 null
2024-06-24 High-resolution open-vocabulary object 6D pose estimation Jaime Corsetti et.al. 2406.16384 null
2024-06-23 Breaking the Frame: Image Retrieval by Visual Overlap Prediction Tong Wei et.al. 2406.16204 link
2024-06-21 Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe Sandeep Singh Sengar et.al. 2406.15649 link
2024-06-24 Investigating the impact of 2D gesture representation on co-speech gesture generation Teo Guichoux et.al. 2406.15111 null
2024-06-20 Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data Moira Shooter et.al. 2406.14412 null
2024-06-20 PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions Sihan Ma et.al. 2406.14367 null
2024-06-19 NeRF-Feat: 6D Object Pose Estimation using Feature Rendering Shishir Reddy Vutukur et.al. 2406.13796 null
2024-06-19 CNN Based Flank Predictor for Quadruped Animal Species Vanessa Suessle et.al. 2406.13588 null
2024-06-19 MVSBoost: An Efficient Point Cloud-based 3D Reconstruction Umair Haroon et.al. 2406.13515 null
2024-06-19 An Efficient yet High-Performance Method for Precise Radar-Based Imaging of Human Hand Poses Johanna Bräunig et.al. 2406.13464 null
2024-06-18 Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings Ruijie Tang et.al. 2406.13048 null
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766 null
2024-06-17 Domain Generalization for In-Orbit 6D Pose Estimation Antoine Legrand et.al. 2406.11743 null
2024-06-17 SeamPose: Repurposing Seams as Capacitive Sensors in a Shirt for Upper-Body Pose Tracking Tianhong Catherine Yu et.al. 2406.11645 null
2024-06-14 Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization Wonho Song et.al. 2406.11599 null
2024-06-15 MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception M. Mahbubur Rahman et.al. 2406.10708 null
2024-06-15 Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference Shayan Shekarforoush et.al. 2406.10455 null
2024-06-14 The BabyView dataset: High-resolution egocentric videos of infants’ and young children’s everyday experiences Bria Long et.al. 2406.10447 null
2024-06-14 OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics Yoni Gozlan et.al. 2406.09788 null
2024-06-13 ImageNet3D: Towards General-Purpose Object-Level 3D Understanding Wufei Ma et.al. 2406.09613 link
2024-06-13 Deep Transformer Network for Monocular Pose Estimation of Ship-Based UAV Maneesha Wickramasuriya et.al. 2406.09260 link
2024-06-14 Language-Driven Closed-Loop Grasping with Model-Predictive Trajectory Replanning Huy Hoang Nguyen et.al. 2406.09039 null
2024-06-14 VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks Jiannan Wu et.al. 2406.08394 link
2024-06-12 Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization Jiaxin Deng et.al. 2406.08001 null
2024-06-12 IFTD: Image Feature Triangle Descriptor for Loop Detection in Driving Scenes Fengtian Lang et.al. 2406.07937 link
2024-06-12 From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers Swaminathan Gurumurthy et.al. 2406.07785 link
2024-06-12 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500 link
2024-06-11 Realistic Data Generation for 6D Pose Estimation of Surgical Instruments Juan Antonio Barragan et.al. 2406.07328 link
2024-06-11 SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale Shester Gueuwou et.al. 2406.06907 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-08 A preprocessing-based planning framework for utilizing contacts in high-precision insertion tasks Muhammad Suhail Saleem et.al. 2406.05522 null
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340 link
2024-06-06 Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking Jiyao Zhang et.al. 2406.04316 null
2024-06-05 Hi5: 2D Hand Pose Estimation with Zero Human Annotation Masum Hasan et.al. 2406.03599 null
2024-06-05 Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices Xingjian Yang et.al. 2406.02977 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-06-04 HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model Yu Tian et.al. 2406.01914 null
2024-06-03 A Robust Filter for Marker-less Multi-person Tracking in Human-Robot Interaction Scenarios Enrico Martini et.al. 2406.01832 link
2024-06-01 Equivariant amortized inference of poses for cryo-EM Larissa de Ruijter et.al. 2406.01630 null
2024-06-03 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information Sihan Wen et.al. 2406.01196 null
2024-06-01 CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation Matan Rusanovsky et.al. 2406.00384 link
2024-05-30 Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach Muhammad Saif Ullah Khan et.al. 2405.20084 null
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-05-29 Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives Mingqi Yuan et.al. 2405.19531 null
2024-05-29 Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation Sabrina Cynthia Triess et.al. 2405.19173 null

Visual Localization

Publish Date Title Authors PDF Code
2024-07-08 Pseudo-triplet Guided Few-shot Composed Image Retrieval Bohan Hou et.al. 2407.06001 null
2024-07-09 HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels Yingying Jiang et.al. 2407.05795 null
2024-07-05 Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning Mainak Singha et.al. 2407.04207 null
2024-07-04 Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models Chang-Sheng Kao et.al. 2407.03615 link
2024-07-03 Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach Pronay Debnath et.al. 2407.03486 null
2024-07-02 Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition Sergio Izquierdo et.al. 2407.02422 link
2024-07-01 Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval Aneeshan Sain et.al. 2407.01810 null
2024-07-01 Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval Hanwen Su et.al. 2407.00979 null
2024-07-01 Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios Connor Malone et.al. 2407.00863 null
2024-06-27 PathAlign: A vision-language model for whole slide images in histopathology Faruk Ahmed et.al. 2406.19578 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs Huaying Zhang et.al. 2406.18836 null
2024-06-26 WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images Yannik Glaser et.al. 2406.18765 null
2024-06-26 View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis Subin Varghese et.al. 2406.18012 null
2024-06-25 Tell Me Where You Are: Multimodal LLMs Meet Place Recognition Zonglin Lyu et.al. 2406.17520 null
2024-06-23 Breaking the Frame: Image Retrieval by Visual Overlap Prediction Tong Wei et.al. 2406.16204 link
2024-06-19 Towards a multimodal framework for remote sensing image change retrieval and captioning Roger Ferrod et.al. 2406.13424 null
2024-06-19 CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval Christian Lülf et.al. 2406.13322 link
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766 null
2024-06-22 Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment Jianan Jiang et.al. 2406.11551 link
2024-06-17 They’re All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias Salma Abdel Magid et.al. 2406.11331 null
2024-06-17 Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion Guoyuan An et.al. 2406.11242 null
2024-06-14 Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval Genc Hoxha et.al. 2406.10107 null
2024-06-14 BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval Imanol Miranda et.al. 2406.09952 link
2024-06-13 Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases Meng Wang et.al. 2406.09317 null
2024-06-13 Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval Jaeseok Byun et.al. 2406.09188 null
2024-06-13 DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification Zhengrui Xu et.al. 2406.08773 null
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463 null
2024-06-12 ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery Kam Woh Ng et.al. 2406.08457 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502 link
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450 link
2024-06-11 Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval Adrià Molina et.al. 2406.07315 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-09 Unified Text-to-Image Generation and Retrieval Leigang Qu et.al. 2406.05814 null
2024-06-07 The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better Scott Geng et.al. 2406.05184 link
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746 link
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340 link
2024-06-06 Monocular Localization with Semantics Map for Autonomous Vehicles Jixiang Wan et.al. 2406.03835 null
2024-06-05 Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Saehyung Lee et.al. 2406.03411 link
2024-06-04 MeshVPR: Citywide Visual Place Recognition Using 3D Meshes Gabriele Berton et.al. 2406.02776 null
2024-06-04 Can CLIP help CLIP in learning 3D? Cristian Sbrolli et.al. 2406.02202 null
2024-06-03 Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian et.al. 2406.01583 null
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-06-01 NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization Wugang Meng et.al. 2406.00312 null
2024-05-31 DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models Linli Yao et.al. 2405.20985 null
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333 null
2024-05-29 ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions Honglin Lin et.al. 2405.19226 null
2024-05-30 CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval Xintong Jiang et.al. 2405.19149 null
2024-05-29 SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation Zhenbei Wu et.al. 2405.18801 null

2024-7

Pose Estimation

Publish Date Title Authors PDF Code
2024-08-05 Joint-Motion Mutual Learning for Pose Estimation in Videos Sifan Wu et.al. 2408.02285 null
2024-08-04 AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos Feichi Lu et.al. 2408.02110 null
2024-08-04 Generalized Maximum Likelihood Estimation for Perspective-n-Point Problem Tian Zhan et.al. 2408.01945 null
2024-08-03 MotionTrace: IMU-based Field of View Prediction for Smartphone AR Interactions Rahul Islam et.al. 2408.01850 null
2024-08-03 BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles Lun Luo et.al. 2408.01841 null
2024-08-03 E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images Yunshan Qi et.al. 2408.01840 null
2024-08-03 Survey on Emotion Recognition through Posture Detection and the possibility of its application in Virtual Reality Leina Elansary et.al. 2408.01728 null
2024-08-03 Stimulating Imagination: Towards General-purpose Object Rearrangement Jianyang Wu et.al. 2408.01655 null
2024-08-02 Full-range Head Pose Geometric Data Augmentations Huei-Chung Hu et.al. 2408.01566 null
2024-07-31 Adapting Skills to Novel Grasps: A Self-Supervised Approach Georgios Papagiannis et.al. 2408.00178 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-30 HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation Wencan Cheng et.al. 2407.20542 link
2024-07-30 Markers Identification for Relative Pose Estimation of an Uncooperative Target Batu Candan et.al. 2407.20515 null
2024-07-29 BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation Kieran Saunders et.al. 2407.20437 null
2024-07-28 Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph Zhengcen Li et.al. 2407.19497 null
2024-07-26 Flexible graph convolutional network for 3D human pose estimation Abu Taib Mohammed Shahjahan et.al. 2407.19077 null
2024-07-26 From 2D to 3D: AISG-SLA Visual Localization Challenge Jialin Gao et.al. 2407.18590 null
2024-07-28 HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Zhenzhi Wang et.al. 2407.17438 link
2024-07-24 Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments Wei Gao et.al. 2407.17078 null
2024-07-30 DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction Xiaobiao Du et.al. 2407.16988 link
2024-07-24 Pose Estimation from Camera Images for Underwater Inspection Luyuan Peng et.al. 2407.16961 null
2024-07-23 COALA: A Practical and Vision-Centric Federated Learning Platform Weiming Zhuang et.al. 2407.16560 link
2024-07-23 Probabilistic Parameter Estimators and Calibration Metrics for Pose Estimation from Image Features Romeo Valentin et.al. 2407.16223 null
2024-07-23 Optimal camera-robot pose estimation in linear time from points and lines Guangyang Zeng et.al. 2407.16151 null
2024-07-23 3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images Jie Zhao et.al. 2407.16137 null
2024-07-21 CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models Zheng Chong et.al. 2407.15886 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-22 Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection Kangqi Ma et.al. 2407.15771 null
2024-07-22 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model Matteo Bortolon et.al. 2407.15484 null
2024-07-23 Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions Yihao Ai et.al. 2407.15451 null
2024-07-22 avaTTAR: Table Tennis Stroke Training with On-body and Detached Visualization in Augmented Reality Dizhi Ma et.al. 2407.15373 null
2024-07-20 From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM Lorenzo Montano-Oliván et.al. 2407.14797 null
2024-07-19 ESCAPE: Energy-based Selective Adaptive Correction for Out-of-distribution 3D Human Pose Estimation Luke Bidulka et.al. 2407.14605 null
2024-07-19 6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry Sungho Chun et.al. 2407.14136 link
2024-07-18 RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark Yuan-Hao Ho et.al. 2407.13930 null
2024-07-19 GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation Bangyan Liao et.al. 2407.13537 null
2024-07-18 SCAPE: A Simple and Strong Category-Agnostic Pose Estimator Yujia Liang et.al. 2407.13483 link
2024-07-17 SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization Yiyang Chen et.al. 2407.12667 link
2024-07-17 Invertible Neural Warp for NeRF Shin-Fang Chng et.al. 2407.12354 null
2024-07-16 NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models Francesco Milano et.al. 2407.12207 link
2024-07-16 Monocular pose estimation of articulated surgical instruments in open surgery Robert Spektor et.al. 2407.12138 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-15 A BlueROV2-based platform for underwater mapping experiments Tudor Alinei-Poiana et.al. 2407.10901 null
2024-07-15 LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning Zhuozhu Jian et.al. 2407.10782 null
2024-07-15 Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis Antoine Legrand et.al. 2407.10762 null
2024-07-16 GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation Haonan Wang et.al. 2407.10756 null
2024-07-15 Learning to Estimate the Pose of a Peer Robot in a Camera Image by Predicting the States of its LEDs Nicholas Carlotti et.al. 2407.10661 null
2024-07-15 Deep-Learning-Based Markerless Pose Estimation Systems in Gait Analysis: DeepLabCut Custom Training and the Refinement Function Giulia Panconi et.al. 2407.10590 null
2024-07-14 3D Foundation Models Enable Simultaneous Geometry and Pose Estimation of Grasped Objects Weiming Zhi et.al. 2407.10331 null
2024-07-16 psifx – Psychological and Social Interactions Feature Extraction Package Guillaume Rochette et.al. 2407.10266 null
2024-07-14 PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation Nermin Samet et.al. 2407.10220 null
2024-07-14 3DEgo: 3D Editing on the Go! Umar Khalid et.al. 2407.10102 null
2024-07-12 iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning Tom Fischer et.al. 2407.09271 null
2024-07-12 HUP-3D: A 3D multi-view synthetic dataset for assisted-egocentric hand-ultrasound pose estimation Manuel Birlo et.al. 2407.09215 null
2024-07-12 KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting Andrew Jeong et.al. 2407.08909 null
2024-07-11 RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation Tao Jiang et.al. 2407.08634 link
2024-07-11 SRPose: Two-view Relative Pose Estimation with Sparse Keypoints Rui Yin et.al. 2407.08199 link
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106 null
2024-07-10 RoCap: A Robotic Data Collection Pipeline for the Pose Estimation of Appearance-Changing Objects Jiahao Nick Li et.al. 2407.08081 null
2024-07-10 Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization Jinjie Mai et.al. 2407.08023 link
2024-07-10 Greit-HRNet: Grouped Lightweight High-Resolution Network for Human Pose Estimation Junjia Han et.al. 2407.07389 null
2024-07-09 Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images Chuanrui Zhang et.al. 2407.06984 null
2024-07-09 Computer vision tasks for intelligent aerospace missions: An overview Huilin Chen et.al. 2407.06513 null
2024-07-08 GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields Weiyi Xue et.al. 2407.05597 null
2024-07-10 On the power of data augmentation for head pose estimation Michael Welter et.al. 2407.05357 null
2024-07-07 SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning Yi Feng et.al. 2407.05283 link
2024-07-05 Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos Leonhard Sommer et.al. 2407.04384 link
2024-07-04 Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation Laiyan Ding et.al. 2407.04041 null
2024-07-04 Markerless Multi-view 3D Human Pose Estimation: a survey Ana Filipa Rodrigues Nogueira et.al. 2407.03817 null
2024-07-04 A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios Zikang Yuan et.al. 2407.03590 null
2024-07-03 Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation Mengmeng Cui et.al. 2407.02990 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-02 SUPER: Seated Upper Body Pose Estimation using mmWave Radars Bo Zhang et.al. 2407.02455 null
2024-07-02 ReliaAvatar: A Robust Real-Time Avatar Animator with Integrated Motion Prediction Bo Qian et.al. 2407.02129 null
2024-07-02 Joint-Dataset Learning and Cross-Consistent Regularization for Text-to-Motion Retrieval Nicola Messina et.al. 2407.02104 null
2024-07-01 Active Human Pose Estimation via an Autonomous UAV Agent Jingxi Chen et.al. 2407.01811 null
2024-07-01 RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields Haochen Jiang et.al. 2407.01303 null
2024-07-01 Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization Ruofei Bai et.al. 2407.01013 null
2024-06-30 Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation Adnan Abdullah et.al. 2407.00848 null
2024-06-29 When Robots Get Chatty: Grounding Multimodal Human-Robot Conversation and Collaboration Philipp Allgeuer et.al. 2407.00518 null
2024-06-28 Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review Moseli Mots’oehli et.al. 2407.00252 null
2024-06-28 EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans Nicola Garau et.al. 2406.19726 null

Visual Localization

Publish Date Title Authors PDF Code
2024-08-19 Fashion Image-to-Image Translation for Complementary Item Retrieval Matteo Attimonelli et.al. 2408.09847 null
2024-08-20 MambaLoc: Efficient Camera Localisation via State Space Model Jialu Wang et.al. 2408.09680 null
2024-08-15 DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions Ryosuke Korekata et.al. 2408.07910 null
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-10 Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network Junyan Ye et.al. 2408.05475 link
2024-08-09 Spherical World-Locking for Audio-Visual Localization in Egocentric Videos Heeseung Yun et.al. 2408.05364 null
2024-08-06 AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval Pavel Suma et.al. 2408.03282 null
2024-08-05 CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration Gongxin Yao et.al. 2408.02394 null
2024-08-02 On Validation of Search & Retrieval of Tissue Images in Digital Pathology H. R. Tizhoosh et.al. 2408.01570 null
2024-07-31 VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning Yuhang Ming et.al. 2407.21416 null
2024-07-30 Re-localization acceleration with Medoid Silhouette Clustering Hongyi Zhang et.al. 2407.20749 null
2024-07-26 From 2D to 3D: AISG-SLA Visual Localization Challenge Jialin Gao et.al. 2407.18590 null
2024-07-24 Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Yongqi Li et.al. 2407.17274 null
2024-07-24 Pose Estimation from Camera Images for Underwater Inspection Luyuan Peng et.al. 2407.16961 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-19 Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization Yuehua Ding et.al. 2407.14643 null
2024-07-18 Visual Haystacks: Answering Harder Questions About Sets of Images Tsung-Han Wu et.al. 2407.13766 link
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis Ruijie Yang et.al. 2407.11401 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 DINO Pre-training for Vision-based End-to-end Autonomous Driving Shubham Juneja et.al. 2407.10803 null
2024-07-15 Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval Youngsun Lim et.al. 2407.10683 null
2024-07-15 An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots J. J. Cabrera et.al. 2407.10596 link
2024-07-15 An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments J. J. Cabrera et.al. 2407.10536 null
2024-07-12 Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval Vaibhav Balloli et.al. 2407.08908 link
2024-07-11 Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates Owen Claxton et.al. 2407.08162 link
2024-07-12 Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal Xinyu Zhu et.al. 2407.08153 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-09 CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding Wenhao Xu et.al. 2407.06611 null
2024-07-08 Pseudo-triplet Guided Few-shot Composed Image Retrieval Bohan Hou et.al. 2407.06001 null
2024-07-09 HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels Yingying Jiang et.al. 2407.05795 null
2024-07-05 Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning Mainak Singha et.al. 2407.04207 link
2024-07-04 Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models Chang-Sheng Kao et.al. 2407.03615 link
2024-07-03 Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach Pronay Debnath et.al. 2407.03486 null
2024-07-02 Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition Sergio Izquierdo et.al. 2407.02422 link
2024-07-01 Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval Aneeshan Sain et.al. 2407.01810 null
2024-07-01 Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval Hanwen Su et.al. 2407.00979 null
2024-07-01 Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios Connor Malone et.al. 2407.00863 null
2024-06-27 PathAlign: A vision-language model for whole slide images in histopathology Faruk Ahmed et.al. 2406.19578 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs Huaying Zhang et.al. 2406.18836 null
2024-06-26 WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images Yannik Glaser et.al. 2406.18765 null
2024-06-26 View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis Subin Varghese et.al. 2406.18012 null
2024-06-25 Tell Me Where You Are: Multimodal LLMs Meet Place Recognition Zonglin Lyu et.al. 2406.17520 null

Keypoint Detection

Publish Date Title Authors PDF Code
2024-10-03 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899 null
2024-10-07 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-10-01 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672 null
2024-07-23 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null

2024-8

Pose Estimation

Publish Date Title Authors PDF Code
2024-09-01 Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach Wenjun Huang et.al. 2409.02715 null
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581 null
2024-09-03 EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision Yiming Zhao et.al. 2409.02224 null
2024-09-03 Deep learning for objective estimation of Parkinsonian tremor severity Felipe Duque-Quiceno et.al. 2409.02011 null
2024-09-03 SPiKE: 3D Human Pose from Point Cloud Sequences Irene Ballester et.al. 2409.01879 link
2024-09-02 Kalman Filtering for Precise Indoor Position and Orientation Estimation Using IMU and Acoustics on Riemannian Manifolds Mohammed H. AlSharif et.al. 2409.01002 null
2024-09-01 Detection, Recognition and Pose Estimation of Tabletop Objects Sanjuksha Nirgude et.al. 2409.00869 null
2024-09-01 DSLO: Deep Sequence LiDAR Odometry Based on Inconsistent Spatio-temporal Propagation Huixin Zhang et.al. 2409.00744 link
2024-09-01 MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds Ziqiang Dang et.al. 2409.00736 null
2024-08-31 ActionPose: Pretraining 3D Human Pose Estimation with the Dark Knowledge of Action Longyun Liao et.al. 2409.00449 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-08-30 BOP-D: Revisiting 6D Pose Estimation Benchmark for Better Evaluation under Visual Ambiguities Boris Meden et.al. 2408.17297 null
2024-08-30 EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs Zhen Fan et.al. 2408.17168 null
2024-09-01 Generic Objects as Pose Probes for Few-Shot View Synthesis Zhirui Gao et.al. 2408.16690 null
2024-08-29 OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation Yuchen Che et.al. 2408.16547 link
2024-08-29 GRPose: Learning Graph Relations for Human Image Generation with Pose Priors Xiangchen Yin et.al. 2408.16540 null
2024-08-28 Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators Nikita Kister et.al. 2408.16536 null
2024-08-28 Multi-view Pose Fusion for Occlusion-Aware 3D Human Pose Estimation Laura Bragagnolo et.al. 2408.15810 link
2024-08-30 Addressing the challenges of loop detection in agricultural environments Nicolás Soncini et.al. 2408.15761 link
2024-08-28 Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph Zherong Zhang et.al. 2408.15750 null
2024-08-28 Benchmarking ML Approaches to UWB-Based Range-Only Posture Recognition for Human Robot-Interaction Salma Salimi et.al. 2408.15717 null
2024-08-26 Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model Abu Saleh Musa Miah et.al. 2408.14111 null
2024-08-25 InterTrack: Tracking Human Object Interaction without Object Templates Xianghui Xie et.al. 2408.13953 null
2024-08-24 Temporally-consistent 3D Reconstruction of Birds Johannes Hägerlind et.al. 2408.13629 null
2024-08-24 Explainable Convolutional Networks for Crater Detection and Lunar Landing Navigation Jianing Song et.al. 2408.13587 null
2024-08-27 Sapiens: Foundation for Human Vision Models Rawal Khirodkar et.al. 2408.12569 null
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085 null
2024-08-20 ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data Elia Bonetto et.al. 2408.10831 null
2024-08-20 MPL: Lifting 3D Human Pose from Multi-view 2D Poses Seyed Abolfazl Ghasemzadeh et.al. 2408.10805 link
2024-08-19 RUMI: Rummaging Using Mutual Information Sheng Zhong et.al. 2408.10450 null
2024-08-19 SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views Chao Xu et.al. 2408.10195 null
2024-08-19 SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition Wiktor Mucha et.al. 2408.10037 link
2024-08-19 Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation Qianhui Men et.al. 2408.09931 null
2024-08-18 OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare Chen Long-fei et.al. 2408.09409 null
2024-08-17 An Open-Source American Sign Language Fingerspell Recognition and Semantic Pose Retrieval Interface Kevin Jose Thomas et.al. 2408.09311 link
2024-08-16 ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation Hao Tang et.al. 2408.09042 null
2024-08-16 Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS Wei Sun et.al. 2408.08723 null
2024-08-16 SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis Xingyue Lin et.al. 2408.08623 null
2024-08-15 HyperTaxel: Hyper-Resolution for Taxel-Based Tactile Signals Through Contrastive Learning Hongyu Li et.al. 2408.08312 null
2024-08-15 Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation Varun Burde et.al. 2408.08234 link
2024-08-15 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-08-15 Your Turn: Real-World Turning Angle Estimation for Parkinson’s Disease Severity Assessment Qiushuo Cheng et.al. 2408.08182 null
2024-08-15 Polaris: Open-ended Interactive Robotic Manipulation via Syn2Real Visual Grounding and Large Language Models Tianyu Wang et.al. 2408.07975 null
2024-08-15 GOReloc: Graph-based Object-Level Relocalization for Visual SLAM Yutong Wang et.al. 2408.07917 link
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-12 UniT: Unified Tactile Representation for Robot Learning Zhengtong Xu et.al. 2408.06481 link
2024-08-12 Moo-ving Beyond Tradition: Revolutionizing Cattle Behavioural Phenotyping with Pose Estimation Techniques Navid Ghassemi et.al. 2408.06336 null
2024-08-12 CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments Yanpeng Jia et.al. 2408.05981 null
2024-08-12 PAFormer: Part Aware Transformer for Person Re-identification Hyeono Jung et.al. 2408.05918 null
2024-08-11 SABER-6D: Shape Representation Based Implicit Object Pose Estimation Shishir Reddy Vutukur et.al. 2408.05867 null
2024-08-10 Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis Zhongche Qu et.al. 2408.05635 null
2024-08-10 Anticipation through Head Pose Estimation: a preliminary study Federico Figari Tomenotti et.al. 2408.05516 null
2024-08-09 Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing Lennart Niecksch et.al. 2408.04979 null
2024-08-07 PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model Yunlong Huang et.al. 2408.03540 null
2024-08-06 Line-based 6-DoF Object Pose Estimation and Tracking With an Event Camera Zibin Liu et.al. 2408.03225 link
2024-08-06 Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW Elia Cereda et.al. 2408.03168 null
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-07 Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network Xinyi Zhang et.al. 2408.02922 null
2024-08-05 Analyzing Data Efficiency and Performance of Machine Learning Algorithms for Assessing Low Back Pain Physical Rehabilitation Exercises Aleksa Marusic et.al. 2408.02855 null
2024-08-05 Joint-Motion Mutual Learning for Pose Estimation in Videos Sifan Wu et.al. 2408.02285 null
2024-08-04 AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos Feichi Lu et.al. 2408.02110 null
2024-08-04 Generalized Maximum Likelihood Estimation for Perspective-n-Point Problem Tian Zhan et.al. 2408.01945 null
2024-08-03 MotionTrace: IMU-based Field of View Prediction for Smartphone AR Interactions Rahul Islam et.al. 2408.01850 null
2024-08-03 BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles Lun Luo et.al. 2408.01841 null
2024-08-03 E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images Yunshan Qi et.al. 2408.01840 null
2024-08-03 Survey on Emotion Recognition through Posture Detection and the possibility of its application in Virtual Reality Leina Elansary et.al. 2408.01728 null
2024-08-03 Stimulating Imagination: Towards General-purpose Object Rearrangement Jianyang Wu et.al. 2408.01655 null
2024-08-02 Full-range Head Pose Geometric Data Augmentations Huei-Chung Hu et.al. 2408.01566 null
2024-07-31 Adapting Skills to Novel Grasps: A Self-Supervised Approach Georgios Papagiannis et.al. 2408.00178 null

Visual Localization

Publish Date Title Authors PDF Code
2024-09-04 Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications Abby Stylianou et.al. 2409.03012 null
2024-09-04 NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval Sepanta Zeighami et.al. 2409.02343 link
2024-09-03 Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment Konstantin Schall et.al. 2409.01936 link
2024-09-02 A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches Kim Jinwoo et.al. 2409.01219 null
2024-09-02 Evidential Transformers for Improved Image Retrieval Danilo Dordevic et.al. 2409.01082 null
2024-09-05 EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System Bonan Liu et.al. 2409.00343 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-02 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095 null
2024-08-29 A compact neuromorphic system for ultra energy-efficient, on-device robot localization Adam D. Hines et.al. 2408.16754 link
2024-08-29 Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models Kengo Nakata et.al. 2408.16296 null
2024-08-28 Temporal Attention for Cross-View Sequential Image Localization Dong Yuan et.al. 2408.15569 null
2024-08-27 Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild Tianqi Wei et.al. 2408.14723 null
2024-08-25 LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task Ali Asgarov et.al. 2408.13909 link
2024-08-15 Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval Lifeng Zhou et.al. 2408.13705 null
2024-08-15 Coarse-to-fine Alignment Makes Better Speech-image Retrieval Lifeng Zhou et.al. 2408.13119 null
2024-08-21 FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization Son Tung Nguyen et.al. 2408.12037 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Xiangyu Zhao et.al. 2408.11305 link
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085 null
2024-08-19 BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval Zhenyu Lu et.al. 2408.10383 null
2024-08-23 Fashion Image-to-Image Translation for Complementary Item Retrieval Matteo Attimonelli et.al. 2408.09847 null
2024-08-20 MambaLoc: Efficient Camera Localisation via State Space Model Jialu Wang et.al. 2408.09680 null
2024-08-15 DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions Ryosuke Korekata et.al. 2408.07910 null
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-10 Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network Junyan Ye et.al. 2408.05475 link
2024-08-09 Spherical World-Locking for Audio-Visual Localization in Egocentric Videos Heeseung Yun et.al. 2408.05364 null
2024-08-06 AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval Pavel Suma et.al. 2408.03282 null
2024-08-05 CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration Gongxin Yao et.al. 2408.02394 null
2024-08-02 On Validation of Search & Retrieval of Tissue Images in Digital Pathology H. R. Tizhoosh et.al. 2408.01570 null
2024-07-31 VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning Yuhang Ming et.al. 2407.21416 null
2024-07-30 Re-localization acceleration with Medoid Silhouette Clustering Hongyi Zhang et.al. 2407.20749 null
2024-07-26 From 2D to 3D: AISG-SLA Visual Localization Challenge Jialin Gao et.al. 2407.18590 null
2024-07-24 Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Yongqi Li et.al. 2407.17274 null
2024-07-24 Pose Estimation from Camera Images for Underwater Inspection Luyuan Peng et.al. 2407.16961 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-19 Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization Yuehua Ding et.al. 2407.14643 null
2024-07-18 Visual Haystacks: Answering Harder Questions About Sets of Images Tsung-Han Wu et.al. 2407.13766 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899 null
2024-09-26 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-08-15 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672 null

2024-9

Pose Estimation

Publish Date Title Authors PDF Code
2024-10-03 Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place Recognition Nikolaos Stathoulopoulos et.al. 2410.02643 null
2024-10-03 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment Xingyu Ji et.al. 2410.01618 null
2024-10-02 SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network Ahmed Tawfik Aboukhadra et.al. 2410.01293 null
2024-10-01 Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models Jerry Yan et.al. 2410.01061 null
2024-10-01 RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations Kaichen Zhou et.al. 2410.00713 link
2024-10-01 GERA: Geometric Embedding for Efficient Point Registration Analysis Geng Li et.al. 2410.00589 null
2024-09-30 Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations Muhammad Saif Ullah Khan et.al. 2409.20469 null
2024-09-30 Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies Shalini Sarode et.al. 2409.20237 null
2024-09-30 PuzzleBoard: A New Camera Calibration Pattern with Position Encoding Peer Stelldinger et.al. 2409.20127 link
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111 null
2024-09-30 GearTrack: Automating 6D Pose Estimation Yu Deng et.al. 2409.19986 null
2024-09-29 PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and Beyond Chen Song et.al. 2409.19772 null
2024-09-29 GelSlim 4.0: Focusing on Touch and Reproducibility Andrea Sipos et.al. 2409.19770 null
2024-09-27 Robust Proximity Operations using Probabilistic Markov Models Deep Parikh et.al. 2409.19062 null
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-27 DynaWeightPnP: Toward global real-time 3D-2D solver in PnP without correspondences Jingwei Song et.al. 2409.18457 null
2024-09-26 Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation Mengchen Zhang et.al. 2409.18261 null
2024-09-26 AI-Powered Augmented Reality for Satellite Assembly, Integration and Test Alvaro Patricio et.al. 2409.18101 null
2024-09-27 Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body Shapes Katja Ludwig et.al. 2409.17671 null
2024-09-25 Safe Leaf Manipulation for Accurate Shape and Pose Estimation of Occluded Fruits Shaoxiong Yao et.al. 2409.17389 null
2024-09-25 Hierarchical Tri-manual Planning for Vision-assisted Fruit Harvesting with Quadrupedal Robots Zhichao Liu et.al. 2409.17116 null
2024-09-25 Self-Sensing for Proprioception and Contact Detection in Soft Robots Using Shape Memory Alloy Artificial Muscles Ran Jing et.al. 2409.17111 null
2024-09-25 Online 6DoF Pose Estimation in Forests using Cross-View Factor Graph Optimisation and Deep Learned Re-localisation Lucas Carvalho de Lima et.al. 2409.16680 null
2024-09-25 FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation Jingyi Tang et.al. 2409.16600 null
2024-09-25 Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots Masoud Dayani Najafabadi et.al. 2409.16595 null
2024-09-24 PseudoNeg-MAE: Self-Supervised Point Cloud Learning using Conditional Pseudo-Negative Embeddings Sutharsan Mahendren et.al. 2409.15832 null
2024-09-24 LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation Ruida Zhang et.al. 2409.15727 null
2024-09-23 Framework for Robust Localization of UUVs and Mapping of Net Pens David Botta et.al. 2409.15475 null
2024-09-23 FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera Guoyang Zhao et.al. 2409.15054 link
2024-09-23 BranchPoseNet: Characterizing tree branching with a deep learning-based pose estimation approach Stefano Puliti et.al. 2409.14755 link
2024-09-23 ERPoT: Effective and Reliable Pose Tracking for Mobile Robots Based on Lightweight and Compact Polygon Maps Haiming Gao et.al. 2409.14723 null
2024-09-22 Tactile Functasets: Neural Implicit Representations of Tactile Datasets Sikai Li et.al. 2409.14592 null
2024-09-22 AR Overlay: Training Image Pose Estimation on Curved Surface in a Synthetic Way Sining Huang et.al. 2409.14577 null
2024-09-22 DROP: Dexterous Reorientation via Online Planning Albert H. Li et.al. 2409.14562 null
2024-09-21 Combining Absolute and Semi-Generalized Relative Poses for Visual Localization Vojtech Panek et.al. 2409.14269 null
2024-09-18 SpotLight: Robotic Scene Understanding through Interaction and Affordance Detection Tim Engelbracht et.al. 2409.11870 null
2024-09-18 End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation Thomas Pöllabauer et.al. 2409.11819 null
2024-09-18 Bridging Domain Gap for Flight-Ready Spaceborne Vision Tae Ha Park et.al. 2409.11661 null
2024-09-17 Good Grasps Only: A data engine for self-supervised fine-tuning of pose estimation using grasp poses for verification Frederik Hagelskjær et.al. 2409.11512 null
2024-09-17 Training Datasets Generation for Machine Learning: Application to Vision Based Navigation Jérémy Lebreton et.al. 2409.11383 null
2024-09-17 OmniGen: Unified Image Generation Shitao Xiao et.al. 2409.11340 link
2024-09-17 ULOC: Learning to Localize in Complex Large-Scale Environments with Ultra-Wideband Ranges Thien-Minh Nguyen et.al. 2409.11122 link
2024-09-17 Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB Alessandro Simoni et.al. 2409.11104 null
2024-09-21 HGSLoc: 3DGS-based Heuristic Camera Pose Refinement Zhongyan Niu et.al. 2409.10925 null
2024-09-17 Pose estimation of CubeSats via sensor fusion and Error-State Extended Kalman Filter Deep Parikh et.al. 2409.10815 null
2024-09-16 CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera Jingpei Lu et.al. 2409.10441 null
2024-09-16 HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models Vineet Bhat et.al. 2409.10419 null
2024-09-16 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? Téo Guichoux et.al. 2409.10357 null
2024-09-16 Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference Huy-Dung Nguyen et.al. 2409.10095 null
2024-09-15 Precise Pick-and-Place using Score-Based Diffusion Networks Shih-Wei Guo et.al. 2409.09725 null
2024-09-15 Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild Nie Lin et.al. 2409.09714 null
2024-09-15 Proximity operations of CubeSats via sensor fusion of ultra-wideband range measurements with rate gyroscopes, accelerometers and monocular vision Deep Parikh et.al. 2409.09665 null
2024-09-15 A Scalable Tabletop Satellite Automation Testbed:Design And Experiments Deep Parikh et.al. 2409.09633 null
2024-09-14 MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry Yuheng Qiu et.al. 2409.09479 null
2024-09-14 Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM Haoying Li et.al. 2409.09410 null
2024-09-13 Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry Yunus Bilge Kurt et.al. 2409.08769 link
2024-09-13 WheelPoser: Sparse-IMU Based Body Pose Estimation for Wheelchair Users Yunzhi Li et.al. 2409.08494 null
2024-09-12 Bayesian Inverse Graphics for Few-Shot Concept Learning Octavio Arriaga et.al. 2409.08351 null
2024-09-12 Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation Samanta Rodriguez et.al. 2409.08269 null
2024-09-12 Covariance Intersection-based Invariant Kalman Filtering(DInCIKF) for Distributed Pose Estimation Haoying Li et.al. 2409.07933 null
2024-09-12 GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions Liang Feng et.al. 2409.07798 null
2024-09-12 GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution Liang Feng et.al. 2409.07752 null
2024-09-11 FaVoR: Features via Voxel Rendering for Camera Relocalization Vincenzo Polizzi et.al. 2409.07571 null
2024-09-11 Benchmarking 2D Egocentric Hand Pose Datasets Olga Taran et.al. 2409.07337 null
2024-09-11 iKalibr-RGBD: Partially-Specialized Target-Free Visual-Inertial Spatiotemporal Calibration For RGBDs via Continuous-Time Velocity Estimation Shuolong Chen et.al. 2409.07116 link
2024-09-11 Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry Anbo Tao et.al. 2409.06948 null
2024-09-13 A Bayesian framework for active object recognition, pose estimation and shape transfer learning through touch Haodong Zheng et.al. 2409.06912 null
2024-09-11 Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences Shishir Reddy Vutukur et.al. 2409.06683 null
2024-09-10 PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation Ginger Delmas et.al. 2409.06535 null
2024-09-10 Test-Time Certifiable Self-Supervision to Bridge the Sim2Real Gap in Event-Based Satellite Pose Estimation Mohsi Jawaid et.al. 2409.06240 null
2024-09-09 From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models Tessa Pulli et.al. 2409.05413 null
2024-09-08 HelmetPoser: A Helmet-Mounted IMU Dataset for Data-Driven Estimation of Human Head Motion in Diverse Conditions Jianping Li et.al. 2409.05006 null
2024-09-06 Casper DPM: Cascaded Perceptual Dynamic Projection Mapping onto Hands Yotam Erel et.al. 2409.04397 null
2024-09-06 GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers Lorenza Prospero et.al. 2409.04196 null
2024-09-06 Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics Woojin Cho et.al. 2409.04033 null
2024-09-06 Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments Therese Joseph et.al. 2409.03998 null
2024-09-09 The Influence of Faulty Labels in Data Sets on Human Pose Estimation Arnold Schwarz et.al. 2409.03887 null
2024-09-05 MaskVal: Simple but Effective Uncertainty Quantification for 6D Pose Estimation Philipp Quentin et.al. 2409.03556 null
2024-09-05 UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking Md. Mahfuzur Rahman et.al. 2409.03245 null
2024-09-01 Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach Wenjun Huang et.al. 2409.02715 null
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581 null
2024-09-03 EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision Yiming Zhao et.al. 2409.02224 null
2024-09-03 Deep learning for objective estimation of Parkinsonian tremor severity Felipe Duque-Quiceno et.al. 2409.02011 null
2024-09-03 SPiKE: 3D Human Pose from Point Cloud Sequences Irene Ballester et.al. 2409.01879 link
2024-09-02 Kalman Filtering for Precise Indoor Position and Orientation Estimation Using IMU and Acoustics on Riemannian Manifolds Mohammed H. AlSharif et.al. 2409.01002 null
2024-09-01 Detection, Recognition and Pose Estimation of Tabletop Objects Sanjuksha Nirgude et.al. 2409.00869 null
2024-09-01 DSLO: Deep Sequence LiDAR Odometry Based on Inconsistent Spatio-temporal Propagation Huixin Zhang et.al. 2409.00744 link
2024-09-01 MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds Ziqiang Dang et.al. 2409.00736 null
2024-08-31 ActionPose: Pretraining 3D Human Pose Estimation with the Dark Knowledge of Action Longyun Liao et.al. 2409.00449 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-08-30 BOP-D: Revisiting 6D Pose Estimation Benchmark for Better Evaluation under Visual Ambiguities Boris Meden et.al. 2408.17297 null
2024-08-30 EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs Zhen Fan et.al. 2408.17168 null
2024-09-01 Generic Objects as Pose Probes for Few-Shot View Synthesis Zhirui Gao et.al. 2408.16690 null
2024-08-29 OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation Yuchen Che et.al. 2408.16547 link
2024-08-29 GRPose: Learning Graph Relations for Human Image Generation with Pose Priors Xiangchen Yin et.al. 2408.16540 null
2024-08-28 Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators Nikita Kister et.al. 2408.16536 null
2024-08-28 Multi-view Pose Fusion for Occlusion-Aware 3D Human Pose Estimation Laura Bragagnolo et.al. 2408.15810 link
2024-08-30 Addressing the challenges of loop detection in agricultural environments Nicolás Soncini et.al. 2408.15761 link
2024-08-28 Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph Zherong Zhang et.al. 2408.15750 null

Visual Localization

Publish Date Title Authors PDF Code
2024-10-07 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-23 CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis Xiang Zhang et.al. 2409.15169 null
2024-09-21 Combining Absolute and Semi-Generalized Relative Poses for Visual Localization Vojtech Panek et.al. 2409.14269 null
2024-09-21 SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality Hongjia Zhai et.al. 2409.14067 null
2024-09-20 Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval Morris Florek et.al. 2409.13513 link
2024-09-18 Towards Global Localization using Multi-Modal Object-Instance Re-Identification Aneesh Chavan et.al. 2409.12002 link
2024-09-17 Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information Kunal Chelani et.al. 2409.11536 null
2024-09-17 Improving the Efficiency of Visually Augmented Language Models Paula Ontalvilla et.al. 2409.11148 null
2024-09-21 HGSLoc: 3DGS-based Heuristic Camera Pose Refinement Zhongyan Niu et.al. 2409.10925 null
2024-09-16 SOLVR: Submap Oriented LiDAR-Visual Re-Localisation Joshua Knights et.al. 2409.10247 null
2024-09-16 Garment Attribute Manipulation with Multi-level Attention Vittorio Casula et.al. 2409.10206 null
2024-09-14 Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval Amirreza Mahbod et.al. 2409.09430 link
2024-09-12 Structured Pruning for Efficient Visual Place Recognition Oliver Grainge et.al. 2409.07834 null
2024-09-10 GeoCalib: Learning Single-image Calibration with Geometric Optimization Alexander Veicht et.al. 2409.06704 link
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471 link
2024-09-10 A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions Zhicong Wu et.al. 2409.06381 null
2024-09-09 Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding Bram Willemsen et.al. 2409.05721 link
2024-09-09 Open-World Dynamic Prompt and Continual Visual Representation Learning Youngeun Kim et.al. 2409.05312 null
2024-09-12 Training-free ZS-CIR via Weighted Modality Fusion and Similarity Ren-Di Wu et.al. 2409.04918 null
2024-09-12 Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models Saghir Alfasly et.al. 2409.04631 null
2024-09-06 Reprojection Errors as Prompts for Efficient Scene Coordinate Regression Ting-Ru Liu et.al. 2409.04178 null
2024-09-06 Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments Therese Joseph et.al. 2409.03998 null
2024-09-04 Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications Abby Stylianou et.al. 2409.03012 null
2024-09-04 NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval Sepanta Zeighami et.al. 2409.02343 link
2024-09-03 Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment Konstantin Schall et.al. 2409.01936 link
2024-09-02 A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches Kim Jinwoo et.al. 2409.01219 null
2024-09-02 Evidential Transformers for Improved Image Retrieval Danilo Dordevic et.al. 2409.01082 null
2024-09-05 EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System Bonan Liu et.al. 2409.00343 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-02 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095 null
2024-08-29 A compact neuromorphic system for ultra energy-efficient, on-device robot localization Adam D. Hines et.al. 2408.16754 link
2024-08-29 Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models Kengo Nakata et.al. 2408.16296 null
2024-08-28 Temporal Attention for Cross-View Sequential Image Localization Dong Yuan et.al. 2408.15569 null
2024-08-27 Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild Tianqi Wei et.al. 2408.14723 null
2024-08-25 LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task Ali Asgarov et.al. 2408.13909 link
2024-08-21 FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization Son Tung Nguyen et.al. 2408.12037 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Xiangyu Zhao et.al. 2408.11305 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-10-18 Sim2real Cattle Joint Estimation in 3D point clouds Okour Mohammad et.al. 2410.14419 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155 null
2024-10-08 Unsupervised Model Diagnosis Yinong Oliver Wang et.al. 2410.06243 null
2024-10-08 Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration Xueyang Kang et.al. 2410.05729 link
2024-10-16 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899 null
2024-10-07 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-10-01 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link

2024-10

Keypoint Detection

Publish Date Title Authors PDF Code
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 null
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422 null
2024-11-23 OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin et.al. 2411.15653 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676 null
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-11-04 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang et.al. 2411.01846 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-04 Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation Aman Anand et.al. 2410.14700 null
2024-11-27 Sim2real Cattle Joint Estimation in 3D point clouds Mohammad Okour et.al. 2410.14419 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155 null
2024-10-08 Unsupervised Model Diagnosis Yinong Oliver Wang et.al. 2410.06243 null
2024-10-08 Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration Xueyang Kang et.al. 2410.05729 link
2024-10-16 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899 null
2024-10-07 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-10-01 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link

Pose Estimation

Publish Date Title Authors PDF Code
2024-11-06 GS2Pose: Tow-stage 6D Object Pose Estimation Guided by Gaussian Splatting Jilan Mei et.al. 2411.03807 null
2024-11-06 Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage Claus D. Hansen et.al. 2411.03724 null
2024-11-05 Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data Seunggeun Chi et.al. 2411.03561 null
2024-11-05 HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features Arnab Dey et.al. 2411.03086 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-11-03 Activating Self-Attention for Multi-Scene Absolute Pose Regression Miso Lee et.al. 2411.01443 link
2024-11-04 3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction Jongmin Lee et.al. 2411.00543 null
2024-10-31 Whole-Herd Elephant Pose Estimation from Drone Data for Collective Behavior Analysis Brody McNutt et.al. 2411.00196 null
2024-10-31 No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images Botao Ye et.al. 2410.24207 link
2024-11-06 SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation Aditya Agarwal et.al. 2410.23643 null
2024-10-30 SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark HyunJun Jung et.al. 2410.22715 null
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-29 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting Sunghwan Hong et.al. 2410.22128 link
2024-10-29 HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation Zhoujie Xu et.al. 2410.22079 null
2024-10-29 EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data Zhonghua Yi et.al. 2410.21743 null
2024-10-28 Synthetica: Large Scale Synthetic Data for Robot Perception Ritvik Singh et.al. 2410.21153 null
2024-10-29 BLAPose: Enhancing 3D Human Pose Estimation with Bone Length Adjustment Chih-Hsiang Hsu et.al. 2410.20731 link
2024-11-01 RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior Mingjiang Liang et.al. 2410.20358 null
2024-10-27 Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions Rawal Khirodkar et.al. 2410.20294 null
2024-10-26 Neural Fields in Robotics: A Survey Muhammad Zubair Irshad et.al. 2410.20220 null
2024-10-25 DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems Muhammad Zaeem Shahzad et.al. 2410.19336 null
2024-10-24 Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction Junyi Chen et.al. 2410.18962 null
2024-10-24 VoxelKeypointFusion: Generalizable Multi-View Multi-Person Pose Estimation Daniel Bermuth et.al. 2410.18723 null
2024-10-23 Robust Two-View Geometry Estimation with Implicit Differentiation Vladislav Pyatov et.al. 2410.17983 link
2024-10-23 YOLOv11: An Overview of the Key Architectural Enhancements Rahima Khanam et.al. 2410.17725 null
2024-10-21 Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers Andrea Berra et.al. 2410.15802 null
2024-10-21 ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos Tao Tang et.al. 2410.15582 link
2024-10-20 Neural Active Structure-from-Motion in Dark and Textureless Environment Kazuto Ichimaru et.al. 2410.15378 null
2024-10-20 POSE: Pose estimation Of virtual Sync Exhibit system Hao-Tang Tsui et.al. 2410.15343 link
2024-10-18 Graph Optimality-Aware Stochastic LiDAR Bundle Adjustment with Progressive Spatial Smoothing Jianping Li et.al. 2410.14565 null
2024-10-18 Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior Calvin-Khang Ta et.al. 2410.14540 null
2024-10-18 Sim2real Cattle Joint Estimation in 3D point clouds Okour Mohammad et.al. 2410.14419 null
2024-10-18 Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping Renguang Chen et.al. 2410.14161 null
2024-10-15 From Real Artifacts to Virtual Reference: A Robust Framework for Translating Endoscopic Images unyang Wu et.al. 2410.13896 null
2024-10-17 DualQuat-LOAM: LiDAR Odometry and Mapping parametrized on Dual Quaternions Edison P. Velasco-Sánchez et.al. 2410.13541 null
2024-10-17 Object Pose Estimation Using Implicit Representation For Transparent Objects Varun Burde et.al. 2410.13465 null
2024-10-16 Optimizing Multi-Task Learning for Accurate Spacecraft Pose Estimation Francesco Evangelisti et.al. 2410.12679 null
2024-10-15 Contrastive Touch-to-Touch Pretraining Samanta Rodriguez et.al. 2410.11834 null
2024-10-18 X-Fi: A Modality-Invariant Foundation Model for Multimodal Human Sensing Xinyan Chen et.al. 2410.10167 null
2024-10-13 Occluded Human Pose Estimation based on Limb Joint Augmentation Gangtao Han et.al. 2410.09885 null
2024-10-12 Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors Hritam Basak et.al. 2410.09467 null
2024-10-12 Towards Multi-Modal Animal Pose Estimation: An In-Depth Analysis Qianyi Deng et.al. 2410.09312 link
2024-10-11 CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation Jianyu Zhao et.al. 2410.09010 link
2024-10-11 Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization Christian Schmidt et.al. 2410.08743 link
2024-10-10 Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation Felix Petersen et.al. 2410.08125 null
2024-10-10 Robotic framework for autonomous manipulation of laboratory equipment with different degrees of transparency via 6D pose estimation Maria Makarova et.al. 2410.07801 null
2024-10-10 Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos Cuong Le et.al. 2410.07795 link
2024-10-12 Autonomous Driving in Unstructured Environments: How Far Have We Come? Chen Min et.al. 2410.07701 null
2024-10-10 Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks Minxing Zhang et.al. 2410.07670 null
2024-10-09 OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB Yunzhi Lin et.al. 2410.06694 null
2024-10-08 SpecTrack: Learned Multi-Rotation Tracking via Speckle Imaging Ziyang Chen et.al. 2410.06028 null
2024-10-08 AIVIO: Closed-loop, Object-relative Navigation of UAVs with AI-aided Visual Inertial Odometry Thomas Jantos et.al. 2410.05996 null
2024-10-08 Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? Charalambos Tzamos et.al. 2410.05984 link
2024-10-08 FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance Ruocheng Wang et.al. 2410.05791 null
2024-10-07 Comparison of marker-less 2D image-based methods for infant pose estimation Lennart Jahn et.al. 2410.04980 null
2024-10-06 Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion Mehwish Ghafoor et.al. 2410.04574 link
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-05 Test-Time Adaptation for Keypoint-Based Spacecraft Pose Estimation Based on Predicted-View Synthesis Juan Ignacio Bravo Pérez-Villar et.al. 2410.04298 link
2024-10-05 A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems Nikola Radulov et.al. 2410.04242 link
2024-10-04 Unsupervised Prior Learning: Discovering Categorical Pose Priors from Videos Ziyu Wang et.al. 2410.03858 null
2024-10-04 Universal Global State Estimation for Inertial Navigation Systems Sifeddine Benahmed et.al. 2410.03846 null
2024-10-04 MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Junyi Zhang et.al. 2410.03825 null
2024-10-04 Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images Ci Li et.al. 2410.03438 null
2024-10-04 HRVMamba: High-Resolution Visual State Space Model for Dense Prediction Hao Zhang et.al. 2410.03174 null
2024-10-04 CLIP-Clique: Graph-based Correspondence Matching Augmented by Vision Language Models for Object-based Global Localization Shigemichi Matsuzaki et.al. 2410.03054 null
2024-10-03 Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place Recognition Nikolaos Stathoulopoulos et.al. 2410.02643 null
2024-10-03 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment Xingyu Ji et.al. 2410.01618 null
2024-10-02 SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network Ahmed Tawfik Aboukhadra et.al. 2410.01293 null
2024-10-01 Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models Jerry Yan et.al. 2410.01061 null
2024-10-01 RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations Kaichen Zhou et.al. 2410.00713 link
2024-10-01 GERA: Geometric Embedding for Efficient Point Registration Analysis Geng Li et.al. 2410.00589 null
2024-09-30 Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations Muhammad Saif Ullah Khan et.al. 2409.20469 null
2024-09-30 Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies Shalini Sarode et.al. 2409.20237 null
2024-09-30 PuzzleBoard: A New Camera Calibration Pattern with Position Encoding Peer Stelldinger et.al. 2409.20127 link
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111 null
2024-09-30 GearTrack: Automating 6D Pose Estimation Yu Deng et.al. 2409.19986 null

Visual Localization

Publish Date Title Authors PDF Code
2024-11-05 From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing Xintian Sun et.al. 2411.05826 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545 null
2024-11-11 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Edward Vendrow et.al. 2411.02537 link
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-11-03 Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification MD Shaikh Rahman et.al. 2411.01473 null
2024-11-01 Identifying Implicit Social Biases in Vision-Language Models Kimia Hamidieh et.al. 2411.00997 null
2024-10-31 Nearest Neighbor Normalization Improves Multimodal Retrieval Neil Chowdhury et.al. 2410.24114 link
2024-10-31 MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval Haiwen Li et.al. 2410.23736 null
2024-10-30 Decoupling Semantic Similarity from Spatial Alignment for Neural Networks Tassilo Wald et.al. 2410.23107 null
2024-10-29 Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications Monica Riedler et.al. 2410.21943 link
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 null
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval Zijia Zhao et.al. 2410.18715 link
2024-10-25 On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features Tomáš Pivoňka et.al. 2410.18573 null
2024-10-22 Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2410.17393 null
2024-10-20 GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Haiwen Diao et.al. 2410.15266 link
2024-10-19 Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway’s Digitised Book Collection Marie Roald et.al. 2410.14969 link
2024-10-16 Development of Image Collection Method Using YOLO and Siamese Network Chan Young Shin et.al. 2410.12561 null
2024-10-16 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment Juelin Zhu et.al. 2410.12269 link
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-16 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-11 A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification Eugene P. W. Ang et.al. 2410.08456 null
2024-10-10 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Hoin Jung et.al. 2410.07593 null
2024-10-09 Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval Mohammad Omama et.al. 2410.07022 null
2024-10-09 Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers Stephen Hausler et.al. 2410.06614 null
2024-10-09 MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel C. F. Codella et.al. 2410.06542 null
2024-10-08 Temporal Image Caption Retrieval Competition – Description and Results Jakub Pokrywka et.al. 2410.06314 null
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165 null
2024-10-08 Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Ayush Singh et.al. 2410.05928 null
2024-10-08 RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps Minsoo Kim et.al. 2410.05621 null
2024-10-11 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-23 CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis Xiang Zhang et.al. 2409.15169 null

2024-11

Pose Estimation

Publish Date Title Authors PDF Code
2024-11-29 Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling Qirui Wu et.al. 2411.19492 null
2024-11-29 Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning Yang You et.al. 2411.19458 null
2024-11-28 GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model Rui Zhou et.al. 2411.19289 null
2024-11-28 HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos Prithviraj Banerjee et.al. 2411.19167 null
2024-11-28 Lost & Found: Updating Dynamic 3D Scene Graphs from Egocentric Observations Tjark Behrens et.al. 2411.19162 null
2024-11-28 Distributed Dual Quaternion Extended Kalman Filtering for Spacecraft Pose Estimation Mathias Hudoba de Badyn et.al. 2411.19033 null
2024-11-28 Waterfall Transformer for Multi-person Pose Estimation Navin Ranjan et.al. 2411.18944 null
2024-12-02 AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Sherwin Bahmani et.al. 2411.18673 null
2024-11-27 XR-MBT: Multi-modal Full Body Tracking for XR through Self-Supervision with Learned Depth Point Cloud Registration Denys Rozumnyi et.al. 2411.18377 null
2024-11-26 Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors Ziang Xu et.al. 2411.17790 null
2024-11-26 Geometric Point Attention Transformer for 3D Shape Reassembly Jiahan Li et.al. 2411.17788 null
2024-11-26 RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training Raktim Gautam Goswami et.al. 2411.17662 null
2024-11-26 Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles Susu Fang et.al. 2411.17432 null
2024-11-26 Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration Junyuan Deng et.al. 2411.17240 link
2024-11-27 SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting Gyeongjin Kang et.al. 2411.17190 null
2024-11-26 GMFlow: Global Motion-Guided Recurrent Flow for 6D Object Pose Estimation Xin Liu et.al. 2411.17174 null
2024-11-25 Diffusion Features for Zero-Shot 6DoF Object Pose Estimation Bernd Von Gimborn et.al. 2411.16668 null
2024-11-25 Edge Weight Prediction For Category-Agnostic Pose Estimation Or Hirschorn et.al. 2411.16665 link
2024-11-25 SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis Hyojun Go et.al. 2411.16443 link
2024-11-25 One Diffusion to Generate Them All Duong H. Le et.al. 2411.16318 link
2024-11-25 UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image Xingyu Liu et.al. 2411.16106 null
2024-11-24 Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching Yujing Sun et.al. 2411.15860 link
2024-11-24 PEnG: Pose-Enhanced Geo-Localisation Tavis Shore et.al. 2411.15742 null
2024-11-22 Personalization of Wearable Sensor-Based Joint Kinematic Estimation Using Computer Vision for Hip Exoskeleton Applications Changseob Song et.al. 2411.15366 null
2024-11-22 mmWave Radar for Sit-to-Stand Analysis: A Comparative Study with Wearables and Kinect Shuting Hu et.al. 2411.14656 null
2024-11-21 DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Tianhe Ren et.al. 2411.14347 link
2024-11-21 SEMPose: A Single End-to-end Network for Multi-object Pose Estimation Xin Liu et.al. 2411.14002 null
2024-11-21 Dehazing-aided Multi-Rate Multi-Modal Pose Estimation Framework for Mitigating Visual Disturbances in Extreme Underwater Domain Vidya Sudevan et.al. 2411.13988 null
2024-11-21 Hybrid-Neuromorphic Approach for Underwater Robotics Applications: A Conceptual Framework Vidya Sudevan et.al. 2411.13962 null
2024-11-20 Developing Normative Gait Cycle Parameters for Clinical Analysis Using Human Pose Estimation Rahm Ranjan et.al. 2411.13716 null
2024-11-20 Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction Yi Gu et.al. 2411.13620 null
2024-11-19 VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference Seong Jong Yoo et.al. 2411.13607 link
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-20 X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation Yuchen Yang et.al. 2411.13026 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-19 GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping Teli Ma et.al. 2411.12286 null
2024-11-18 IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos Yunong Liu et.al. 2411.11409 link
2024-11-15 USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting Kang Chen et.al. 2411.10504 link
2024-11-13 ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening Hojun Jang et.al. 2411.09435 null
2024-11-13 Generalized Pose Space Embeddings for Training In-the-Wild using Anaylis-by-Synthesis Dominik Borer et.al. 2411.08603 null
2024-11-13 DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization Yueming Xu et.al. 2411.08373 null
2024-11-16 RINO: Accurate, Robust Radar-Inertial Odometry with Non-Iterative Estimation Shuocheng Yang et.al. 2411.07699 link
2024-11-12 Human Arm Pose Estimation with a Shoulder-worn Force-Myography Device for Human-Robot Interaction Rotem Atari et.al. 2411.07644 null
2024-11-12 Towards Seamless Integration of Magnetic Tracking into Fluoroscopy-guided Interventions Shuwei Xing et.al. 2411.07495 null
2024-11-08 Acoustic-based 3D Human Pose Estimation Robust to Human Position Yusuke Oumi et.al. 2411.07165 null
2024-11-11 CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models Junho Kim et.al. 2411.06869 null
2024-11-11 GenZ-ICP: Generalizable and Degeneracy-Robust LiDAR Odometry Using an Adaptive Weighting Daehan Lee et.al. 2411.06766 null
2024-11-11 GTA-Net: An IoT-Integrated 3D Human Pose Estimation System for Real-Time Adolescent Sports Posture Correction Shizhe Yuan et.al. 2411.06725 null
2024-11-10 Magnetic Field Aided Vehicle Localization with Acceleration Correction Mrunmayee Deshpande et.al. 2411.06543 null
2024-11-10 Visuotactile-Based Learning for Insertion with Compliant Hands Osher Azulay et.al. 2411.06408 null
2024-11-08 Poze: Sports Technique Feedback under Data Constraints Agamdeep Singh et.al. 2411.05734 null
2024-11-08 DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions Rafael Berral-Soler et.al. 2411.05552 link
2024-11-08 Tightly-Coupled, Speed-aided Monocular Visual-Inertial Localization in Topological Map Chanuk Yang et.al. 2411.05497 null
2024-11-08 Relative Pose Estimation for Nonholonomic Robot Formation with UWB-IO Measurements Kunrui Ze et.al. 2411.05481 null
2024-11-07 Social EgoMesh Estimation Luca Scofano et.al. 2411.04598 link
2024-11-07 Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player’s Trajectory Ali K. AlShami et.al. 2411.04501 null
2024-11-07 SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation Xun Tu et.al. 2411.04386 null
2024-11-08 GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting Jilan Mei et.al. 2411.03807 null
2024-11-06 Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage Claus D. Hansen et.al. 2411.03724 null
2024-11-05 Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data Seunggeun Chi et.al. 2411.03561 null
2024-11-05 HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features Arnab Dey et.al. 2411.03086 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-11-03 Activating Self-Attention for Multi-Scene Absolute Pose Regression Miso Lee et.al. 2411.01443 link
2024-11-04 3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction Jongmin Lee et.al. 2411.00543 null
2024-10-31 Whole-Herd Elephant Pose Estimation from Drone Data for Collective Behavior Analysis Brody McNutt et.al. 2411.00196 null
2024-10-31 No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images Botao Ye et.al. 2410.24207 link
2024-11-06 SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation Aditya Agarwal et.al. 2410.23643 null
2024-10-30 SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark HyunJun Jung et.al. 2410.22715 null
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-29 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting Sunghwan Hong et.al. 2410.22128 link
2024-10-29 HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation Zhoujie Xu et.al. 2410.22079 null
2024-10-29 EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data Zhonghua Yi et.al. 2410.21743 null
2024-10-28 Synthetica: Large Scale Synthetic Data for Robot Perception Ritvik Singh et.al. 2410.21153 null
2024-10-29 BLAPose: Enhancing 3D Human Pose Estimation with Bone Length Adjustment Chih-Hsiang Hsu et.al. 2410.20731 link
2024-11-01 RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior Mingjiang Liang et.al. 2410.20358 null

Visual Localization

Publish Date Title Authors PDF Code
2024-12-06 DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification Ying Jin et.al. 2412.04828 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512 null
2024-12-04 Composed Image Retrieval for Training-Free Domain Conversion Nikos Efthymiadis et.al. 2412.03297 link
2024-12-03 A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration Thulio Amorim et.al. 2412.02881 null
2024-12-03 Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar et.al. 2412.02310 link
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features MD Shaikh Rahman et.al. 2412.01555 null
2024-12-02 Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models Yi Liao et.al. 2412.01202 null
2024-12-01 EDTformer: An Efficient Decoder Transformer for Visual Place Recognition Tong Jin et.al. 2412.00784 null
2024-11-28 EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval Muhammad Huzaifa et.al. 2412.00139 null
2024-11-28 Unleashing the Power of Data Synthesis in Visual Localization Sihang Li et.al. 2412.00138 null
2024-11-28 Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval Yang Liu et.al. 2412.00120 null
2024-11-29 A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications Liqiang Zhang Ye Tian Dongyan Wei et.al. 2411.19845 null
2024-11-27 Optimizing Image Retrieval with an Extended b-Metric Space Abdelkader Belhenniche et.al. 2411.18800 null
2024-11-26 Learning Visual Hierarchies with Hyperbolic Embeddings Ziwei Wang et.al. 2411.17490 null
2024-12-02 Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy You Li et.al. 2411.16752 null
2024-12-02 AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks You Li et.al. 2411.16749 null
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171 link
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-22 Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval Zengbao Sun et.al. 2411.14704 null
2024-11-20 Globally Correlation-Aware Hard Negative Generation Wenjie Peng et.al. 2411.13145 link
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665 link
2024-11-13 Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval Saul Santos et.al. 2411.08590 link
2024-11-22 Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments Ashkan Nejad et.al. 2411.08567 link
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-05 From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing Xintian Sun et.al. 2411.05826 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545 null
2024-11-11 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Edward Vendrow et.al. 2411.02537 link
2024-11-20 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-11-03 Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification MD Shaikh Rahman et.al. 2411.01473 null
2024-11-01 Identifying Implicit Social Biases in Vision-Language Models Kimia Hamidieh et.al. 2411.00997 null
2024-10-31 Nearest Neighbor Normalization Improves Multimodal Retrieval Neil Chowdhury et.al. 2410.24114 link
2024-10-31 MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval Haiwen Li et.al. 2410.23736 null
2024-10-30 Decoupling Semantic Similarity from Spatial Alignment for Neural Networks Tassilo Wald et.al. 2410.23107 null
2024-10-29 Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications Monica Riedler et.al. 2410.21943 link
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 null
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link

Keypoint Detection

Publish Date Title Authors PDF Code
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-19 Corn Ear Detection and Orientation Estimation Using Deep Learning Nathan Sprague et.al. 2412.14954 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 link
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422 null
2024-11-23 OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin et.al. 2411.15653 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676 null
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-11-04 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang et.al. 2411.01846 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-04 Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation Aman Anand et.al. 2410.14700 null
2024-11-27 Sim2real Cattle Joint Estimation in 3D point clouds Mohammad Okour et.al. 2410.14419 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155 null
2024-10-08 Unsupervised Model Diagnosis Yinong Oliver Wang et.al. 2410.06243 null
2024-10-08 Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration Xueyang Kang et.al. 2410.05729 link

2024-12

Pose Estimation

Publish Date Title Authors PDF Code
2025-01-03 TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation Jiajie Liu et.al. 2501.01770 null
2025-01-03 Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery Baoru Huang et.al. 2501.01752 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild Soumyaratna Debnath et.al. 2501.01174 null
2024-12-31 Relative Pose Observability Analysis Using Dual Quaternions Nicholas B. Andrews et.al. 2501.00657 null
2024-12-31 VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception Zhaoliang Wan et.al. 2501.00510 null
2024-12-30 Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields Evgenii Kruzhkov et.al. 2412.20976 null
2024-12-30 ReFlow6D: Refraction-Guided Transparent Object 6D Pose Estimation via Intermediate Representation Learning Hrishikesh Gupta et.al. 2412.20830 link
2024-12-30 Frequency-aware Event Cloud Network Hongwei Ren et.al. 2412.20803 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767 null
2024-12-30 Towards nation-wide analytical healthcare infrastructures: A privacy-preserving augmented knee rehabilitation case study Boris Bačić et.al. 2412.20733 null
2024-12-29 Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation Qucheng Peng et.al. 2412.20538 link
2024-12-28 MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing Shuo Wang et.al. 2412.20082 null
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056 link
2024-12-27 Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation Guangsheng Xu et.al. 2412.19676 link
2024-12-27 Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images Xudong Cai et.al. 2412.19518 null
2024-12-26 Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos Changwoon Choi et.al. 2412.19089 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 null
2024-12-22 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry Zhaoxing Zhang et.al. 2412.16923 null
2024-12-21 EasyVis2: A Real Time Multi-view 3D Visualization for Laparoscopic Surgery Training Enhanced by a Deep Neural Network YOLOv8-Pose Yung-Hong Sun et.al. 2412.16742 null
2024-12-21 FACTS: Fine-Grained Action Classification for Tactical Sports Christopher Lai et.al. 2412.16454 null
2024-12-20 Can Generative Video Models Help Pose Estimation? Ruojin Cai et.al. 2412.16155 null
2024-12-20 Monkey Transfer Learning Can Improve Human Pose Estimation Bradley Scott et.al. 2412.15966 null
2024-12-19 Scaling 4D Representations João Carreira et.al. 2412.15212 null
2024-12-13 IMPROVE: Impact of Mobile Phones on Remote Online Virtual Education Roberto Daza et.al. 2412.14195 link
2024-12-18 Level-Set Parameters: Novel Representation for 3D Shape Analysis Huan Lei et.al. 2412.13502 null
2024-12-18 Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation Xiaoqi An et.al. 2412.13454 null
2024-12-17 CondiMen: Conditional Multi-Person Mesh Recovery Brégier Romain et.al. 2412.13058 null
2024-12-17 ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries Wangyu Xue et.al. 2412.12675 null
2024-12-16 Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion Adam Bethell et.al. 2412.11420 null
2024-12-13 ExeChecker: Where Did I Go Wrong? Yiwen Gu et.al. 2412.10573 null
2024-12-11 CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty Harry Zhang et.al. 2412.10431 null
2024-12-13 RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting Lizhi Bai et.al. 2412.09868 null
2024-12-12 Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos Linyi Jin et.al. 2412.09621 null
2024-12-12 FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction Jiale Xu et.al. 2412.09573 null
2024-12-11 BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation Shengze Wang et.al. 2412.08640 null
2024-12-12 Drift-free Visual SLAM using Digital Twins Roxane Merat et.al. 2412.08496 null
2024-12-11 Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Siyan Dong et.al. 2412.08376 null
2024-12-10 LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models Ziqi Lu et.al. 2412.07746 null
2024-12-09 MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds Zhenggang Tang et.al. 2412.06974 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 Attention-Enhanced Lightweight Hourglass Network for Human Pose Estimation Marsha Mariya Kappan et.al. 2412.06227 null
2024-12-06 CCS: Continuous Learning for Customized Incremental Wireless Sensing Services Qunhang Fu et.al. 2412.04821 null
2024-12-05 ProPLIKS: Probablistic 3D human body pose estimation Karthik Shetty et.al. 2412.04665 null
2024-12-05 DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction Ben Kaye et.al. 2412.04464 null
2024-12-05 Targeted Hard Sample Synthesis Based on Estimated Pose and Occlusion Error for Improved Object Pose Estimation Alan Li et.al. 2412.04279 null
2024-12-04 Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis Qitao Zhao et.al. 2412.03570 null
2024-12-06 NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Lingen Li et.al. 2412.03517 null
2024-12-05 A Bidirectional Siamese Recurrent Neural Network for Accurate Gait Recognition Using Body Landmarks Proma Hossain Progga et.al. 2412.03498 null
2024-12-04 MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras Huai Yu et.al. 2412.03146 link
2024-12-04 An indoor DSO-based ceiling-vision odometry system for indoor industrial environments Abdelhak Bougouffa et.al. 2412.02950 null
2024-12-03 EgoCast: Forecasting Egocentric Human Pose in the Wild Maria Escobar et.al. 2412.02903 null
2024-12-02 emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation Sasha Salter et.al. 2412.02725 null
2024-12-03 ProbPose: A Probabilistic Approach to 2D Human Pose Estimation Miroslav Purkrabek et.al. 2412.02254 null
2024-12-03 Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images Xiangyong Lu et.al. 2412.02197 link
2024-12-03 CLERF: Contrastive LEaRning for Full Range Head Pose Estimation Ting-Ruen Wei et.al. 2412.02066 null
2024-12-02 Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle Miroslav Purkrabek et.al. 2412.01562 link
2024-12-02 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting Yufeng Jin et.al. 2412.01543 null
2024-12-02 HandOS: 3D Hand Reconstruction in One Stage Xingyu Chen et.al. 2412.01537 null
2024-12-02 SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure Frames Yuxuan Zhou et.al. 2412.01500 null
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422 null
2024-12-02 Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures Qiyuan Shen et.al. 2412.01299 null
2024-12-02 CRISP: Object Pose and Shape Estimation with Test-Time Adaptation Jingnan Shi et.al. 2412.01052 null
2024-11-29 Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling Qirui Wu et.al. 2411.19492 null
2024-11-29 Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning Yang You et.al. 2411.19458 null
2024-11-28 GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model Rui Zhou et.al. 2411.19289 null
2024-11-28 HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos Prithviraj Banerjee et.al. 2411.19167 null
2024-11-28 Lost & Found: Updating Dynamic 3D Scene Graphs from Egocentric Observations Tjark Behrens et.al. 2411.19162 null
2024-11-28 Distributed Dual Quaternion Extended Kalman Filtering for Spacecraft Pose Estimation Mathias Hudoba de Badyn et.al. 2411.19033 null
2024-11-28 Waterfall Transformer for Multi-person Pose Estimation Navin Ranjan et.al. 2411.18944 null
2024-12-02 AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Sherwin Bahmani et.al. 2411.18673 null
2024-11-27 XR-MBT: Multi-modal Full Body Tracking for XR through Self-Supervision with Learned Depth Point Cloud Registration Denys Rozumnyi et.al. 2411.18377 null
2024-11-26 RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training Raktim Gautam Goswami et.al. 2411.17662 null

Visual Localization

Publish Date Title Authors PDF Code
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 null
2025-01-12 SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval Bhavin Jawade et.al. 2501.08347 null
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749 null
2025-01-06 Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI Xujin Li et.al. 2501.02841 null
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-02 R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization Xudong Jiang et.al. 2501.01421 null
2025-01-02 Training Medical Large Vision-Language Models with Abnormal-Aware Feedback Yucheng Zhou et.al. 2501.01377 null
2025-01-02 Domain-invariant feature learning in brain MR imaging for content-based image retrieval Shuya Tobari et.al. 2501.01326 null
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056 link
2024-12-25 FOR: Finetuning for Object Level Open Vocabulary Image Retrieval Hila Levi et.al. 2412.18806 null
2024-12-24 ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval Le Dong et.al. 2412.18136 link
2024-12-22 Where am I? Cross-View Geo-localization with Natural Language Descriptions Junyan Ye et.al. 2412.17007 null
2024-12-24 Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling Daichi Yashima et.al. 2412.16576 link
2024-12-20 A New Method to Capturing Compositional Knowledge in Linguistic Space Jiahe Wan et.al. 2412.15632 null
2024-12-20 Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation Samantha J Alloo et.al. 2412.15513 null
2024-12-19 Learning Visual Composition through Improved Semantic Guidance Austin Stone et.al. 2412.15396 null
2024-12-19 MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Junjie Zhou et.al. 2412.14475 null
2024-12-18 Adversarial Hubness in Multi-Modal Retrieval Tingwei Zhang et.al. 2412.14113 link
2024-12-18 Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval Giacomo Pacini et.al. 2412.13834 null
2024-12-18 ConDo: Continual Domain Expansion for Absolute Pose Regression Zijun Li et.al. 2412.13452 link
2024-12-17 Three Things to Know about Deep Metric Learning Yash Patel et.al. 2412.12432 null
2024-12-15 Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval Zelong Sun et.al. 2412.11087 null
2024-12-20 Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2412.11077 null
2024-12-13 MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition Qiwen Gu et.al. 2412.09199 null
2024-12-12 A Flexible Plug-and-Play Module for Generating Variable-Length Liyang He et.al. 2412.08922 link
2024-12-11 Image Retrieval Methods in the Dissimilarity Space Madhu Kiran et.al. 2412.08618 null
2024-12-11 Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Siyan Dong et.al. 2412.08376 null
2024-12-11 Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin Benjamin D. Killeen et.al. 2412.08020 null
2024-12-10 On Motion Blur and Deblurring in Visual Place Recognition Timur Ismagilov et.al. 2412.07751 null
2024-12-10 Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance Wanwen Chen et.al. 2412.07741 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition Connor Malone et.al. 2412.06153 null
2024-12-07 Compositional Image Retrieval via Instruction-Aware Contrastive Learning Wenliang Zhong et.al. 2412.05756 null
2024-12-06 DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification Ying Jin et.al. 2412.04828 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512 null
2024-12-04 Composed Image Retrieval for Training-Free Domain Conversion Nikos Efthymiadis et.al. 2412.03297 link
2024-12-03 A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration Thulio Amorim et.al. 2412.02881 null
2024-12-03 Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar et.al. 2412.02310 link
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features MD Shaikh Rahman et.al. 2412.01555 null
2024-12-02 Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models Yi Liao et.al. 2412.01202 null
2024-12-01 EDTformer: An Efficient Decoder Transformer for Visual Place Recognition Tong Jin et.al. 2412.00784 null
2024-11-28 EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval Muhammad Huzaifa et.al. 2412.00139 null
2024-11-28 Unleashing the Power of Data Synthesis in Visual Localization Sihang Li et.al. 2412.00138 null
2024-11-28 Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval Yang Liu et.al. 2412.00120 null
2024-11-29 A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications Liqiang Zhang Ye Tian Dongyan Wei et.al. 2411.19845 null
2024-11-27 Optimizing Image Retrieval with an Extended b-Metric Space Abdelkader Belhenniche et.al. 2411.18800 null
2024-11-26 Learning Visual Hierarchies with Hyperbolic Embeddings Ziwei Wang et.al. 2411.17490 null
2024-12-02 Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy You Li et.al. 2411.16752 null
2024-12-02 AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks You Li et.al. 2411.16749 null
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171 link
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-22 Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval Zengbao Sun et.al. 2411.14704 null
2024-11-20 Globally Correlation-Aware Hard Negative Generation Wenjie Peng et.al. 2411.13145 link
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null

Keypoint Detection

Publish Date Title Authors PDF Code
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-19 Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation Shibang Liu et.al. 2501.11069 null
2025-01-13 Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications Lukas Rustler et.al. 2501.07421 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-19 Corn Ear Detection and Orientation Estimation Using Deep Learning Nathan Sprague et.al. 2412.14954 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 link
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422 null
2024-11-23 OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin et.al. 2411.15653 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676 null
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-11-04 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang et.al. 2411.01846 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-11-27 Sim2real Cattle Joint Estimation in 3D point clouds Mohammad Okour et.al. 2410.14419 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155 null

2025-1

Pose Estimation

Publish Date Title Authors PDF Code
2025-02-08 Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment Maneesha Wickramasuriya et.al. 2502.05409 null
2025-02-06 Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation Nathan Louis et.al. 2502.04483 null
2025-02-06 GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation Weihang Li et.al. 2502.04293 null
2025-02-06 Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks Yuhui Jin et.al. 2502.03877 null
2025-02-05 Mapping and Localization Using LiDAR Fiducial Markers Yibo Liu et.al. 2502.03510 null
2025-02-04 Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation Jian Liu et.al. 2502.02525 link
2025-02-03 CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation Xiao Lin et.al. 2502.01312 null
2025-02-03 Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter Dabin Kim et.al. 2502.01092 null
2025-02-03 ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking Jianqiu Chen et.al. 2502.01004 null
2025-01-31 A Direct Semi-Exhaustive Search Method for Robust, Partial-to-Full Point Cloud Registration Richard Cheng et.al. 2502.00115 null
2025-01-31 XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses Bo Lan et.al. 2501.19034 link
2025-01-30 SimpleDepthPose: Fast and Reliable Human Pose Estimation with RGBD-Images Daniel Bermuth et.al. 2501.18478 null
2025-01-29 Online Trajectory Replanner for Dynamically Grasping Irregular Objects Minh Nhat Vu et.al. 2501.17968 null
2025-01-28 DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging Muxi Chen et.al. 2501.16751 null
2025-01-27 Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach Hoosang Lee et.al. 2501.16146 null
2025-01-27 NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation Jialun Cai et.al. 2501.15763 null
2025-01-25 Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos Zhen-Hui Dong et.al. 2501.15096 null
2025-01-25 SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos Yingying Jiao et.al. 2501.15073 null
2025-01-24 3D/2D Registration of Angiograms using Silhouette-based Differentiable Rendering Taewoong Lee et.al. 2501.14918 link
2025-01-24 Light3R-SfM: Towards Feed-forward Structure-from-Motion Sven Elflein et.al. 2501.14914 null
2025-01-24 Glissando-Net: Deep sinGLe vIew category level poSe eStimation ANd 3D recOnstruction Bo Sun et.al. 2501.14896 null
2025-01-24 Optimizing Grasping Precision for Industrial Pick-and-Place Tasks Through a Novel Visual Servoing Approach Khairidine Benali et.al. 2501.14557 null
2025-01-24 LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing Marcello Cellina et.al. 2501.14502 null
2025-01-24 Optimizing Human Pose Estimation Through Focused Human and Joint Regions Yingying Jiao et.al. 2501.14439 null
2025-01-24 Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation Haipeng Chen et.al. 2501.14356 null
2025-01-24 HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting Javier Yu et.al. 2501.14147 null
2025-01-23 Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Jianing Yang et.al. 2501.13928 null
2025-01-23 EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs Yizhe Lv et.al. 2501.13805 link
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402 null
2025-01-22 Deep Learning-Based Image Recovery and Pose Estimation for Resident Space Objects Louis Aberdeen et.al. 2501.13009 null
2025-01-21 BlanketGen2-Fit3D: Synthetic Blanket Augmentation Towards Improving Real-World In-Bed Blanket Occluded Human Pose Estimation Tamás Karácsony et.al. 2501.12318 null
2025-01-19 Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation Shibang Liu et.al. 2501.11069 null
2025-01-17 landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images Jef Jonkers et.al. 2501.10098 link
2025-01-16 A New Teacher-Reviewer-Student Framework for Semi-supervised 2D Human Pose Estimation Wulian Yun et.al. 2501.09565 null
2025-01-21 Towards Robust and Realistic Human Pose Estimation via WiFi Signals Yang Chen et.al. 2501.09411 link
2025-01-16 RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects Zhen Luo et.al. 2501.09307 null
2025-01-16 BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module Dongzhihan Wang et.al. 2501.08659 null
2025-01-14 Poseidon: A ViT-based Architecture for Multi-Frame Pose Estimation with Adaptive Frame Weighting and Multi-Scale Feature Fusion Cesare Davide Pace et.al. 2501.08446 link
2025-01-14 Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation Hansoo Park et.al. 2501.08408 null
2025-01-14 Predicting 4D Hand Trajectory from Monocular Videos Yufei Ye et.al. 2501.08329 null
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-14 AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation Feng Zhang et.al. 2501.08088 null
2025-01-14 Robust Low-Light Human Pose Estimation through Illumination-Texture Modulation Feng Zhang et.al. 2501.08038 null
2025-01-14 BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos Farnoosh Koleini et.al. 2501.07800 null
2025-01-13 Fixing the Scale and Shift in Monocular Depth For Camera Pose Estimation Yaqing Ding et.al. 2501.07742 link
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2025-01-13 Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics Tze Ho Elden Tse et.al. 2501.07100 null
2025-01-10 eKalibr: Dynamic Intrinsic Calibration for Event Cameras From First Principles of Events Shuolong Chen et.al. 2501.05688 null
2025-01-09 Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Yifan Yu et.al. 2501.05446 link
2025-01-09 From Simple to Complex Skills: The Case of In-Hand Object Reorientation Haozhi Qi et.al. 2501.05439 null
2025-01-11 Towards Balanced Continual Multi-Modal Learning in Human Pose Estimation Jiaxuan Peng et.al. 2501.05264 null
2025-01-08 KN-LIO: Geometric Kinematics and Neural Field Coupled LiDAR-Inertial Odometry Zhong Wang et.al. 2501.04263 null
2025-01-10 MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer Junsheng Luan et.al. 2501.03630 null
2025-01-07 TexHOI: Reconstructing Textures of 3D Unknown Objects in Monocular Hand-Object Interaction Scenes Alakh Aggarwal et.al. 2501.03525 link
2025-01-06 Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation Songlin Hou et.al. 2501.03336 null
2025-01-06 SurgRIPE challenge: Benchmark of Surgical Robot Instrument Pose Estimation Haozheng Xu et.al. 2501.02990 null
2025-01-06 HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos Jinglei Zhang et.al. 2501.02973 null
2025-01-06 Spiking monocular event based 6D pose estimation for space application Jonathan Courtois et.al. 2501.02916 null
2025-01-06 Universal Features Guided Zero-Shot Category-Level Object Pose Estimation Wentian Qu et.al. 2501.02831 null
2025-01-06 Unsupervised Domain Adaptation for Occlusion Resilient Human Pose Estimation Arindam Dutta et.al. 2501.02773 null
2025-01-06 WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation Tianjian Jiang et.al. 2501.02771 null
2025-01-05 LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments Haosong Yue et.al. 2501.02580 null
2025-01-04 ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle Yinchuan Wang et.al. 2501.02166 link
2025-01-03 TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation Jiajie Liu et.al. 2501.01770 null
2025-01-03 Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery Baoru Huang et.al. 2501.01752 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild Soumyaratna Debnath et.al. 2501.01174 null
2024-12-31 Relative Pose Observability Analysis Using Dual Quaternions Nicholas B. Andrews et.al. 2501.00657 null
2024-12-31 VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception Zhaoliang Wan et.al. 2501.00510 null
2024-12-30 Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields Evgenii Kruzhkov et.al. 2412.20976 null
2024-12-30 ReFlow6D: Refraction-Guided Transparent Object 6D Pose Estimation via Intermediate Representation Learning Hrishikesh Gupta et.al. 2412.20830 link
2024-12-30 Frequency-aware Event Cloud Network Hongwei Ren et.al. 2412.20803 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767 null
2024-12-30 Towards nation-wide analytical healthcare infrastructures: A privacy-preserving augmented knee rehabilitation case study Boris Bačić et.al. 2412.20733 null
2024-12-29 Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation Qucheng Peng et.al. 2412.20538 link

Visual Localization

Publish Date Title Authors PDF Code
2025-02-11 Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields Petr Koutenský et.al. 2502.07338 null
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327 null
2025-02-11 PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval Osman Tursun et.al. 2502.07215 null
2025-02-10 AstroLoc: Robust Space to Ground Image Localizer Gabriele Berton et.al. 2502.07003 null
2025-02-09 Uni-Retrieval: A Multi-Style Retrieval Framework for STEM’s Education Yanhao Jia et.al. 2502.05863 null
2025-02-07 Learning Street View Representations with Spatiotemporal Contrast Yong Li et.al. 2502.04638 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263 null
2025-02-05 Human-Aligned Image Models Improve Visual Decoding from the Brain Nona Rajabi et.al. 2502.03081 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335 null
2025-01-27 Freestyle Sketch-in-the-Loop Image Segmentation Subhadeep Koley et.al. 2501.16022 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-24 Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection Viktor Kozák et.al. 2501.14587 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-22 Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation Kenta Uesugi et.al. 2501.13968 null
2025-01-19 Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection Zhipeng Yu et.al. 2501.11063 link
2025-01-18 A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval Weihang Zhang et.al. 2501.10638 null
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 null
2025-01-12 SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval Bhavin Jawade et.al. 2501.08347 null
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749 null
2025-01-06 Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI Xujin Li et.al. 2501.02841 null
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-02 R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization Xudong Jiang et.al. 2501.01421 null
2025-01-02 Training Medical Large Vision-Language Models with Abnormal-Aware Feedback Yucheng Zhou et.al. 2501.01377 null
2025-01-02 Domain-invariant feature learning in brain MR imaging for content-based image retrieval Shuya Tobari et.al. 2501.01326 null
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056 link
2024-12-25 FOR: Finetuning for Object Level Open Vocabulary Image Retrieval Hila Levi et.al. 2412.18806 null
2024-12-24 ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval Le Dong et.al. 2412.18136 link
2024-12-22 Where am I? Cross-View Geo-localization with Natural Language Descriptions Junyan Ye et.al. 2412.17007 null
2024-12-24 Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling Daichi Yashima et.al. 2412.16576 link
2024-12-20 A New Method to Capturing Compositional Knowledge in Linguistic Space Jiahe Wan et.al. 2412.15632 null
2024-12-20 Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation Samantha J Alloo et.al. 2412.15513 null

Keypoint Detection

Publish Date Title Authors PDF Code
2025-03-07 Automatic determination of quasicrystalline patterns from microscopy images Tano Kim Kender et.al. 2503.05472 null
2025-03-07 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499 null
2025-03-04 A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection Junyi Wang et.al. 2503.02481 null
2025-03-01 Autonomous Dissection in Robotic Cholecystectomy Ki-Hwan Oh et.al. 2503.00666 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets Jisoo Lee et.al. 2502.19766 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-19 Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation Shibang Liu et.al. 2501.11069 null
2025-01-13 Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications Lukas Rustler et.al. 2501.07421 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-19 Corn Ear Detection and Orientation Estimation Using Deep Learning Nathan Sprague et.al. 2412.14954 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 link

2025-2

Pose Estimation

Publish Date Title Authors PDF Code
2025-02-28 BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports Jing-Yuan Chang et.al. 2502.21085 null
2025-02-28 Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints Masoumeh Chapariniya et.al. 2502.20803 null
2025-02-27 Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison Jiageng Zhong et.al. 2502.20154 null
2025-02-27 BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground Yufei Wei et.al. 2502.20078 null
2025-02-28 SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation Zijie Zhou et.al. 2502.20077 link
2025-02-27 RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges Thibaut Loiseau et.al. 2502.19955 null
2025-02-27 QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects Elkhan Ismayilzada et.al. 2502.19769 null
2025-02-27 Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System Shunkun Liang et.al. 2502.19708 null
2025-02-26 Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects Petri Mäkinen et.al. 2502.19169 null
2025-02-25 EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity Dominik Hollidt et.al. 2502.18373 null
2025-02-25 Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation Tianyang Xu et.al. 2502.18214 link
2025-02-24 V-HOP: Visuo-Haptic 6D Object Pose Tracking Hongyu Li et.al. 2502.17434 null
2025-02-23 Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM Yao Zhang et.al. 2502.16495 null
2025-02-23 DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion Jianbin Jiao et.al. 2502.16419 link
2025-02-21 RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes Sicheng Yu et.al. 2502.15633 null
2025-02-21 SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training Nie Lin et.al. 2502.15251 null
2025-02-21 Nonlinear Dynamical Systems for Automatic Face Annotation in Head Tracking and Pose Estimation Thoa Thieu et.al. 2502.15179 null
2025-02-20 Design of a Visual Pose Estimation Algorithm for Moon Landing Atakan Süslü et.al. 2502.14942 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931 null
2025-02-19 EfficientPose 6D: Scalable and Efficient 6D Object Pose Estimation Zixuan Fang et.al. 2502.14061 null
2025-02-19 Active Illumination for Visual Ego-Motion Estimation in the Dark Francesco Crocetti et.al. 2502.13708 null
2025-02-19 Object-Pose Estimation With Neural Population Codes Heiko Hoffmann et.al. 2502.13403 null
2025-02-18 Spatiotemporal Multi-Camera Calibration using Freely Moving People Sang-Eun Lee et.al. 2502.12546 null
2025-02-18 Learning Transformation-Isomorphic Latent Space for Accurate Hand Pose Estimation Kaiwen Ren et.al. 2502.12535 null
2025-02-19 FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views Shangzhan Zhang et.al. 2502.12138 null
2025-02-17 Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection Tessa Pulli et.al. 2502.12027 null
2025-02-17 SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking Zijian Wu et.al. 2502.11534 null
2025-02-18 VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS Ming Meng et.al. 2502.10729 link
2025-02-15 Semantics-aware Test-time Adaptation for 3D Human Pose Estimation Qiuxia Lin et.al. 2502.10724 null
2025-02-15 Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video Runyang Feng et.al. 2502.10616 null
2025-02-14 HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation Yibo Liu et.al. 2502.10606 null
2025-02-14 Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models Chenrui Tie et.al. 2502.10090 null
2025-02-13 Metamorphic Testing for Pose Estimation Systems Matias Duran et.al. 2502.09460 null
2025-02-13 BevSplat: Resolving Height Ambiguity via Feature-Based Gaussian Primitives for Weakly-Supervised Cross-View Localization Qiwei Wang et.al. 2502.09080 null
2025-02-14 Siren Song: Manipulating Pose Estimation in XR Headsets Using Acoustic Attacks Zijian Huang et.al. 2502.08865 null
2025-02-12 LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features Shujie Zhou et.al. 2502.08676 link
2025-02-12 CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World Yankai Fu et.al. 2502.08449 null
2025-02-11 GaRLIO: Gravity enhanced Radar-LiDAR-Inertial Odometry Chiyun Noh et.al. 2502.07703 link
2025-02-11 Matrix3D: Large Photogrammetry Model All-in-One Yuanxun Lu et.al. 2502.07685 null
2025-02-08 Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment Maneesha Wickramasuriya et.al. 2502.05409 null
2025-02-06 Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation Nathan Louis et.al. 2502.04483 link
2025-02-06 GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation Weihang Li et.al. 2502.04293 null
2025-02-06 Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks Yuhui Jin et.al. 2502.03877 null
2025-02-05 Mapping and Localization Using LiDAR Fiducial Markers Yibo Liu et.al. 2502.03510 null
2025-02-04 Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation Jian Liu et.al. 2502.02525 link
2025-02-03 CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation Xiao Lin et.al. 2502.01312 null
2025-02-03 Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter Dabin Kim et.al. 2502.01092 null
2025-02-03 ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking Jianqiu Chen et.al. 2502.01004 null
2025-01-31 A Direct Semi-Exhaustive Search Method for Robust, Partial-to-Full Point Cloud Registration Richard Cheng et.al. 2502.00115 null
2025-01-31 XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses Bo Lan et.al. 2501.19034 link
2025-01-30 SimpleDepthPose: Fast and Reliable Human Pose Estimation with RGBD-Images Daniel Bermuth et.al. 2501.18478 null
2025-01-29 Online Trajectory Replanner for Dynamically Grasping Irregular Objects Minh Nhat Vu et.al. 2501.17968 null
2025-01-28 DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging Muxi Chen et.al. 2501.16751 null
2025-01-27 Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach Hoosang Lee et.al. 2501.16146 null

Visual Localization

Publish Date Title Authors PDF Code
2025-03-04 TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition Oliver Grainge et.al. 2503.02511 null
2025-03-04 Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models Kenta Tsukahara et.al. 2503.02256 null
2025-03-03 Composed Multi-modal Retrieval: A Survey of Approaches and Applications Kun Zhang et.al. 2503.01334 link
2025-03-03 AirRoom: Objects Matter in Room Reidentification Runmao Yao et.al. 2503.01130 null
2025-03-02 Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching Jinyu Miao et.al. 2503.00862 null
2025-03-01 Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning Songlin Dong et.al. 2503.00515 null
2025-02-28 EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration Kuangyi Chen et.al. 2503.00167 null
2025-02-28 CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Zelong Sun et.al. 2502.20826 null
2025-02-28 SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition Shanshan Wan et.al. 2502.20676 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036 link
2025-02-27 On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation Ruben T. Lucassen et.al. 2502.19285 null
2025-02-19 A Comprehensive Survey on Composed Image Retrieval Xuemeng Song et.al. 2502.18495 null
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237 link
2025-02-23 Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries Yin Wu et.al. 2502.16636 link
2025-02-23 SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition Feng Lu et.al. 2502.16601 link
2025-02-21 ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval Guanqi Zhan et.al. 2502.15682 null
2025-02-20 Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition Tianyi Shang et.al. 2502.14195 link
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-18 Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Shuo Xing et.al. 2502.13146 link
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras Dongki Jung et.al. 2502.12545 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303 null
2025-02-17 Descriminative-Generative Custom Tokens for Vision-Language Models Pramuditha Perera et.al. 2502.12095 null
2025-02-17 ILIAS: Instance-Level Image retrieval At Scale Giorgos Kordopatis-Zilos et.al. 2502.11748 null
2025-02-17 Range and Bird’s Eye View Fused Cross-Modal Visual Place Recognition Jianyi Peng et.al. 2502.11742 null
2025-02-17 Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics Francesco Croce et.al. 2502.11725 link
2025-02-17 Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization Yuanze Xu et.al. 2502.11408 null
2025-02-12 E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection Junjie Wu et.al. 2502.10455 null
2025-02-11 Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning Yuhang Dong et.al. 2502.09649 null
2025-02-13 ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Rotem Shalev-Arkushin et.al. 2502.09411 null
2025-02-12 SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization Artem Dementyev et.al. 2502.08848 null
2025-02-12 Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions Prajwal Gatti et.al. 2502.08438 null
2025-02-11 Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang et.al. 2502.07830 null
2025-02-11 Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields Petr Koutenský et.al. 2502.07338 null
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327 null
2025-02-11 PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval Osman Tursun et.al. 2502.07215 null
2025-02-10 AstroLoc: Robust Space to Ground Image Localizer Gabriele Berton et.al. 2502.07003 null
2025-02-09 Uni-Retrieval: A Multi-Style Retrieval Framework for STEM’s Education Yanhao Jia et.al. 2502.05863 null
2025-02-07 Learning Street View Representations with Spatiotemporal Contrast Yong Li et.al. 2502.04638 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263 link
2025-02-05 Human-Aligned Image Models Improve Visual Decoding from the Brain Nona Rajabi et.al. 2502.03081 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335 null
2025-01-27 Freestyle Sketch-in-the-Loop Image Segmentation Subhadeep Koley et.al. 2501.16022 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-24 Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection Viktor Kozák et.al. 2501.14587 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-22 Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation Kenta Uesugi et.al. 2501.13968 null
2025-01-19 Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection Zhipeng Yu et.al. 2501.11063 link
2025-01-18 A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval Weihang Zhang et.al. 2501.10638 null
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 null

Keypoint Detection

Publish Date Title Authors PDF Code
2025-03-05 Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing Ryan Banks et.al. 2503.13477 null
2025-03-16 Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting Jiadong Zhou et.al. 2503.12541 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-07 Automatic determination of quasicrystalline patterns from microscopy images Tano Kim Kender et.al. 2503.05472 null
2025-03-07 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499 link
2025-03-04 A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection Junyi Wang et.al. 2503.02481 null
2025-03-01 Autonomous Dissection in Robotic Cholecystectomy Ki-Hwan Oh et.al. 2503.00666 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets Jisoo Lee et.al. 2502.19766 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-13 Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications Lukas Rustler et.al. 2501.07421 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null

2025-3

Pose Estimation

Publish Date Title Authors PDF Code
2025-04-04 Robust Human Registration with Body Part Segmentation on Noisy Point Clouds Kai Lascheit et.al. 2504.03602 null
2025-04-04 Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video Jiaxin Guo et.al. 2504.03198 null
2025-04-03 Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks Hyun-Ho Choi et.al. 2504.03052 null
2025-04-03 BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation Van Nguyen Nguyen et.al. 2504.02812 null
2025-04-03 PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation Lihua Liu et.al. 2504.02617 null
2025-04-02 Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation Mingrui Ye et.al. 2504.01764 link
2025-04-02 ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue Thomas Pritchard et.al. 2504.01261 link
2025-04-01 AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline Lei Wang et.al. 2504.00394 null
2025-03-31 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Xingyu Chen et.al. 2503.24391 link
2025-03-31 LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds Masahiko Tsuji et.al. 2503.23664 null
2025-03-30 PhysPose: Refining 6D Object Poses with Physical Constraints Martin Malenický et.al. 2503.23587 null
2025-03-30 Improving Indoor Localization Accuracy by Using an Efficient Implicit Neural Map Representation Haofei Kuang et.al. 2503.23480 link
2025-03-30 SparseLoc: Sparse Open-Set Landmark-based Global Localization for Autonomous Navigation Pranjal Paul et.al. 2503.23465 null
2025-03-30 HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation Hongwei Zheng et.al. 2503.23331 null
2025-03-29 Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization Jintao Cheng et.al. 2503.23199 null
2025-03-28 ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection Nandakishor M et.al. 2503.22363 null
2025-03-28 GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion Li-Heng Chen et.al. 2503.22349 null
2025-03-27 NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications Kibon Ku et.al. 2503.21958 null
2025-03-27 Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video David Yifan Yao et.al. 2503.21761 link
2025-03-27 Reconstructing Humans with a Biomechanically Accurate Skeleton Yan Xia et.al. 2503.21751 null
2025-03-27 OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation Mallika Garg et.al. 2503.21723 null
2025-03-27 RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond Daniel Bermuth et.al. 2503.21692 null
2025-03-27 STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM Yongxu Wang et.al. 2503.21425 null
2025-03-27 Lidar-only Odometry based on Multiple Scan-to-Scan Alignments over a Moving Window Aaron Kurda et.al. 2503.21293 null
2025-03-27 Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation Junjie Chen et.al. 2503.21140 link
2025-03-26 DINeMo: Learning Neural Mesh Models with no 3D Annotations Weijie Guo et.al. 2503.20220 null
2025-03-25 Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors Yuke Lou et.al. 2503.20118 null
2025-03-25 Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders Paul Koch et.al. 2503.19947 null
2025-03-25 Visuo-Tactile Object Pose Estimation for a Multi-Finger Robot Hand with Low-Resolution In-Hand Tactile Sensing Lukas Mack et.al. 2503.19893 null
2025-03-25 Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving Yusen Xie et.al. 2503.19713 null
2025-03-25 DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios Xiangting Meng et.al. 2503.19625 null
2025-03-25 Pose-Based Fall Detection System: Efficient Monitoring on Standard CPUs Vinayak Mali et.al. 2503.19501 null
2025-03-25 Multi-modal 3D Pose and Shape Estimation with Computed Tomography Mingxiao Tu et.al. 2503.19405 null
2025-03-25 From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting Zhiwei Huang et.al. 2503.19358 null
2025-03-25 Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation Zhuoran Zhao et.al. 2503.19307 null
2025-03-25 Any6D: Model-free 6D Pose Estimation of Novel Objects Taeyeop Lee et.al. 2503.18673 null
2025-03-24 Structure-Aware Correspondence Learning for Relative Pose Estimation Yihan Chen et.al. 2503.18671 null
2025-03-24 TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos Kazuhiro Yamada et.al. 2503.18282 null
2025-03-23 Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning Xiang Fang et.al. 2503.17938 null
2025-03-22 Co-op: Correspondence-based Novel Object Pose Estimation Sungphill Moon et.al. 2503.17731 null
2025-03-21 Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image Jerred Chen et.al. 2503.17358 null
2025-03-21 Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors Wonbong Jang et.al. 2503.17316 null
2025-03-20 ContactFusion: Stochastic Poisson Surface Maps from Visual and Contact Sensing Aditya Kamireddypalli et.al. 2503.16592 null
2025-03-19 A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions Saddam Hussain Khan et.al. 2503.16546 null
2025-03-20 Probabilistic Prompt Distribution Learning for Animal Pose Estimation Jiyong Rao et.al. 2503.16120 link
2025-03-20 Automating 3D Dataset Generation with Neural Radiance Fields P. Schulz et.al. 2503.15997 link
2025-03-20 Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras Beilei Cui et.al. 2503.15917 null
2025-03-19 EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds Yuanchao Yue et.al. 2503.15284 null
2025-03-20 GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation Zinqin Huang et.al. 2503.15110 link
2025-03-20 Distilling 3D distinctive local descriptors for 6D pose estimation Amir Hamza et.al. 2503.15106 null
2025-03-18 Validation of Human Pose Estimation and Human Mesh Recovery for Extracting Clinically Relevant Motion Data from Videos Kai Armstrong et.al. 2503.14760 null
2025-03-18 SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model Yucheng Mao et.al. 2503.14463 null
2025-03-18 SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation Weihong Chen et.al. 2503.14097 null
2025-03-18 Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach Tianshu Wu et.al. 2503.14051 null
2025-03-19 Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation Huan Ren et.al. 2503.13926 null
2025-03-20 STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans Shashikant Verma et.al. 2503.13344 null
2025-03-17 UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation Yinqiao Wang et.al. 2503.13303 null
2025-03-17 Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation Nassim Ali Ousalah et.al. 2503.13053 null
2025-03-17 PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data ChangHee Yang et.al. 2503.13025 null
2025-03-15 Gun Detection Using Combined Human Pose and Weapon Appearance Amulya Reddy Maligireddy et.al. 2503.12215 null
2025-03-15 TACO: Taming Diffusion for in-the-wild Video Amodal Completion Ruijie Lu et.al. 2503.12049 null
2025-03-14 Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation Hiroyasu Akada et.al. 2503.11652 null
2025-03-14 Online Test-time Adaptation for 3D Human Pose Estimation: A Practical Perspective with Estimated 2D Poses Qiuxia Lin et.al. 2503.11194 null
2025-03-14 Fast and Robust Localization for Humanoid Soccer Robot via Iterative Landmark Matching Ruochen Hou et.al. 2503.11020 null
2025-03-13 Clothes-Changing Person Re-identification Based On Skeleton Dynamics Asaf Joseph et.al. 2503.10759 null
2025-03-13 Consistent multi-animal pose estimation in cattle using dynamic Kalman filter based tracking Maarten Perneel et.al. 2503.10450 null
2025-03-13 6D Object Pose Tracking in Internet Videos for Robotic Manipulation Georgy Ponimatkin et.al. 2503.10307 null
2025-03-13 VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames Zhiqi Li et.al. 2503.10286 null
2025-03-12 Physics-Aware Human-Object Rendering from Sparse Views via 3D Gaussian Splatting Weiquan Wang et.al. 2503.09640 null
2025-03-12 GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals Shuokang Huang et.al. 2503.09537 null
2025-03-12 MonoSLAM: Robust Monocular SLAM with Global Structure Optimization Bingzheng Jiang et.al. 2503.09296 null
2025-03-12 Better Together: Unified Motion Capture and 3D Avatar Reconstruction Arthur Moreau et.al. 2503.09293 null
2025-03-11 Acoustic Neural 3D Reconstruction Under Pose Drift Tianxiang Lin et.al. 2503.08930 null
2025-03-11 Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments Rajitha de Silva et.al. 2503.08843 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-11 SGNetPose+: Stepwise Goal-Driven Networks with Pose Information for Trajectory Prediction in Autonomous Driving Akshat Ghiya et.al. 2503.08016 null
2025-03-10 Better Pose Initialization for Fast and Robust 2D/3D Pelvis Registration Yehyun Suh et.al. 2503.07767 null
2025-03-10 HumanMM: Global Human Motion Recovery from Multi-shot Videos Yuhong Zhang et.al. 2503.07597 null
2025-03-11 AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements Calvin Yeung et.al. 2503.07499 null
2025-03-10 Multi-Robot System for Cooperative Exploration in Unknown Environments: A Survey Chuqi Wang et.al. 2503.07278 null
2025-03-12 Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion Mona Sheikh Zeinoddin et.al. 2503.07204 null
2025-03-10 Multi-Modal 3D Mesh Reconstruction from Images and Text Melvin Reka et.al. 2503.07190 null
2025-03-11 PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM Alan Dao et.al. 2503.07111 null
2025-03-09 AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation Yang Zou et.al. 2503.06660 null
2025-03-08 NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features Hongjia Zhai et.al. 2503.06117 null
2025-03-08 Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision David C. Jeong et.al. 2503.06089 null
2025-03-08 ReJSHand: Efficient Real-Time Hand Pose Estimation and Mesh Reconstruction Using Refined Joint and Skeleton Features Shan An et.al. 2503.05995 null
2025-03-07 Differentiable Rendering-based Pose Estimation for Surgical Robotic Instruments Zekai Liang et.al. 2503.05953 null
2025-03-07 Novel Object 6D Pose Estimation with a Single Reference View Jian Liu et.al. 2503.05578 null
2025-03-07 Multi-Grained Feature Pruning for Video-Based Human Pose Estimation Zhigang Wang et.al. 2503.05365 null
2025-03-07 Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects Justin Yu et.al. 2503.05189 null
2025-03-07 SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting Linqi Yang et.al. 2503.05174 null
2025-03-07 GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting Zheng Zhou et.al. 2503.05161 null
2025-03-06 MarsLGPR: Mars Rover Localization with Ground Penetrating Radar Anja Sheppard et.al. 2503.04944 null
2025-03-06 ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem Yu-Hsi Chen et.al. 2503.04500 null
2025-03-05 Active 6D Pose Estimation for Textureless Objects using Multi-View RGB Frames Jun Yang et.al. 2503.03726 null
2025-03-05 Machine Learning in Biomechanics: Key Applications and Limitations in Walking, Running, and Sports Movements Carlo Dindorf et.al. 2503.03717 null
2025-03-05 Improving 6D Object Pose Estimation of metallic Household and Industry Objects Thomas Pöllabauer et.al. 2503.03655 null
2025-03-05 Tiny Lidars for Manipulator Self-Awareness: Sensor Characterization and Initial Localization Experiments Giammarco Caroleo et.al. 2503.03449 null
2025-03-05 Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments Jie Deng et.al. 2503.03373 null
2025-03-05 Supervised Visual Docking Network for Unmanned Surface Vehicles Using Auto-labeling in Real-world Water Environments Yijie Chu et.al. 2503.03282 null
2025-03-05 SCORE: Saturated Consensus Relocalization in Semantic Line Maps Haodong Jiang et.al. 2503.03254 null
2025-03-04 Monocular Person Localization under Camera Ego-motion Yu Zhan et.al. 2503.02916 null
2025-03-04 PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers Wooju Lee et.al. 2503.02388 null
2025-03-04 DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting Haoyuan Li et.al. 2503.02223 null
2025-03-04 Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints Yan Miao et.al. 2503.02198 null
2025-03-03 Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM Marco Giberna et.al. 2503.02050 null
2025-03-03 Category-level Meta-learned NeRF Priors for Efficient Object Mapping Saad Ejaz et.al. 2503.01582 null
2025-03-03 RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation Shu Pan et.al. 2503.01434 null
2025-03-03 ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization Anas Abdelkarim et.al. 2503.01311 null
2025-03-03 Convex Hull-based Algebraic Constraint for Visual Quadric SLAM Xiaolong Yu et.al. 2503.01254 link
2025-03-04 Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction Haolin Wang et.al. 2503.00397 null
2025-03-01 BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds Yuto Shibata et.al. 2503.00389 null

Visual Localization

Publish Date Title Authors PDF Code
2025-04-09 Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Ruotian Peng et.al. 2504.06666 null
2025-04-08 To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition Davide Sferrazza et.al. 2504.06116 null
2025-04-06 NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval Peng Gao et.al. 2504.04339 null
2025-04-04 REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval Shabnam Choudhury et.al. 2504.03169 null
2025-04-06 Re-thinking Temporal Search for Long-Form Video Understanding Jinhui Ye et.al. 2504.02259 null
2025-04-02 Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval Yuji Nozawa et.al. 2504.01348 null
2025-04-01 IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval Bangwei Liu et.al. 2504.00954 null
2025-04-01 Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data Yiqun Duan et.al. 2504.00812 null
2025-03-31 CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Yingrui Ji et.al. 2503.24182 null
2025-03-31 LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds Masahiko Tsuji et.al. 2503.23664 null
2025-03-30 Multiview Image-Based Localization Cameron Fiore et.al. 2503.23577 null
2025-03-27 LOCORE: Image Re-ranking with Long-Context Sequence Modeling Zilin Xiao et.al. 2503.21772 link
2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null
2025-03-27 UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation Yehui Shen et.al. 2503.21338 link
2025-03-27 FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval Zixu Li et.al. 2503.21309 link
2025-03-27 Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing Shuai Li et.al. 2503.21236 null
2025-03-25 CoLLM: A Large Language Model for Composed Image Retrieval Chuong Huynh et.al. 2503.19910 link
2025-03-25 Scene-agnostic Pose Regression for Visual Localization Junwei Zheng et.al. 2503.19543 null
2025-03-25 From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting Zhiwei Huang et.al. 2503.19358 null
2025-03-25 Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval Haoqiang Lin et.al. 2503.19296 link
2025-03-23 LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space Zhangyu Wang et.al. 2503.18142 null
2025-03-23 Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning Xiang Fang et.al. 2503.17938 null
2025-03-23 What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images Dongheng Lin et.al. 2503.17899 null
2025-03-22 good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval Pranavi Kolouju et.al. 2503.17871 null
2025-03-21 Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2503.17109 null
2025-03-20 PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Qiang Zou et.al. 2503.16064 link
2025-03-20 Automating 3D Dataset Generation with Neural Radiance Fields P. Schulz et.al. 2503.15997 link
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Scale Efficient Training for Large Datasets Qing Zhou et.al. 2503.13385 null
2025-03-17 Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features Václav Truhlařík et.al. 2503.13090 null
2025-03-17 All You Need to Know About Training Image Retrieval Models Gabriele Berton et.al. 2503.13045 link
2025-03-12 Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark Yibin Ye et.al. 2503.10692 link
2025-03-13 ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning Pengfei Luo et.al. 2503.10166 link
2025-03-12 Revisiting Medical Image Retrieval via Knowledge Consolidation Yang Nan et.al. 2503.09370 null
2025-03-11 CQVPR: Landmark-aware Contextual Queries for Visual Place Recognition Dongyue Li et.al. 2503.08170 null
2025-03-10 Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization Michael Green et.al. 2503.07038 null
2025-03-10 Zero-Shot Hashing Based on Reconstruction With Part Alignment Yan Jiang et.al. 2503.07037 null
2025-03-10 Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction Somayeh Hussaini et.al. 2503.06840 null
2025-03-09 RoboDesign1M: A Large-scale Dataset for Robot Design Understanding Tri Le et.al. 2503.06796 null
2025-03-09 StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen et.al. 2503.06601 link
2025-03-09 TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification Huaqi Tao et.al. 2503.06501 null
2025-03-08 NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features Hongjia Zhai et.al. 2503.06117 null
2025-03-07 Data-Efficient Generalization for Zero-shot Composed Image Retrieval Zining Chen et.al. 2503.05204 null
2025-03-06 RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining Tengfei Zhang et.al. 2503.04653 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior Haitao Wu et.al. 2503.04207 null
2025-03-06 Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments Beverley Gorry et.al. 2503.04096 null
2025-03-04 TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition Oliver Grainge et.al. 2503.02511 null
2025-03-04 Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models Kenta Tsukahara et.al. 2503.02256 null
2025-03-03 Composed Multi-modal Retrieval: A Survey of Approaches and Applications Kun Zhang et.al. 2503.01334 link
2025-03-03 AirRoom: Objects Matter in Room Reidentification Runmao Yao et.al. 2503.01130 null
2025-03-02 Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching Jinyu Miao et.al. 2503.00862 null
2025-03-01 Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning Songlin Dong et.al. 2503.00515 null
2025-02-28 EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration Kuangyi Chen et.al. 2503.00167 null
2025-02-28 CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Zelong Sun et.al. 2502.20826 null
2025-02-28 SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition Shanshan Wan et.al. 2502.20676 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036 link

Keypoint Detection

Publish Date Title Authors PDF Code
2025-04-29 Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis Muhammad Turab et.al. 2504.21154 null
2025-04-29 Learning a General Model: Folding Clothing with Topological Dynamics Yiming Liu et.al. 2504.20720 null
2025-04-26 VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation Niaz Ahmad et.al. 2504.19032 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-15 UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques Pedro Diaz-Garcia et.al. 2504.11063 null
2025-04-15 Acquisition of high-quality images for camera calibration in robotics applications via speech prompts Timm Linder et.al. 2504.11031 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-29 Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction Shayan Sepahvand et.al. 2503.23171 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-05 Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing Ryan Banks et.al. 2503.13477 null
2025-03-16 Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting Jiadong Zhou et.al. 2503.12541 null
2025-04-12 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-07 Automatic determination of quasicrystalline patterns from microscopy images Tano Kim Kender et.al. 2503.05472 null
2025-03-07 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499 link
2025-03-04 A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection Junyi Wang et.al. 2503.02481 null
2025-03-01 Autonomous Dissection in Robotic Cholecystectomy Ki-Hwan Oh et.al. 2503.00666 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets Jisoo Lee et.al. 2502.19766 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844 null

2025-4

Pose Estimation

Publish Date Title Authors PDF Code
2025-05-02 T-Graph: Enhancing Sparse-view Camera Pose Estimation by Pairwise Translation Graph Qingyu Xian et.al. 2505.01207 null
2025-05-02 3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer Kamel Aouaidjia et.al. 2505.01003 null
2025-05-01 Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? Viktor Kocur et.al. 2505.00866 null
2025-05-01 P2P-Insole: Human Pose Estimation Using Foot Pressure Distribution and Motion Sensors Atsuya Watanabe et.al. 2505.00755 null
2025-05-01 Dietary Intake Estimation via Continuous 3D Reconstruction of Food Wallace Lee et.al. 2505.00606 null
2025-05-02 InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method Nguyen Hoang Khoi Tran et.al. 2505.00512 null
2025-04-30 Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling Stavrow A. Bahnam et.al. 2504.21695 null
2025-04-29 Dance Style Recognition Using Laban Movement Analysis Muhammad Turab et.al. 2504.21166 null
2025-04-29 Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining Weizhen He et.al. 2504.20800 null
2025-04-29 A Survey on Event-based Optical Marker Systems Nafiseh Jabbari Tofighi et.al. 2504.20736 null
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-05-01 PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking Xiatao Sun et.al. 2504.20359 null
2025-04-28 Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM Leon Davies et.al. 2504.19654 null
2025-04-28 GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM Leon Davies et.al. 2504.19653 null
2025-04-28 Category-Level and Open-Set Object Pose Estimation for Robotics Peter Hönig et.al. 2504.19572 null
2025-04-25 Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift Devansh R. Agrawal et.al. 2504.18713 null
2025-04-25 SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations Shuting Zhao et.al. 2504.18332 null
2025-04-25 S3MOT: Monocular 3D Object Tracking with Selective State Space Model Zhuohao Yan et.al. 2504.18068 null
2025-04-22 SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos Yuxin Yao et.al. 2504.17810 null
2025-04-24 Dynamic Camera Poses and Where to Find Them Chris Rockwell et.al. 2504.17788 null
2025-04-24 A Guide to Structureless Visual Localization Vojtech Panek et.al. 2504.17636 null
2025-04-24 Object Pose Estimation by Camera Arm Control Based on the Next Viewpoint Estimation Tomoki Mizuno et.al. 2504.17424 null
2025-04-24 Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization Guangyang Zeng et.al. 2504.17410 null
2025-04-23 WiFi based Human Fall and Activity Recognition using Transformer based Encoder Decoder and Graph Neural Networks Younggeol Cho et.al. 2504.16655 null
2025-04-23 Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection Md Fahimuzzman Sohan et.al. 2504.16404 null
2025-04-22 SignX: The Foundation Model for Sign Recognition Sen Fang et.al. 2504.16315 null
2025-04-22 GADS: A Super Lightweight Model for Head Pose Estimation Menan Velayuthan et.al. 2504.15751 null
2025-04-21 Field Report on Ground Penetrating Radar for Localization at the Mars Desert Research Station Anja Sheppard et.al. 2504.15455 null
2025-04-21 Vision6D: 3D-to-2D Interactive Visualization and Annotation Tool for 6D Pose Estimation Yike Zhang et.al. 2504.15329 null
2025-04-21 Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Chun-Hsiao Yeh et.al. 2504.15280 link
2025-04-21 Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation Xiao Zhang et.al. 2504.15134 null
2025-04-20 Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction Weirong Chen et.al. 2504.14516 null
2025-04-20 SG-Reg: Generalizable and Efficient Scene Graph Registration Chuhao Liu et.al. 2504.14440 link
2025-04-18 Imitation Learning with Precisely Labeled Human Demonstrations Yilong Song et.al. 2504.13803 null
2025-04-18 Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction Wenyu Li et.al. 2504.13419 null
2025-04-17 ViTa-Zero: Zero-shot Visuotactile Object 6D Pose Estimation Hongyu Li et.al. 2504.13179 null
2025-04-18 ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos Zetong Zhang et.al. 2504.13167 null
2025-04-17 Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms Jingjing Liu et.al. 2504.12699 null
2025-04-16 MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices Vasco Xu et.al. 2504.12492 link
2025-04-16 Diffusion Based Robust LiDAR Place Recognition Benjamin Krummenacher et.al. 2504.12412 null
2025-04-16 Regist3R: Incremental Registration with Stereo Foundation Model Sidun Liu et.al. 2504.12356 null
2025-04-16 CoMotion: Concurrent Multi-person 3D Motion Alejandro Newell et.al. 2504.12186 link
2025-04-16 No Fuss, Just Function – A Proposal for Non-Intrusive Full Body Tracking in XR for Meaningful Spatial Interactions Elisabeth Mayer et.al. 2504.11987 null
2025-04-16 An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World Xingwu Ji et.al. 2504.11698 link
2025-04-17 CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image Jingshun Huang et.al. 2504.11230 null
2025-04-15 DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention Haohan Chen et.al. 2504.11160 null
2025-04-14 MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model Jian Liu et.al. 2504.10433 null
2025-04-14 Benchmarking 3D Human Pose Estimation Models Under Occlusions Filipa Lino et.al. 2504.10350 null
2025-04-15 Differentially Private 2D Human Pose Estimation Kaushik Bhargav Sivangi et.al. 2504.10190 null
2025-04-14 TT3D: Table Tennis 3D Reconstruction Thomas Gossard et.al. 2504.10035 null
2025-04-14 Efficient 2D to Full 3D Human Pose Uplifting including Joint Rotations Katja Ludwig et.al. 2504.09953 null
2025-04-14 NeRF-Based Transparent Object Grasping Enhanced by Shape Priors Yi Han et.al. 2504.09868 null
2025-04-13 EasyREG: Easy Depth-Based Markerless Registration and Tracking using Augmented Reality Device for Surgical Guidance Yue Yang et.al. 2504.09498 null
2025-04-12 SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow Qingyuan Wang et.al. 2504.09160 null
2025-04-12 A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Jizong Peng et.al. 2504.09129 null
2025-04-12 BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting Jeongwan On et.al. 2504.09097 null
2025-04-11 The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation Masashi Hatano et.al. 2504.08654 null
2025-04-11 MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction Ian Noronha et.al. 2504.08646 null
2025-04-11 Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review Claudio Cimarelli et.al. 2504.08588 null
2025-04-11 Multi-person Physics-based Pose Estimation for Combat Sports Hossein Feiz et.al. 2504.08175 null
2025-04-10 Towards Unconstrained 2D Pose Estimation of the Human Spine Muhammad Saif Ullah Khan et.al. 2504.08110 null
2025-04-10 BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation Yuanhong Yu et.al. 2504.07955 null
2025-04-09 DLTPose: 6DoF Pose Estimation From Accurate Dense Surface Point Estimates Akash Jadhav et.al. 2504.07335 null
2025-04-09 Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation Yu Qi et.al. 2504.06961 null
2025-04-09 GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes Seunghyeok Back et.al. 2504.06866 link
2025-04-09 Setup-Invariant Augmented Reality for Teaching by Demonstration with Surgical Robots Alexandre Banks et.al. 2504.06677 link
2025-04-09 HGMamba: Enhancing 3D Human Pose Estimation with a HyperGCN-Mamba Network Hu Cui et.al. 2504.06638 null
2025-04-08 Leveraging Synthetic Adult Datasets for Unsupervised Infant Pose Estimation Sarosij Bose et.al. 2504.05789 null
2025-04-08 SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes Minghao Ning et.al. 2504.05727 link
2025-04-08 POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction Songyan Zhang et.al. 2504.05692 link
2025-04-10 Learning Affine Correspondences by Integrating Geometric Constraints Pengju Sun et.al. 2504.04834 link
2025-04-10 A Convex and Global Solution for the P $n$ P Problem in 2D Forward-Looking Sonar Jiayi Su et.al. 2504.04445 null
2025-04-05 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS Zhisheng Huang et.al. 2504.04294 null
2025-04-02 A Geometric Approach For Pose and Velocity Estimation Using IMU and Inertial/Body-Frame Measurements Sifeddine Benahmed et.al. 2504.03764 null
2025-04-04 Robust Human Registration with Body Part Segmentation on Noisy Point Clouds Kai Lascheit et.al. 2504.03602 null
2025-04-04 Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video Jiaxin Guo et.al. 2504.03198 null
2025-04-03 Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks Hyun-Ho Choi et.al. 2504.03052 null
2025-04-03 BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation Van Nguyen Nguyen et.al. 2504.02812 null
2025-04-03 PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation Lihua Liu et.al. 2504.02617 null
2025-04-02 Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation Mingrui Ye et.al. 2504.01764 link
2025-04-02 ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue Thomas Pritchard et.al. 2504.01261 link
2025-04-01 AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline Lei Wang et.al. 2504.00394 null
2025-03-31 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Xingyu Chen et.al. 2503.24391 link
2025-03-31 LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds Masahiko Tsuji et.al. 2503.23664 null
2025-03-30 PhysPose: Refining 6D Object Poses with Physical Constraints Martin Malenický et.al. 2503.23587 null
2025-03-30 Improving Indoor Localization Accuracy by Using an Efficient Implicit Neural Map Representation Haofei Kuang et.al. 2503.23480 link
2025-03-30 SparseLoc: Sparse Open-Set Landmark-based Global Localization for Autonomous Navigation Pranjal Paul et.al. 2503.23465 null
2025-03-30 HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation Hongwei Zheng et.al. 2503.23331 null
2025-03-29 Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization Jintao Cheng et.al. 2503.23199 null

Visual Localization

Publish Date Title Authors PDF Code
2025-05-16 Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing Mathis Jürgen Adler et.al. 2505.11121 null
2025-05-04 OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery Chongsheng Zhang et.al. 2505.03836 link
2025-05-06 Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions Lukas Schichler et.al. 2505.03565 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-06 Seeing the Abstract: Translating the Abstract Language for Vision Language Models Davide Talon et.al. 2505.03242 link
2025-05-13 SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2505.01956 null
2025-05-02 NeuroLoc: Encoding Navigation Cells for 6-DOF Camera Localization Xun Li et.al. 2505.01113 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-04-25 From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval Yabing Wang et.al. 2504.17990 null
2025-04-24 A Guide to Structureless Visual Localization Vojtech Panek et.al. 2504.17636 null
2025-04-23 Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval Xin Jiang et.al. 2504.16691 null
2025-04-22 Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs Merve Cerit et.al. 2504.16323 link
2025-04-19 A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling Kyle Buettner et.al. 2504.14359 null
2025-04-17 SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs Haoxuan Li et.al. 2504.13172 null
2025-04-16 Generalized Visual Relation Detection with Diffusion Models Kaifeng Gao et.al. 2504.12100 null
2025-04-15 Visual Re-Ranking with Non-Visual Side Information Gustav Hanning et.al. 2504.11134 link
2025-04-15 TMCIR: Token Merge Benefits Composed Image Retrieval Chaoyang Wang et.al. 2504.10995 null
2025-04-14 Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition Changwei Wang et.al. 2504.09881 null
2025-04-12 Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng et.al. 2504.09155 null
2025-04-11 HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields Asterios Reppas et.al. 2504.08901 null
2025-04-11 Hypergraph Vision Transformers: Images are More than Nodes, More than Edges Joshua Fixelle et.al. 2504.08710 null
2025-04-11 FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations Cheng-Yu Hsieh et.al. 2504.08368 null
2025-04-10 Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval Zehong Ma et.al. 2504.07718 null
2025-04-09 A Pointcloud Registration Framework for Relocalization in Subterranean Environments David Akhihiero et.al. 2504.07231 null
2025-04-09 Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Ruotian Peng et.al. 2504.06666 null
2025-04-08 To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition Davide Sferrazza et.al. 2504.06116 null
2025-04-06 NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval Peng Gao et.al. 2504.04339 null
2025-04-04 REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval Shabnam Choudhury et.al. 2504.03169 null
2025-04-06 Re-thinking Temporal Search for Long-Form Video Understanding Jinhui Ye et.al. 2504.02259 link
2025-04-02 Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval Yuji Nozawa et.al. 2504.01348 null
2025-04-01 IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval Bangwei Liu et.al. 2504.00954 null
2025-04-01 Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data Yiqun Duan et.al. 2504.00812 null
2025-03-31 CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Yingrui Ji et.al. 2503.24182 null
2025-03-31 LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds Masahiko Tsuji et.al. 2503.23664 null
2025-03-30 Multiview Image-Based Localization Cameron Fiore et.al. 2503.23577 null
2025-03-27 LOCORE: Image Re-ranking with Long-Context Sequence Modeling Zilin Xiao et.al. 2503.21772 link
2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null
2025-03-27 UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation Yehui Shen et.al. 2503.21338 link
2025-03-27 FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval Zixu Li et.al. 2503.21309 link

Keypoint Detection

Publish Date Title Authors PDF Code
2025-05-16 Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation Massimiliano Cassia et.al. 2505.11110 null
2025-05-12 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Enabling Privacy-Aware AI-Based Ergonomic Analysis Sander De Coninck et.al. 2505.07306 null
2025-05-09 My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing Jingrui He et.al. 2505.06436 null
2025-05-05 Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration David Rivas-Villar et.al. 2505.02787 null
2025-05-05 Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance David Rivas-Villar et.al. 2505.02779 null
2025-05-04 Focus What Matters: Matchability-Based Reweighting for Local Feature Matching Dongyue Li et.al. 2505.02161 null
2025-05-04 Enhancing Lidar Point Cloud Sampling via Colorization and Super-Resolution of Lidar Imagery Sier Ha et.al. 2505.02049 null
2025-04-29 Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis Muhammad Turab et.al. 2504.21154 null
2025-04-29 Learning a General Model: Folding Clothing with Topological Dynamics Yiming Liu et.al. 2504.20720 null
2025-04-26 VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation Niaz Ahmad et.al. 2504.19032 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-15 UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques Pedro Diaz-Garcia et.al. 2504.11063 null
2025-04-15 Acquisition of high-quality images for camera calibration in robotics applications via speech prompts Timm Linder et.al. 2504.11031 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-29 Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction Shayan Sepahvand et.al. 2503.23171 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-16 Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting Jiadong Zhou et.al. 2503.12541 null
2025-04-12 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-07 Automatic determination of quasicrystalline patterns from microscopy images Tano Kim Kender et.al. 2503.05472 null
2025-03-07 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499 link

2025-5

Pose Estimation

Publish Date Title Authors PDF Code
2025-06-04 Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Tianyu Huang et.al. 2506.04225 null
2025-06-04 Accelerating SfM-based Pose Estimation with Dominating Set Joji Joseph et.al. 2506.03667 null
2025-06-03 Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation Mingjie Wei et.al. 2506.02853 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-02 Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction Samuel Li et.al. 2506.02265 null
2025-06-02 E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models Wenyan Cong et.al. 2506.01933 null
2025-06-02 SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation Sang-Eun Lee et.al. 2506.01691 null
2025-06-01 TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction Yiyao Huang et.al. 2506.00953 null
2025-05-31 XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity Junwen Huang et.al. 2506.00599 null
2025-05-30 Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions Muhammad Suhail Saleem et.al. 2506.00285 null
2025-05-30 6D Pose Estimation on Point Cloud Data through Prior Knowledge Integration: A Case Study in Autonomous Disassembly Chengzhi Wu et.al. 2505.24669 null
2025-05-30 Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data Marios Glytsos et.al. 2505.24636 null
2025-05-30 PCIE_Pose Solution for EgoExo4D Pose and Proficiency Estimation Challenge Feng Chen et.al. 2505.24411 null
2025-05-29 Pose-free 3D Gaussian splatting via shape-ray estimation Youngju Na et.al. 2505.22978 null
2025-05-28 TwinTrack: Bridging Vision and Contact Physics for Real-Time Tracking of Unknown Dynamic Objects Wen Yang et.al. 2505.22882 null
2025-05-28 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Hidenobu Matsuki et.al. 2505.22859 null
2025-05-28 MultiFormer: A Multi-Person Pose Estimation System Based on CSI and Attention Mechanism Yanyi Qu et.al. 2505.22555 null
2025-05-28 Event-based Egocentric Human Pose Estimation in Dynamic Environment Wataru Ikeda et.al. 2505.22007 null
2025-05-27 Spectral Compression Transformer with Line Pose Graph for Monocular 3D Human Pose Estimation Zenghao Zheng et.al. 2505.21309 null
2025-05-29 ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction Adeela Islam et.al. 2505.21117 null
2025-05-27 HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving Bingxiang Kang et.al. 2505.20906 null
2025-05-27 Mamba-Driven Topology Fusion for Monocular 3-D Human Pose Estimation Zenghao Zheng et.al. 2505.20611 null
2025-05-28 HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval Matthew Hong et.al. 2505.20455 null
2025-05-25 Learning the Contact Manifold for Accurate Pose Estimation During Peg-in-Hole Insertion of Complex Geometries Abhay Negi et.al. 2505.19215 null
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 null
2025-05-24 An Inertial Sequence Learning Framework for Vehicle Speed Estimation via Smartphone IMU Xuan Xiao et.al. 2505.18490 null
2025-05-23 Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance Jack Goffinet et.al. 2505.18342 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 null
2025-05-23 Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery Ming Hu et.al. 2505.17677 null
2025-05-23 PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation Uyoung Jeong et.al. 2505.17475 link
2025-05-22 Towards Texture- And Shape-Independent 3D Keypoint Estimation in Birds Valentin Schmuker et.al. 2505.16633 null
2025-05-22 GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation Ming Yang et.al. 2505.16144 null
2025-05-21 Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation Yihang Li et.al. 2505.15098 null
2025-05-20 UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction Nisarga Nilavadi et.al. 2505.14866 null
2025-05-19 Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos Ruoyu Wang et.al. 2505.13440 link
2025-05-19 KinTwin: Imitation Learning with Torque and Muscle Driven Biomechanical Models Enables Precise Replication of Able-Bodied and Impaired Movement from Markerless Motion Capture R. James Cotton et.al. 2505.13436 null
2025-05-19 The Way Up: A Dataset for Hold Usage Detection in Sport Climbing Anna Maschek et.al. 2505.12854 null
2025-05-17 Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation Niaz Ahmad et.al. 2505.12130 null
2025-05-17 Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation Zhiying Li et.al. 2505.12009 null
2025-05-17 ElderFallGuard: Real-Time IoT and Computer Vision-Based Fall Detection System for Elderly Safety Tasrifur Riahi et.al. 2505.11845 null
2025-05-16 SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision Utsav Rai et.al. 2505.11439 null
2025-05-16 MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection Shrutarv Awasthi et.al. 2505.11282 null
2025-05-16 PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation Saad Manzur et.al. 2505.10888 link
2025-05-16 RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects Jaeguk Kim et.al. 2505.10841 null
2025-05-14 UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units Huakun Liu et.al. 2505.09393 link
2025-05-14 APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression Srinivas Ravuri et.al. 2505.09356 link
2025-05-13 Real-time Capable Learning-based Visual Tool Pose Correction via Differentiable Simulation Shuyuan Yang et.al. 2505.08875 null
2025-05-12 Sleep Position Classification using Transfer Learning for Bed-based Pressure Sensors Olivier Papillon et.al. 2505.08111 null
2025-05-07 Pose Estimation for Intra-cardiac Echocardiography Catheter via AI-Based Anatomical Understanding Jaeyoung Huh et.al. 2505.07851 null
2025-05-12 Enabling Privacy-Aware AI-Based Ergonomic Analysis Sander De Coninck et.al. 2505.07306 null
2025-05-13 Human Motion Prediction via Test-domain-aware Adaptation with Easily-available Human Motions Estimated from Videos Katsuki Shimbo et.al. 2505.07301 null
2025-05-12 When Dance Video Archives Challenge Computer Vision Philippe Colantoni et.al. 2505.07249 null
2025-05-10 CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments Shehryar Khattak et.al. 2505.06483 null
2025-05-09 Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach Tim Schneider et.al. 2505.06182 null
2025-05-08 Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors Zunjie Zhu et.al. 2505.05336 null
2025-05-08 Improving Global Motion Estimation in Sparse IMU-based Motion Capture with Physics Xinyu Yi et.al. 2505.05010 null
2025-05-08 An Efficient Method for Accurate Pose Estimation and Error Correction of Cuboidal Objects Utsav Rai et.al. 2505.04962 null
2025-05-07 Comparison of Visual Trackers for Biomechanical Analysis of Running Luis F. Gomez et.al. 2505.04713 null
2025-05-07 Do We Still Need to Work on Odometry for Autonomous Driving? Cedric Le Gentil et.al. 2505.04438 null
2025-05-07 HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation Yajie Fu et.al. 2505.04276 link
2025-05-07 One2Any: One-Reference 6D Pose Estimation for Any Object Mengya Liu et.al. 2505.04109 null
2025-05-06 Polar Coordinate-Based 2D Pose Prior with Neural Distance Field Qi Gan et.al. 2505.03445 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-06 Artificial Behavior Intelligence: Technology, Challenges, and Future Directions Kanghyun Jo et.al. 2505.03315 null
2025-05-05 Dance of Fireworks: An Interactive Broadcast Gymnastics Training System Based on Pose Estimation Haotian Chen et.al. 2505.02690 null
2025-05-05 Corr2Distrib: Making Ambiguous Correspondences an Ally to Predict Reliable 6D Pose Distributions Asma Brazi et.al. 2505.02501 null
2025-05-05 Finger Pose Estimation for Under-screen Fingerprint Sensor Xiongjun Guan et.al. 2505.02481 link
2025-05-05 6D Pose Estimation on Spoons and Hands Kevin Tan et.al. 2505.02335 null
2025-05-04 Continuous Normalizing Flows for Uncertainty-Aware Human Pose Estimation Shipeng Liu et.al. 2505.02287 null
2025-05-04 A Birotation Solution for Relative Pose Problems Hongbo Zhao et.al. 2505.02025 null
2025-05-03 Near-field 5D Pose Estimation using Reconfigurable Intelligent Surfaces Srikar Sharma Sadhu et.al. 2505.01829 null
2025-05-03 AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. 2505.01799 null
2025-05-03 PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth Bu Jin et.al. 2505.01729 null
2025-05-02 T-Graph: Enhancing Sparse-view Camera Pose Estimation by Pairwise Translation Graph Qingyu Xian et.al. 2505.01207 null
2025-05-02 3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer Kamel Aouaidjia et.al. 2505.01003 null
2025-05-01 Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? Viktor Kocur et.al. 2505.00866 null
2025-05-01 P2P-Insole: Human Pose Estimation Using Foot Pressure Distribution and Motion Sensors Atsuya Watanabe et.al. 2505.00755 null
2025-05-01 Dietary Intake Estimation via Continuous 3D Reconstruction of Food Wallace Lee et.al. 2505.00606 null
2025-05-02 InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method Nguyen Hoang Khoi Tran et.al. 2505.00512 null
2025-04-30 Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling Stavrow A. Bahnam et.al. 2504.21695 null
2025-04-29 Dance Style Recognition Using Laban Movement Analysis Muhammad Turab et.al. 2504.21166 null
2025-04-29 Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining Weizhen He et.al. 2504.20800 null
2025-04-29 A Survey on Event-based Optical Marker Systems Nafiseh Jabbari Tofighi et.al. 2504.20736 null
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-05-01 PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking Xiatao Sun et.al. 2504.20359 null
2025-04-28 Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM Leon Davies et.al. 2504.19654 null
2025-04-28 GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM Leon Davies et.al. 2504.19653 null
2025-04-28 Category-Level and Open-Set Object Pose Estimation for Robotics Peter Hönig et.al. 2504.19572 null

Visual Localization

Publish Date Title Authors PDF Code
2025-06-10 Robust Visual Localization via Semantic-Guided Multi-Scale Transformer Zhongtao Tian et.al. 2506.08526 null
2025-06-08 Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs Yikun Ji et.al. 2506.07045 null
2025-06-07 Zero Shot Composed Image Retrieval Santhosh Kakarla et.al. 2506.06602 null
2025-06-06 GenIR: Generative Visual Feedback for Mental Image Retrieval Diji Yang et.al. 2506.06220 null
2025-06-06 Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning Sheng Chen et.al. 2506.06205 null
2025-06-05 HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition Suhan Woo et.al. 2506.04764 null
2025-06-05 Deep Learning Reforms Image Matching: A Survey and Outlook Shihua Zhang et.al. 2506.04619 null
2025-06-02 Entity Image and Mixed-Modal Image Retrieval Datasets Cristian-Ioan Blaga et.al. 2506.02291 null
2025-06-01 Quantization-based Bounds on the Wasserstein Metric Jonathan Bobrutsky et.al. 2506.00976 null
2025-05-30 SORCE: Small Object Retrieval in Complex Environments Chunxu Liu et.al. 2505.24441 link
2025-05-29 Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch Aneeshan Sain et.al. 2505.23763 null
2025-05-28 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Hidenobu Matsuki et.al. 2505.22859 null
2025-05-28 UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images Junhuan Liu et.al. 2505.22098 null
2025-05-28 Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule San Jiang et.al. 2505.22089 null
2025-05-27 QuARI: Query Adaptive Retrieval Improvement Eric Xing et.al. 2505.21647 null
2025-05-27 ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval Eric Xing et.al. 2505.20764 null
2025-05-26 Visualized Text-to-Image Retrieval Di Wu et.al. 2505.20291 link
2025-05-26 Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval Rong-Cheng Tu et.al. 2505.19952 null
2025-05-26 Can Visual Encoder Learn to See Arrows? Naoyuki Terashita et.al. 2505.19944 null
2025-05-26 MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval Rong-Cheng Tu et.al. 2505.19707 null
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 null
2025-05-24 TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP Yuliang Cai et.al. 2505.18434 null
2025-05-23 ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models Minwoo Jung et.al. 2505.18364 null
2025-05-23 DART $^3$ : Leveraging Distance for Test Time Adaptation in Person Re-Identification Rajarshi Bhattacharya et.al. 2505.18337 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 null
2025-05-23 DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval Yuxin Yang et.al. 2505.17796 null
2025-05-22 TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition Oliver Grainge et.al. 2505.16447 null
2025-05-21 Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval Siting Li et.al. 2505.15877 null
2025-05-21 SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval Nikolaos Chaidos et.al. 2505.15867 link
2025-05-20 Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models Kiarash Naghavi Khanghah et.al. 2505.13828 null
2025-05-18 MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark Yiwei Ou et.al. 2505.12254 null
2025-05-16 Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization Aaron Wilhelm et.al. 2505.11620 null
2025-05-16 Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing Mathis Jürgen Adler et.al. 2505.11121 null
2025-05-04 OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery Chongsheng Zhang et.al. 2505.03836 link
2025-05-06 Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions Lukas Schichler et.al. 2505.03565 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-06 Seeing the Abstract: Translating the Abstract Language for Vision Language Models Davide Talon et.al. 2505.03242 link
2025-05-13 SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2505.01956 null
2025-05-02 NeuroLoc: Encoding Navigation Cells for 6-DOF Camera Localization Xun Li et.al. 2505.01113 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-04-25 From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval Yabing Wang et.al. 2504.17990 null
2025-04-24 A Guide to Structureless Visual Localization Vojtech Panek et.al. 2504.17636 null
2025-04-23 Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval Xin Jiang et.al. 2504.16691 null
2025-04-22 Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs Merve Cerit et.al. 2504.16323 link
2025-04-19 A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling Kyle Buettner et.al. 2504.14359 null
2025-04-17 SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs Haoxuan Li et.al. 2504.13172 null
2025-04-16 Generalized Visual Relation Detection with Diffusion Models Kaifeng Gao et.al. 2504.12100 null
2025-04-15 Visual Re-Ranking with Non-Visual Side Information Gustav Hanning et.al. 2504.11134 link
2025-04-15 TMCIR: Token Merge Benefits Composed Image Retrieval Chaoyang Wang et.al. 2504.10995 null

Keypoint Detection

Publish Date Title Authors PDF Code
2025-07-17 DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model Maulana Bisyir Azhari et.al. 2507.13145 null
2025-07-15 KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model Jie Yang et.al. 2507.11102 null
2025-07-15 GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft Weizhao Ma et.al. 2507.11077 null
2025-07-14 FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching Ionuţ Grigore et.al. 2507.10770 null
2025-07-11 Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection Subhajit Maity et.al. 2507.07994 null
2025-07-09 Reading a Ruler in the Wild Yimu Pan et.al. 2507.07077 null
2025-07-09 MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning Yifan Yang et.al. 2507.06662 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-05-29 TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning Ron Shapira Weber et.al. 2505.23475 link
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 link
2025-05-18 SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving Muleilan Pei et.al. 2505.12246 null
2025-05-17 Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation Niaz Ahmad et.al. 2505.12130 null
2025-05-16 Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation Massimiliano Cassia et.al. 2505.11110 null
2025-06-19 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Enabling Privacy-Aware AI-Based Ergonomic Analysis Sander De Coninck et.al. 2505.07306 null
2025-05-09 My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing Jingrui He et.al. 2505.06436 null
2025-05-05 Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration David Rivas-Villar et.al. 2505.02787 null
2025-05-05 Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance David Rivas-Villar et.al. 2505.02779 null
2025-05-04 Focus What Matters: Matchability-Based Reweighting for Local Feature Matching Dongyue Li et.al. 2505.02161 null
2025-05-04 Enhancing Lidar Point Cloud Sampling via Colorization and Super-Resolution of Lidar Imagery Sier Ha et.al. 2505.02049 null
2025-04-29 Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis Muhammad Turab et.al. 2504.21154 null
2025-04-29 Learning a General Model: Folding Clothing with Topological Dynamics Yiming Liu et.al. 2504.20720 null
2025-04-26 VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation Niaz Ahmad et.al. 2504.19032 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-15 UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques Pedro Diaz-Garcia et.al. 2504.11063 null
2025-04-15 Acquisition of high-quality images for camera calibration in robotics applications via speech prompts Timm Linder et.al. 2504.11031 null

2025-6

Pose Estimation

Publish Date Title Authors PDF Code
2025-07-03 Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning Buzhen Huang et.al. 2507.02565 null
2025-07-03 IMASHRIMP: Automatic White Shrimp (Penaeus vannamei) Biometrical Analysis from Laboratory Images Using Computer Vision and Deep Learning Abiam Remache González et.al. 2507.02519 null
2025-07-03 3D Heart Reconstruction from Sparse Pose-agnostic 2D Echocardiographic Slices Zhurong Chen et.al. 2507.02411 null
2025-07-03 LMPNet for Weakly-supervised Keypoint Discovery Pei Guo et.al. 2507.02308 null
2025-07-02 What does really matter in image goal navigation? Gianluca Monaci et.al. 2507.01667 null
2025-07-01 2024 NASA SUITS Report: LLM-Driven Immersive Augmented Reality User Interface for Robotics and Space Exploration Kathy Zhuang et.al. 2507.01206 null
2025-07-01 Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation Hao Xing et.al. 2507.00752 null
2025-07-01 LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment Juelin Zhu et.al. 2507.00659 null
2025-06-30 Computer Vision for Objects used in Group Work: Challenges and Opportunities Changsoo Jung et.al. 2507.00224 null
2025-06-30 Validation of AI-Based 3D Human Pose Estimation in a Cyber-Physical Environment Lisa Marie Otto et.al. 2506.23739 null
2025-06-30 MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments Sai Krishna Ghanta et.al. 2506.23514 null
2025-06-29 TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints Zhen Tan et.al. 2506.23207 null
2025-06-28 Deterministic Object Pose Confidence Region Estimation Jinghao Wang et.al. 2506.22720 null
2025-06-27 Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration Noora Sassali et.al. 2506.22116 null
2025-06-27 Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras Petr Hruby et.al. 2506.22069 null
2025-06-24 ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes Chenhao Zhang et.al. 2506.21629 null
2025-06-26 EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting Taoyu Wu et.al. 2506.21420 null
2025-06-26 CURL-SLAM: Continuous and Compact LiDAR Mapping Kaicheng Zhang et.al. 2506.21077 null
2025-06-27 DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation Wenzhou Lyu et.al. 2506.21034 null
2025-06-25 How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction? Stephanie Käs et.al. 2506.20795 null
2025-06-26 Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception Eric C. Joyce et.al. 2506.20045 null
2025-06-24 Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images Stephanie Käs et.al. 2506.19747 null
2025-06-23 RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base Kuanning Wang et.al. 2506.18856 null
2025-06-19 Reproducible Evaluation of Camera Auto-Exposure Methods in the Field: Platform, Benchmark and Lessons Learned Olivier Gamache et.al. 2506.18844 null
2025-06-23 SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives Yizhou Chen et.al. 2506.18825 null
2025-06-20 RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking Teng Guo et.al. 2506.17119 link
2025-06-20 Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping Teng Guo et.al. 2506.17110 null
2025-06-20 LunarLoc: Segment-Based Global Localization on the Moon Annika Thomas et.al. 2506.16940 link
2025-06-19 ControlVLA: Few-shot Object-centric Adaptation for Pre-trained Vision-Language-Action Models Puhao Li et.al. 2506.16211 null
2025-06-19 STAR-Pose: Efficient Low-Resolution Video Human Pose Estimation via Spatial-Temporal Adaptive Super-Resolution Yucheng Jin et.al. 2506.16061 null
2025-06-19 KARL: Kalman-Filter Assisted Reinforcement Learner for Dynamic Object Tracking and Grasping Kowndinya Boyalakuntla et.al. 2506.15945 null
2025-06-19 Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization Yosub Shin et.al. 2506.15937 null
2025-06-18 Improving Robotic Manipulation: Techniques for Object Pose Estimation, Accommodating Positional Uncertainty, and Disassembly Tasks from Examples Viral Rasik Galaiya et.al. 2506.15865 null
2025-06-18 PRISM-Loc: a Lightweight Long-range LiDAR Localization in Urban Environments with Topological Maps Kirill Muravyev et.al. 2506.15849 null
2025-06-18 Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models Andela Ilic et.al. 2506.15290 null
2025-06-18 RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. 2506.15242 null
2025-06-17 PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation Ming Xu et.al. 2506.14596 null
2025-06-17 Non-Overlap-Aware Egocentric Pose Estimation for Collaborative Perception in Connected Autonomy Hong Huang et.al. 2506.14180 null
2025-06-17 TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping Jeewon Kim et.al. 2506.14178 null
2025-06-16 Diffusion-based Inverse Observation Model for Artificial Skin Ante Maric et.al. 2506.13986 null
2025-06-16 PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images Lingteng Qiu et.al. 2506.13766 null
2025-06-16 JENGA: Object selection and pose estimation for robotic grasping from a stack Sai Srinivas Jeevanandam et.al. 2506.13425 null
2025-06-16 Automatic Multi-View X-Ray/CT Registration Using Bone Substructure Contours Roman Flepp et.al. 2506.13292 null
2025-06-16 DETRPose: Real-time end-to-end transformer model for multi-person pose estimation Sebastian Janampa et.al. 2506.13027 link
2025-06-15 A large-scale, physically-based synthetic dataset for satellite pose estimation Szabolcs Velkei et.al. 2506.12782 null
2025-06-13 ViTaSCOPE: Visuo-tactile Implicit Representation for In-hand Pose and Extrinsic Contact Estimation Jayjun Lee et.al. 2506.12239 null
2025-06-10 Monocular 3D Hand Pose Estimation with Implicit Camera Alignment Christos Pantazopoulos et.al. 2506.11133 null
2025-06-12 Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders Hui Yang et.al. 2506.10816 null
2025-06-12 In-Hand Object Pose Estimation via Visual-Tactile Fusion Felix Nonnengießer et.al. 2506.10787 null
2025-06-11 Fluoroscopic Shape and Pose Tracking of Catheters with Custom Radiopaque Markers Jared Lawson et.al. 2506.09934 null
2025-06-11 EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks Athinoulla Konstantinou et.al. 2506.09895 link
2025-06-11 Accurate and efficient zero-shot 6D pose estimation with frozen foundation models Andrea Caraffa et.al. 2506.09784 null
2025-06-11 CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings Mattia Nardon et.al. 2506.09699 null
2025-06-10 Princeton365: A Diverse Dataset with Accurate Camera Pose Karhan Kayan et.al. 2506.09035 null
2025-06-10 ArrowPose: Segmentation, Detection, and 5 DoF Pose Estimation Network for Colorless Point Clouds Frederik Hagelskjaer et.al. 2506.08699 null
2025-06-09 UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References Ming-Feng Li et.al. 2506.07996 null
2025-06-09 Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation Yijie Deng et.al. 2506.07338 null
2025-06-10 From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models Pablo Acuaviva et.al. 2506.07280 null
2025-06-08 GoTrack: Generic 6DoF Object Pose Refinement and Tracking Van Nguyen Nguyen et.al. 2506.07155 null
2025-06-08 UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment Wentao Zhao et.al. 2506.07013 null
2025-06-07 Deep Inertial Pose: A deep learning approach for human pose estimation Sara M. Cerqueira et.al. 2506.06850 null
2025-06-06 Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments Mingrui Li et.al. 2506.05965 null
2025-06-06 SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction Yuchao Zheng et.al. 2506.05935 null
2025-06-06 CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy Jiakai Zhang et.al. 2506.05864 null
2025-06-06 You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping Jingshun Huang et.al. 2506.05719 null
2025-06-05 On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images Andreas Meuleman et.al. 2506.05558 null
2025-06-05 Rectified Point Flow: Generic Point Cloud Pose Estimation Tao Sun et.al. 2506.05282 null
2025-06-05 Realizing Text-Driven Motion Generation on NAO Robot: A Reinforcement Learning-Optimized Control Pipeline Zihan Xu et.al. 2506.05117 link
2025-06-05 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx Lukas Picek et.al. 2506.04931 null
2025-06-05 SupeRANSAC: One RANSAC to Rule Them All Daniel Barath et.al. 2506.04803 null
2025-06-05 LGM-Pose: A Lightweight Global Modeling Network for Real-time Human Pose Estimation Biao Guo et.al. 2506.04561 null
2025-06-04 Photoreal Scene Reconstruction from an Egocentric Device Zhaoyang Lv et.al. 2506.04444 link
2025-06-04 cuVSLAM: CUDA accelerated visual odometry Alexander Korovko et.al. 2506.04359 null
2025-06-04 Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Tianyu Huang et.al. 2506.04225 null
2025-06-04 Accelerating SfM-based Pose Estimation with Dominating Set Joji Joseph et.al. 2506.03667 null
2025-06-03 Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation Mingjie Wei et.al. 2506.02853 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-02 Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction Samuel Li et.al. 2506.02265 null
2025-06-02 E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models Wenyan Cong et.al. 2506.01933 null
2025-06-02 SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation Sang-Eun Lee et.al. 2506.01691 null
2025-06-01 TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction Yiyao Huang et.al. 2506.00953 null
2025-05-31 XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity Junwen Huang et.al. 2506.00599 null
2025-05-30 Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions Muhammad Suhail Saleem et.al. 2506.00285 null
2025-05-30 6D Pose Estimation on Point Cloud Data through Prior Knowledge Integration: A Case Study in Autonomous Disassembly Chengzhi Wu et.al. 2505.24669 null
2025-05-30 Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data Marios Glytsos et.al. 2505.24636 null

Visual Localization

Publish Date Title Authors PDF Code
2025-07-08 Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval Haiwen Li et.al. 2507.05970 null
2025-07-08 OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval Zhiwei Chen et.al. 2507.05631 null
2025-07-07 Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Mengyao Xu et.al. 2507.05513 null
2025-07-07 An analysis of vision-language models for fabric retrieval Francesco Giuliari et.al. 2507.04735 null
2025-07-08 What’s Making That Sound Right Now? Video-centric Audio-Visual Localization Hahyeon Choi et.al. 2507.04667 null
2025-07-06 U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration Xiaofan Li et.al. 2507.04503 null
2025-07-04 Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition Jiuhong Xiao et.al. 2507.03831 null
2025-07-01 LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment Juelin Zhu et.al. 2507.00659 null
2025-06-28 Utilizing a Novel Deep Learning Method for Scene Categorization in Remote Sensing Data Ghufran A. Omran et.al. 2506.22939 null
2025-06-28 Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval Li-Cheng Shen et.al. 2506.22864 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-26 OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography Caoshuo Li et.al. 2506.21101 null
2025-06-25 Visualizing intercalation effects in 2D materials using AFM based techniques Karmen Kapustić et.al. 2506.20467 null
2025-06-25 On the Burstiness of Faces in Set Jiong Wang et.al. 2506.20312 null
2025-06-24 jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval Michael Günther et.al. 2506.18902 null
2025-06-26 Referring Expression Instance Retrieval and A Strong End-to-End Baseline Xiangzhao Hao et.al. 2506.18246 null
2025-06-20 Class Agnostic Instance-level Descriptor for Visual Instance Search Qi-Ying Sun et.al. 2506.16745 null
2025-06-19 MambaHash: Visual State Space Deep Hashing Model for Large-Scale Image Retrieval Chao He et.al. 2506.16353 link
2025-06-19 Fine-grained Image Retrieval via Dual-Vision Adaptation Xin Jiang et.al. 2506.16273 null
2025-06-19 Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation Connor Malone et.al. 2506.15988 link
2025-06-18 Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles Qiyuan Wu et.al. 2506.15851 null
2025-06-18 ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections Ziling Huang et.al. 2506.15180 null
2025-06-17 HARMONY: A Scalable Distributed Vector Database for High-Throughput Approximate Nearest Neighbor Search Qian Xu et.al. 2506.14707 null
2025-06-16 A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation Xiaoyang Wei et.al. 2506.13509 null
2025-06-19 Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval Kshitij Kavimandan et.al. 2506.13496 null
2025-06-16 EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition Bingxi Liu et.al. 2506.13133 null
2025-06-16 SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models Bingxi Liu et.al. 2506.13073 null
2025-06-14 Feature Complementation Architecture for Visual Place Recognition Weiwei Wang et.al. 2506.12401 null
2025-06-11 Towards a general-purpose foundation model for fMRI analysis Cheng Wang et.al. 2506.11167 null
2025-06-11 Improving Personalized Search with Regularized Low-Rank Parameter Updates Fiona Ryan et.al. 2506.10182 link
2025-06-10 Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment Tianyu Chen et.al. 2506.10030 null
2025-06-11 Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints Xiangkai Zhang et.al. 2506.09748 null
2025-06-10 Robust Visual Localization via Semantic-Guided Multi-Scale Transformer Zhongtao Tian et.al. 2506.08526 null
2025-06-08 Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs Yikun Ji et.al. 2506.07045 null
2025-06-07 Zero Shot Composed Image Retrieval Santhosh Kakarla et.al. 2506.06602 null
2025-06-06 GenIR: Generative Visual Feedback for Mental Image Retrieval Diji Yang et.al. 2506.06220 null
2025-06-06 Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning Sheng Chen et.al. 2506.06205 null
2025-06-05 HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition Suhan Woo et.al. 2506.04764 null
2025-06-05 Deep Learning Reforms Image Matching: A Survey and Outlook Shihua Zhang et.al. 2506.04619 null
2025-06-02 Entity Image and Mixed-Modal Image Retrieval Datasets Cristian-Ioan Blaga et.al. 2506.02291 null
2025-06-01 Quantization-based Bounds on the Wasserstein Metric Jonathan Bobrutsky et.al. 2506.00976 null
2025-05-30 SORCE: Small Object Retrieval in Complex Environments Chunxu Liu et.al. 2505.24441 link
2025-05-29 Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch Aneeshan Sain et.al. 2505.23763 null
2025-05-28 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Hidenobu Matsuki et.al. 2505.22859 null
2025-05-28 UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images Junhuan Liu et.al. 2505.22098 null
2025-05-28 Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule San Jiang et.al. 2505.22089 null
2025-05-27 QuARI: Query Adaptive Retrieval Improvement Eric Xing et.al. 2505.21647 null
2025-05-27 ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval Eric Xing et.al. 2505.20764 null
2025-05-26 Visualized Text-to-Image Retrieval Di Wu et.al. 2505.20291 link

Keypoint Detection

Publish Date Title Authors PDF Code
2025-07-23 CartoonAlive: Towards Expressive Live2D Modeling from Single Portraits Chao He et.al. 2507.17327 null
2025-07-21 Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors Mohamed Adjel et.al. 2507.16850 null
2025-07-17 DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model Maulana Bisyir Azhari et.al. 2507.13145 null
2025-07-15 KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model Jie Yang et.al. 2507.11102 null
2025-07-15 GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft Weizhao Ma et.al. 2507.11077 null
2025-07-14 FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching Ionuţ Grigore et.al. 2507.10770 null
2025-07-11 Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection Subhajit Maity et.al. 2507.07994 null
2025-07-09 Reading a Ruler in the Wild Yimu Pan et.al. 2507.07077 null
2025-07-09 MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning Yifan Yang et.al. 2507.06662 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-05-29 TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning Ron Shapira Weber et.al. 2505.23475 link
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 link
2025-05-18 SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving Muleilan Pei et.al. 2505.12246 null
2025-05-17 Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation Niaz Ahmad et.al. 2505.12130 null
2025-05-16 Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation Massimiliano Cassia et.al. 2505.11110 null
2025-06-19 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Enabling Privacy-Aware AI-Based Ergonomic Analysis Sander De Coninck et.al. 2505.07306 null
2025-05-09 My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing Jingrui He et.al. 2505.06436 null
2025-05-05 Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration David Rivas-Villar et.al. 2505.02787 null
2025-05-05 Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance David Rivas-Villar et.al. 2505.02779 null

2025-7

Pose Estimation

Publish Date Title Authors PDF Code
2025-07-23 RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction Yuqing Lan et.al. 2507.17594 null
2025-07-23 Physics-based Human Pose Estimation from a Single Moving RGB Camera Ayce Idil Aytekin et.al. 2507.17406 null
2025-07-21 Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors Mohamed Adjel et.al. 2507.16850 null
2025-07-22 Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers Batu Candan et.al. 2507.16214 null
2025-07-21 TONUS: Neuromorphic human pose estimation for artistic sound co-creation Jules Lecomte et.al. 2507.15734 null
2025-07-21 Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing Boni Hu et.al. 2507.15683 null
2025-07-21 Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images JunYing Huang et.al. 2507.15496 null
2025-07-20 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline Kaishva Chintan Shah et.al. 2507.14924 null
2025-07-20 An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks Xinyi Wu et.al. 2507.14798 null
2025-07-22 AI-Enhanced Precision in Sport Taekwondo: Increasing Fairness, Speed, and Trust in Competition (FST.ai) Keivan Shariatmadar et.al. 2507.14657 null
2025-07-18 C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs Yung-Hong Sun et.al. 2507.14095 null
2025-07-21 PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations Yu Wei et.al. 2507.13891 null
2025-07-18 MaskHOI: Robust 3D Hand-Object Interaction Estimation via Masked Pre-training Yuechen Xie et.al. 2507.13673 null
2025-07-17 $π^3$ : Scalable Permutation-Equivariant Visual Geometry Learning Yifan Wang et.al. 2507.13347 null
2025-07-17 Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark Junsu Kim et.al. 2507.13314 null
2025-07-17 DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model Maulana Bisyir Azhari et.al. 2507.13145 null
2025-07-17 AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability Tomohiro Suzuki et.al. 2507.12905 null
2025-07-17 From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation Mengxi Liu et.al. 2507.12884 null
2025-07-19 SpatialTrackerV2: 3D Point Tracking Made Easy Yuxi Xiao et.al. 2507.12462 null
2025-07-16 Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation Antonio Finocchiaro et.al. 2507.12292 null
2025-07-16 UniLGL: Learning Uniform Place Recognition for FOV-limited/Panoramic LiDAR Global Localization Hongming Shen et.al. 2507.12194 null
2025-07-16 BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images Davide Di Nucci et.al. 2507.12095 null
2025-07-16 SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation Beining Xu et.al. 2507.12027 null
2025-07-16 SEPose: A Synthetic Event-based Human Pose Estimation Dataset for Pedestrian Monitoring Kaustav Chanda et.al. 2507.11910 null
2025-07-15 GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft Weizhao Ma et.al. 2507.11077 null
2025-07-15 Joint angle model based learning to refine kinematic human pose estimation Chang Peng et.al. 2507.11075 null
2025-07-14 Raci-Net: Ego-vehicle Odometry Estimation in Adverse Weather Conditions Mohammadhossein Talebi et.al. 2507.10376 null
2025-07-14 Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures Xinlong Ding et.al. 2507.10265 null
2025-07-14 ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users Xiangyu Yin et.al. 2507.10223 null
2025-07-13 VST-Pose: A Velocity-Integrated Spatiotem-poral Attention Network for Human WiFi Pose Estimation Xinyu Zhang et.al. 2507.09672 null
2025-07-13 EHPE: A Segmented Architecture for Enhanced Hand Pose Estimation Bolun Zheng et.al. 2507.09560 null
2025-07-13 Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding Yanchen Wang et.al. 2507.09513 null
2025-07-12 PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP Alignment Dewen Zhang et.al. 2507.09139 null
2025-07-10 RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Chong Cheng et.al. 2507.08136 null
2025-07-10 SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation Juyeop Han et.al. 2507.07467 null
2025-07-09 g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM Quanjie Qiu et.al. 2507.07142 null
2025-07-09 Smartphone Exergames with Real-Time Markerless Motion Capture: Challenges and Trade-offs Mathieu Phosanarack et.al. 2507.06669 null
2025-07-09 MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning Yifan Yang et.al. 2507.06662 null
2025-07-09 Mask6D: Masked Pose Priors For 6D Object Pose Estimation Yuechen Xie et.al. 2507.06486 null
2025-07-08 SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations Yegyu Han et.al. 2507.05751 null
2025-07-08 Event-RGB Fusion for Spacecraft Pose Estimation Under Harsh Lighting Mohsi Jawaid et.al. 2507.05698 null
2025-07-07 W2W: A Simulated Exploration of IMU Placement Across the Human Body for Designing Smarter Wearable Lala Shakti Swarup Ray et.al. 2507.05532 null
2025-07-07 UDF-GMA: Uncertainty Disentanglement and Fusion for General Movement Assessment Zeqi Luo et.al. 2507.04814 null
2025-07-06 Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning and Inference Niels Leadholm et.al. 2507.04494 null
2025-07-09 Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM Xiaolei Lang et.al. 2507.04004 null
2025-07-05 Accurate Pose Estimation Using Contact Manifold Sampling for Safe Peg-in-Hole Insertion of Complex Geometries Abhay Negi et.al. 2507.03925 null
2025-07-02 Markerless Stride Length estimation in Athletic using Pose Estimation with monocular vision Patryk Skorupski et.al. 2507.03016 null
2025-07-03 Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning Buzhen Huang et.al. 2507.02565 null
2025-07-03 IMASHRIMP: Automatic White Shrimp (Penaeus vannamei) Biometrical Analysis from Laboratory Images Using Computer Vision and Deep Learning Abiam Remache González et.al. 2507.02519 null
2025-07-03 3D Heart Reconstruction from Sparse Pose-agnostic 2D Echocardiographic Slices Zhurong Chen et.al. 2507.02411 null
2025-07-03 LMPNet for Weakly-supervised Keypoint Discovery Pei Guo et.al. 2507.02308 null
2025-07-02 What does really matter in image goal navigation? Gianluca Monaci et.al. 2507.01667 null
2025-07-01 2024 NASA SUITS Report: LLM-Driven Immersive Augmented Reality User Interface for Robotics and Space Exploration Kathy Zhuang et.al. 2507.01206 null
2025-07-01 Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation Hao Xing et.al. 2507.00752 null
2025-07-01 LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment Juelin Zhu et.al. 2507.00659 null
2025-06-30 Computer Vision for Objects used in Group Work: Challenges and Opportunities Changsoo Jung et.al. 2507.00224 null
2025-06-30 Validation of AI-Based 3D Human Pose Estimation in a Cyber-Physical Environment Lisa Marie Otto et.al. 2506.23739 null
2025-06-30 MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments Sai Krishna Ghanta et.al. 2506.23514 null
2025-06-29 TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints Zhen Tan et.al. 2506.23207 null
2025-06-28 Deterministic Object Pose Confidence Region Estimation Jinghao Wang et.al. 2506.22720 null
2025-06-27 Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration Noora Sassali et.al. 2506.22116 null

Visual Localization

Publish Date Title Authors PDF Code
2025-07-23 VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization Sania Waheed et.al. 2507.17455 null
2025-07-23 Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging Farnaz Khun Jush et.al. 2507.17412 null
2025-07-20 Visual Place Recognition for Large-Scale UAV Applications Ioannis Tsampikos Papapetros et.al. 2507.15089 null
2025-07-20 U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs Xiaojie Li et.al. 2507.14902 null
2025-07-19 OptiCorNet: Optimizing Sequence-Based Context Correlation for Visual Place Recognition Zhenyu Li et.al. 2507.14477 null
2025-07-16 Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired Jiayu et.al. 2507.14215 null
2025-07-17 FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval Jeong-Woo Park et.al. 2507.12823 null
2025-07-17 MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval Jeong-Woo Park et.al. 2507.12819 null
2025-07-16 QuRe: Query-Relevant Retrieval through Hard Negative Sampling in Composed Image Retrieval Jaehyun Kwak et.al. 2507.12416 null
2025-07-16 CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning Peiwen Xia et.al. 2507.11834 null
2025-07-09 Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning Konstantinos I. Roumeliotis et.al. 2507.10571 null
2025-07-14 GT-Loc: Unifying When and Where in Images Through a Joint Embedding Space David G. Shatwell et.al. 2507.10473 null
2025-07-14 Text-to-Remote-Sensing-Image Retrieval beyond RGB Sources Daniele Rege Cambrin et.al. 2507.10403 null
2025-07-14 Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures Xinlong Ding et.al. 2507.10265 null
2025-07-11 RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features Inye Na et.al. 2507.08546 null
2025-07-11 Deep Hashing with Semantic Hash Centers for Image Retrieval Li Chen et.al. 2507.08404 null
2025-07-08 Unveiling Effective In-Context Configurations for Image Captioning: An External & Internal Analysis Li Li et.al. 2507.08021 null
2025-07-10 SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation Juyeop Han et.al. 2507.07467 null
2025-07-10 VP-SelDoA: Visual-prompted Selective DoA Estimation of Target Sound via Semantic-Spatial Matching Yu Chen et.al. 2507.07384 null
2025-07-08 FACap: A Large-scale Fashion Dataset for Fine-grained Composed Image Retrieval François Gardères et.al. 2507.07135 null
2025-07-09 Evaluating Attribute Confusion in Fashion Text-to-Image Generation Ziyue Liu et.al. 2507.07079 null
2025-07-09 MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval Naoya Sogi et.al. 2507.06654 null
2025-07-08 Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval Haiwen Li et.al. 2507.05970 null
2025-07-08 OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval Zhiwei Chen et.al. 2507.05631 null
2025-07-07 Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Mengyao Xu et.al. 2507.05513 null
2025-07-07 An analysis of vision-language models for fabric retrieval Francesco Giuliari et.al. 2507.04735 null
2025-07-08 What’s Making That Sound Right Now? Video-centric Audio-Visual Localization Hahyeon Choi et.al. 2507.04667 null
2025-07-06 U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration Xiaofan Li et.al. 2507.04503 null
2025-07-04 Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition Jiuhong Xiao et.al. 2507.03831 null
2025-07-01 LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment Juelin Zhu et.al. 2507.00659 null
2025-06-28 Utilizing a Novel Deep Learning Method for Scene Categorization in Remote Sensing Data Ghufran A. Omran et.al. 2506.22939 null
2025-06-28 Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval Li-Cheng Shen et.al. 2506.22864 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-26 OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography Caoshuo Li et.al. 2506.21101 null
2025-06-25 Visualizing intercalation effects in 2D materials using AFM based techniques Karmen Kapustić et.al. 2506.20467 null
2025-06-25 On the Burstiness of Faces in Set Jiong Wang et.al. 2506.20312 null
2025-06-24 jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval Michael Günther et.al. 2506.18902 null
2025-06-26 Referring Expression Instance Retrieval and A Strong End-to-End Baseline Xiangzhao Hao et.al. 2506.18246 null
2025-06-20 Class Agnostic Instance-level Descriptor for Visual Instance Search Qi-Ying Sun et.al. 2506.16745 null

Keypoint Detection

Publish Date Title Authors PDF Code
2025-07-23 CartoonAlive: Towards Expressive Live2D Modeling from Single Portraits Chao He et.al. 2507.17327 null
2025-07-21 Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors Mohamed Adjel et.al. 2507.16850 null
2025-07-17 DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model Maulana Bisyir Azhari et.al. 2507.13145 null
2025-07-15 KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model Jie Yang et.al. 2507.11102 null
2025-07-15 GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft Weizhao Ma et.al. 2507.11077 null
2025-07-14 FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching Ionuţ Grigore et.al. 2507.10770 null
2025-07-11 Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection Subhajit Maity et.al. 2507.07994 null
2025-07-09 Reading a Ruler in the Wild Yimu Pan et.al. 2507.07077 null
2025-07-09 MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning Yifan Yang et.al. 2507.06662 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-05-29 TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning Ron Shapira Weber et.al. 2505.23475 link
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 link
2025-05-18 SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving Muleilan Pei et.al. 2505.12246 null
2025-05-17 Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation Niaz Ahmad et.al. 2505.12130 null
2025-05-16 Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation Massimiliano Cassia et.al. 2505.11110 null
2025-06-19 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null