Environment Perception


This directory collects papers and code implementations related to environment perception in embodied AI.

Main Contents

Manually Added Papers

| Date | Title | Paper | Code | Rating |
| --- | --- | --- | --- | --- |
| 2024 | DreamWaQ++: Obstacle-aware quadrupedal locomotion with resilient multi-modal reinforcement learning | [pdf] | ⚠️ | ⭐️⭐️ |
| 2024 | Open-TeleVision: Teleoperation with Immersive Active Visual Feedback | [pdf] | OpenTeleVision/TeleVision | ⭐️⭐️ |
| 2023 | DreamWaQ: Learning Robust Quadrupedal Locomotion With Implicit Terrain Imagination | [pdf] | Manaro-Alpha/DreamWaQ | ⭐️⭐️⭐️ |
| 2022 | Elevation Mapping for Locomotion using GPU | [pdf] | leggedrobotics/elevation_mapping_cupy | ⭐️⭐️ |
| 2022 | Learning robust perceptive locomotion for quadrupedal robots | [pdf] | ⚠️ | ⭐️⭐️⭐️ |

Auto-Updated Papers

| Date | Title | Paper | Code | Rating |
| --- | --- | --- | --- | --- |
| 2025-06-26 | Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-26 | [ACTLLM] ACTLLM: Action Consistency Tuned Large Language Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-26 | [CURL-SLAM] CURL-SLAM: Continuous and Compact LiDAR Mapping | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-25 | Multimodal Behaviour Trees for Robotic Laboratory Task Automation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-24 | Robotics Under Construction: Challenges on Job Sites | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-24 | [AirV2X] AirV2X: Unified Air-Ground Vehicle-to-Everything Collaboration | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-24 | Ontology Neural Network and ORTSF: A Framework for Topological Reasoning and Delay-Robust Control | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-23 | Learning Approach to Efficient Vision-based Active Tracking of a Flying Target by an Unmanned Aerial Vehicle | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-23 | Multimodal Anomaly Detection with a Mixture-of-Experts | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-23 | FORTE: Tactile Force and Slip Sensing on Compliant Fingers for Delicate Manipulation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-21 | Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-20 | General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-20 | [Trajectory] Distilling On-device Language Models for Robot Planning with Minimal Human Intervention | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-18 | Efficient and Generalizable Environmental Understanding for Visual Navigation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-18 | [CLAIM] CLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-18 | 3D Vision-tactile Reconstruction from Infrared and Visible Images for Robotic Fine-grained Tactile Perception | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-18 | [VIMS] VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-18 | Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-18 | Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-17 | Determinação Automática de Limiar de Detecção de Ataques em Redes de Computadores Utilizando Autoencoders | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-17 | [VisLanding] VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-17 | Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-16 | Uncertainty-Informed Active Perception for Open Vocabulary Object Goal Navigation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-16 | Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-16 | A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-13 | Auditory-Tactile Congruence for Synthesis of Adaptive Pain Expressions in RoboPatients | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-11 | [DCIRNet] DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-11 | VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-11 | Fluoroscopic Shape and Pose Tracking of Catheters with Custom Radiopaque Markers | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-09 | Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-09 | Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-08 | Advancing Multimodal Reasoning Capabilities of Multimodal Large Language Models via Visual Perception Reward | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-06 | Robust sensor fusion against on-vehicle sensor staleness | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-06 | Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-05 | LLMs for sensory-motor control: Combining in-context and iterative learning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-05 | Ontology-based knowledge representation for bone disease diagnosis: a foundation for safe and sustainable medical artificial intelligence systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-05 | [MineInsight] MineInsight: A Multi-sensor Dataset for Humanitarian Demining Robotics in Off-Road Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-04 | [cuVSLAM] cuVSLAM: CUDA accelerated visual odometry | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-03 | [V2X-UniPool] V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-03 | [Sign Language] Sign Language: Towards Sign Understanding for Robot Autonomy | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-03 | [SAVOR] SAVOR: Skill Affordance Learning from Visuo-Haptic Perception for Robot-Assisted Bite Acquisition | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-06-03 | Geometric Visual Servo Via Optimal Transport | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-31 | Multi-Objective Neural Network Assisted Design Optimization of Soft Fin-Ray Grippers for Enhanced Grasping Performance | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-31 | Constrained Stein Variational Gradient Descent for Robot Perception, Planning, and Identification | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-30 | [SentinelAgent] SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-30 | System-integrated intrinsic static-dynamic pressure sensing enabled by charge excitation and 3D gradient engineering for autonomous robotic interaction | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-28 | Cognitively-Inspired Emergent Communication via Knowledge Graphs for Assisting the Visually Impaired | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-28 | From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-28 | [UP-SLAM] UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-28 | [iDSE] iDSE: Navigating Design Space Exploration in High-Level Synthesis Using LLMs | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-25 | Staircase Recognition and Location Based on Polarization Vision | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-24 | Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-23 | [RQR3D] RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-23 | Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-23 | A Dataset and Benchmarks for Deep Learning-Based Optical Microrobot Pose and Depth Perception | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-22 | D-LIO: 6DoF Direct LiDAR-Inertial Odometry based on Simultaneous Truncated Distance Field Mapping | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-21 | RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-21 | [WaveTouch] WaveTouch: Active Tactile Sensing Using Vibro-Feedback for Classification of Variable Stiffness and Infill Density Objects | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-19 | MatPredict: a dataset and benchmark for learning material properties of diverse indoor objects | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-16 | Attention on the Sphere | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-15 | Unsupervised Radar Point Cloud Enhancement via Arbitrary LiDAR Guided Diffusion Prior | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-15 | Large-Scale Gaussian Splatting SLAM | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-15 | [TartanGround] TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-14 | Air-Ground Collaboration for Language-Specified Missions in Unknown Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-14 | Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-14 | EdgeAI Drone for Autonomous Construction Site Demonstrator | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-13 | LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-13 | [MDF] MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-12 | Pixel Motion as Universal Representation for Robot Control | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-12 | Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-12 | Hybrid Spiking Vision Transformer for Object Detection with Event Cameras | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-12 | Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-11 | Dynamic Safety in Complex Environments: Synthesizing Safety Filters with Poisson’s Equation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-09 | [Topo-VM-UNetV2] Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-09 | Camera-Only Bird’s Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-09 | Differentiating Emigration from Return Migration of Scholars Using Name-Based Nationality Detection Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-09 | [HashKitty] HashKitty: Distributed Password Analysis | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | [pdf] | yc4ny/SVAD | ⭐️⭐️⭐️ |
| 2025-05-08 | A Survey | [pdf] | hzxie/awesome-3d-scene-generation | ⭐️⭐️⭐️ |
| 2025-05-08 | Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | An Omni Foundation Model for Interleaved Multi-Modal Generation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Evaluating Legally Consistent Bias in Machine Learning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Training Flow Matching Models via Online RL | [pdf] | yifan123/flow_grpo | ⭐️⭐️⭐️ |
| 2025-05-08 | Generating Physically Stable and Buildable LEGO Designs from Text | [pdf] | AvaLovelace1/LegoGPT | ⭐️⭐️⭐️ |
| 2025-05-08 | Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Comparison of integral equations used to study $T_{cc}^+$ | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Understanding Perception and Reasoning through Model Merging | [pdf] | shiqichen17/vlm_merging | ⭐️⭐️⭐️ |
| 2025-05-08 | Primordial black-hole formation and heavy r-process element synthesis from the cosmological QCD transition. Two aspects of an inhomogeneous early Universe | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Representation Stability for Marked Graph Complexes | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Emergence of Spin-Polarized Unconventional Skin Effect in Hatano-Nelson Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | A Study on Improvement of Image Quality in Quantum Polarized Microscopy using an Entangled-Photon Source | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | towards Spatial Intelligence Thorough Evaluation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Resolution of the Solar Convective Conundrum? New Results Using the Time-Distance Deep-Focus Method | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Conversational Process Model Redesign | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | The Brownian marble | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | a cosmic explosion with a complex off-axis jet and cocoon from a massive progenitor | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | implications for the observed abundance of ultra-violet luminous galaxies at z>10 | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Scalable Bernoulli factories for Bayesian inference with intractable likelihoods | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Cell size heterogeneity controls crystallization of the developing fruit fly wing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Non-Markovianity in collision models with initial intra-environment correlations | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Boundary Energy-Momentum Tensors for Asymptotically Flat Spacetimes | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Statistical Characterization of Entanglement Degradation Under Markovian Noise in Composite Quantum Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Two-dimensional water waves with constant vorticity and general bottom topography | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Empowering Scientific Workflows with Federated Agents | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | an LLM-based Literary Translation evaluation metric with Professional Question Answering | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Robustly optimal dynamics for active matter reservoir computing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | A Budget-Constrained Routing Perspective | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Variable Selection for Fixed and Random Effects in Multilevel Functional Mixed Effects Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Post-Training Compression for Ultra-Low Power Hyperdimensional Computing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Reasoning Models Don’t Always Say What They Think | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Neural network methods for power series problems of Perron-Frobenius operators | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Do LLMs Generate More Biased News Headlines than Humans? | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Testing an unstable cosmic neutrino background | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Representing spherical tensors with scalar-based machine-learning models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Application to attosecond X-ray spectroscopy | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Significant reflection and absorption effects in the X-ray emission of the Intermediate Polar IGR J17195-4100 | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | A Pain Assessment Framework based on multimodal data and Deep Machine Learning methods | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Impact of topology | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | A Simple yet Effective Event Denoising Method with State Space Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
| 2025-05-08 | Near- to mid-infrared spectroscopic study of ice analysis using the AKARI/IRC and Spitzer/IRS spectra | [pdf] | ⚠️ | ⭐️⭐️⭐️ |

📊 Statistics

Last updated: 2025-06-28