多模态交互

🌐 语言: English 中文

本目录收集了具身智能中与多模态交互相关的论文和代码实现。

主要内容

手动添加的论文

日期 标题 论文 代码 推荐指数
2024-09 ReMEmbR: Retrieval-Enhanced Memory for Robot Reasoning and Navigation [pdf] NVIDIA-AI-IOT/remembr ⭐️⭐️⭐️
2024 Gesture-Based Control for Robotic Systems [pdf] ⚠️ ⭐️⭐️
2023 Natural Language Instructions for Robot Manipulation [pdf] example/lang_robot ⭐️⭐️⭐️

自动更新的论文

日期 标题 论文 代码 推荐指数
2025-08-20 [NoteIt] NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-20 Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-19 The Social Context of Human-Robot Interactions [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-19 [BetaWeb] BetaWeb: Towards a Blockchain-enabled Trustworthy Agentic Web [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-19 [STER-VLM] STER-VLM: Spatio-Temporal With Enhanced Reference Vision-Language Models [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-18 One-Class Intrusion Detection with Dynamic Graphs [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-18 A Surveillance Based Interactive Robot [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-18 [en] Precise Action-to-Video Generation Through Visual Action Prompts [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-16 [SimInterview] SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-14 A Multimodal Neural Network for Recognizing Subjective Self-Disclosure Towards Social Robots [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-13 Whole-Body Bilateral Teleoperation with Multi-Stage Object Parameter Estimation for Wheeled Humanoid Locomanipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-13 [HapticGiant] HapticGiant: A Novel Very Large Kinesthetic Haptic Interface with Hierarchical Force Control [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-12 Silicon Minds versus Human Hearts: The Wisdom of Crowds Beats the Wisdom of AI in Emotion Recognition [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-12 [Imposing AI] Imposing AI: Deceptive design patterns against sustainability [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-12 Generative AI for Critical Infrastructure in Smart Grids: A Unified Framework for Synthetic Data Generation and Anomaly Detection [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-12 Autonomous Mobile Plant Watering Robot : A Kinematic Approach [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-11 [Conversational DNA] Conversational DNA: A New Visual Language for Understanding Dialogue Structure in Human and AI [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 Towards Embodied Agentic AI: Review and Classification of LLM- and VLM-Driven Robot Autonomy and Interaction [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 Large Language Models Transform Organic Synthesis From Reaction Prediction to Automation [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 Building Effective Safety Guardrails in AI Education Tools [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 Examining the legibility of humanoid robot arm movements in a pointing task [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 [ADSEL] ADSEL: Adaptive dual self-expression learning for EEG feature selection via incomplete multi-dimensional emotional tagging [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 [Multi-Agent] Dancing with a Robot: An Experimental Study of Child-Robot Interaction in a Performative Art Setting [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-07 [Multi-Agent] Affecta-Context: The Context-Guided Behavior Adaptation Framework [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-06 [StepWrite] StepWrite: Adaptive Planning for Speech-Driven Text Generation [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-06 Improving Tactile Gesture Recognition with Optical Flow [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-05 Decoding and Engineering the Phytobiome Communication for Smart Agriculture [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-04 [HyCodePolicy] HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-04 Would you let a humanoid play storytelling with your child? A usability study on LLM-powered narrative Humanoid-Robot Interaction [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-04 Multi-Class Human/Object Detection on Robot Manipulators using Proprioceptive Sensing [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-04 Dynamic Forgetting and Spatio-Temporal Periodic Interest Modeling for Local-Life Service Recommendation [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-04 [en] What Is Your AI Agent Buying? Evaluation, Implications and Emerging Questions for Agentic E-Commerce [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-04 Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-03 [en] Set the Stage: Enabling Storytelling with Multiple Robots through Roleplaying Metaphors [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-02 Video-based Vehicle Surveillance in the Wild: License Plate, Make, and Model Recognition with Self Reflective Vision-Language Models [pdf] ⚠️ ⭐️⭐️⭐️
2025-08-02 [VLH] VLH: Vision-Language-Haptics Foundation Model [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-31 Human-Exoskeleton Kinematic Calibration to Improve Hand Tracking for Dexterous Teleoperation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-31 [Multi-Agent] User Experience Estimation in Human-Robot Interaction Via Multi-Instance Learning of Multimodal Social Signals [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-30 [Magentic-UI] Magentic-UI: Towards Human-in-the-loop Agentic Systems [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-30 Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-30 [en] Beyond Rigid AI: Towards Natural Human-Machine Symbiosis for Interoperative Surgical Assistance [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-30 [Viser] Viser: Imperative, Web-based 3D Visualization in Python [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-29 Automatic Classification of User Requirements from Online Feedback – A Replication Study [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-29 Emergent interactions lead to collective frustration in robotic matter [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-29 [Multi-Agent] Sound Source Localization for Human-Robot Interaction in Outdoor Environments [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-28 A Human-in-the-loop Approach to Robot Action Replanning through LLM Common-Sense Reasoning [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-26 [en] Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-26 [en] Robot Excavation and Manipulation of Geometrically Cohesive Granular Media [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-25 [GEAR] GEAR: Gaze-Enabled Human-Robot Collaborative Assembly [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-25 Towards Multimodal Social Conversations with Robots: Using Vision-Language Models [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-25 [Humanoid] Salsa as a Nonverbal Embodied Language – The CoMPAS3D Dataset and Benchmarks [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-24 [ViGText] ViGText: Deepfake Image Detection with Vision-Language Model Explanations and Graph Neural Networks [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-24 MetaMorph – A Metamodelling Approach For Robot Morphology [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-23 An Exploratory Study on Human-Robot Interaction using Semantics-based Situational Awareness [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-23 Robot-mediated physical Human-Human Interaction in Neurorehabilitation: a position paper [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-22 AI or Human? Understanding Perceptions of Embodied Robots with LLMs [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-22 [en] Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-22 [Beyond Algorethics] Beyond Algorethics: Addressing the Ethical and Anthropological Challenges of AI Recommender Systems [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-21 Gaze-supported Large Language Model Framework for Bi-directional Human-Robot Interaction [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-21 Therapist-Exoskeleton-Patient Interaction: An Immersive Gait Therapy [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-21 [en] Strong, Accurate, and Low-Cost Robot Manipulator [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-20 Digital twin and extended reality for teleoperation of the electric vehicle battery disassembly [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-18 A Study of Teleoperation Methods in a Simulated Virtual Eye Surgery Environment [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-17 [AnyPos] AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-17 [Multi-Agent] ERR@HRI 2.0 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Conversations [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-16 From Coarse to Nuanced: Cross-Modal Alignment of Fine-Grained Linguistic Cues and Visual Salient Regions for Dynamic Emotion Recognition [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-16 [InstructFLIP] InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-16 Design and Development of an Automated Contact Angle Tester (ACAT) for Surface Wettability Measurement [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-15 [Multi-Agent] Human-Robot collaboration in surgery: Advances and challenges towards autonomous surgical assistants [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-15 [en] Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-14 [en] Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-13 [en] SegVec3D: A Method for Vector Embedding of 3D Objects Oriented Towards Robot manipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-10 [UniTac] UniTac: Whole-Robot Touch Sensing Without Tactile Sensors [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-10 [Multi-Agent] FiDTouch: A 3D Wearable Haptic Display for the Finger Pad [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-10 Pluri-perspectivism in Human-robot Co-creativity with Older Adults [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-09 [VisualTrap] VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-09 Integrating Perceptions: A Human-Centered Physical Safety Model for Human-Robot Interaction [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-09 Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-09 Effects of Wrist-Worn Haptic Feedback on Force Accuracy and Task Speed during a Teleoperated Robotic Surgery Task [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-09 [LangNavBench] LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-08 [en] Robust Speech-Workload Estimation for Intelligent Human-Robot Systems [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-07 [VOTE] VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-07 [en] Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-07 Fine-Grained Vision-Language Modeling for Multimodal Training Assistants in Augmented Reality [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-05 [en] Human-centered AI with focus on Human-robot interaction (Book chapter) [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-03 Safe and Socially Aware Multi-Robot Coordination in Multi-Human Social Care Settings [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-03 [en] LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-03 [en] Personalised Explanations in Long-term Human-Robot Interactions [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-02 [VLAD] VLAD: A VLM-Augmented Autonomous Driving Framework with Hierarchical Planning and Interpretable Decision Process [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-02 How Do Vision-Language Models Process Conflicting Information Across Modalities? [pdf] ⚠️ ⭐️⭐️⭐️
2025-07-02 [cVLA] cVLA: Towards Efficient Camera-Space VLAs [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-30 Passage-traversing optimal path planning with sampling-based algorithms [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-30 Towards Universal Shared Control in Teleoperation Without Haptic Feedback [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-30 [en] User Concerns Regarding Social Robots for Mood Regulation: A Case Study on the “Sunday Blues” [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-27 Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-27 Bootstrapping Human-Like Planning via LLMs [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-25 [Multi-Agent] Personalized Mental State Evaluation in Human-Robot Interaction using Federated Learning [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-25 How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction? [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-25 [en] Generating and Customizing Robotic Arm Trajectories using Neural Networks [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-24 [en] The MOTIF Hand: A Robotic Hand for Multimodal Observations with Thermal, Inertial, and Force Sensors [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-23 [en] TritonZ: A Remotely Operated Underwater Rover with Manipulator Arm for Exploration and Rescue Operations [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-23 [Multi-Agent] Situated Haptic Interaction: Exploring the Role of Context in Affective Perception of Robotic Touch [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-19 History-Augmented Vision-Language Models for Frontier-Based Zero-Shot Object Navigation [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-19 On using AI for EEG-based BCI applications: problems, current challenges and future trends [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-18 [Designing Intent] Designing Intent: A Multimodal Framework for Human-Robot Cooperation in Industrial Workspaces [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-18 [en] Vision in Action: Learning Active Perception from Human Demonstrations [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-18 I Know You’re Listening: Adaptive Voice for HRI [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-17 Design an Editable Speech-to-Sign-Language Transformer System: A Human-Centered AI Approach [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-16 Multimodal “Puppeteer”: An Exploration of Robot Teleoperation Via Virtual Counterpart with LLM-Driven Voice and Gesture Interaction in Augmented Reality [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-16 [en] A Cooperative Contactless Object Transport with Acoustic Robots [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-13 [en] Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-13 Robotic System for Chemical Experiment Automation with Dual Demonstration of End-effector and Jig Operations [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-12 [RT-VC] RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-12 Using Vision Language Models to Detect Students’ Academic Emotion through Facial Expressions [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-11 Integrating Quantized LLMs into Robotics Systems as Edge AI to Leverage their Natural Language Processing Capabilities [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-11 A Navigation Framework Utilizing Vision-Language Models [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-11 Test-Time Adaptation for Generalizable Task Progress Estimation [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-11 A Unified Framework for Probabilistic Dynamic-, Trajectory- and Vision-based Virtual Fixtures [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-11 Cybernetic Marionette: Channeling Collective Agency Through a Wearable Robot in a Live Dancer-Robot Duet [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-10 [Help or Hindrance] Help or Hindrance: Understanding the Impact of Robot Communication in Action Teams [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-10 [en] Towards Biosignals-Free Autonomous Prosthetic Hand Control via Imitation Learning [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-09 [en] LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-09 [en] BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-09 Surgeon Style Fingerprinting and Privacy Risk Quantification via Discrete Diffusion Models in a Vision-Language-Action Framework [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-07 Active Test-time Vision-Language Navigation [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-07 [en] Attention-Based Convolutional Neural Network Model for Human Lower Limb Activity Recognition using sEMG [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-06 [HMVLM] HMVLM: Multistage Reasoning-Enhanced Vision-Language Model for Long-Tailed Driving Scenarios [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-05 [GEX] GEX: Democratizing Dexterity with Fully-Actuated Dexterous Hand and Exoskeleton Glove [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-05 [en] Multimodal Limbless Crawling Soft Robot with a Kirigami Skin [pdf] ⚠️ ⭐️⭐️⭐️
2025-06-02 EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-30 Learning API Functionality from Demonstrations for Tool-based Agents [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-30 Towards Tangible Immersion for Cobot Programming-by-Demonstration: Visual, Tactile and Haptic Interfaces for Mixed-Reality Cobot Automation in Semiconductor Manufacturing [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-29 Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-29 Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-28 [ForceVLA] ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-26 [DiffVLA] DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-26 Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-26 The Many Challenges of Human-Like Agents in Virtual Game Environments [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-26 [en] CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-23 [Trajectory] DTRT: Enhancing Human Intent Estimation and Role Allocation for Physical Human-Robot Collaboration [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-23 [VideoGameBench] VideoGameBench: Can Vision-Language Models complete popular video games? [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-22 [Circle-RoPE] Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Large Vision-Language Models [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-22 [DriveMoE] DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-21 [ClickSight] ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-21 Proactive Hierarchical Control Barrier Function-Based Safety Prioritization in Close Human-Robot Interaction Scenarios [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-20 Sketch Interface for Teleoperation of Mobile Manipulator to Enable Intuitive and Intended Operation: A Proof of Concept [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-20 Robotic Monitoring of Colorimetric Leaf Sensors for Precision Agriculture [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-20 Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-20 [Multi-Agent] UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-20 [en] Certifiably Safe Manipulation of Deformable Linear Objects via Joint Shape and Tension Prediction [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-19 Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-19 [Multi-Agent] Interpretable Robotic Friction Learning via Symbolic Regression [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-16 Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-16 [en] Open-Source Multi-Viewpoint Surgical Telerobotics [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-15 Context-aware collaborative pushing of heavy objects using skeleton-based intention prediction [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-14 [Flash-VL 2B] Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-14 Grasp EveryThing (GET): 1-DoF, 3-Fingered Gripper with Tactile Sensing for Robust Grasping [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-13 [CLTP] CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-13 The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-13 [en] A Social Robot with Inner Speech for Dietary Guidance [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-13 WaLLM – Insights from an LLM-Powered Chatbot deployment via WhatsApp [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 Intuitive Human-Robot Interfaces Leveraging on Autonomy Features for the Control of Highly-redundant Robots [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 [Adaptive] Hybrid Control Strategies for Safe and Adaptive Robot-Assisted Dressing [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 [AcoustoBots] AcoustoBots: A swarm of robots for acoustophoretic multimodal interactions [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 When Near Becomes Far: From Rayleigh to Optimal Near-Field and Far-Field Boundaries [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 [BodyGPS] BodyGPS: Anatomical Positioning System [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 Circulators based on Coupled Quantum Anomalous Hall Insulators and Resonators [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 [UAV-CodeAgents] UAV-CodeAgents: Scalable UAV Mission Planning via Multi-Agent ReAct and Vision-Language Reasoning [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-12 [TPT-Bench] TPT-Bench: A Large-Scale, Long-Term and Robot-Egocentric Dataset for Benchmarking Target Person Tracking [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-09 Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-09 An Empirical Study of Fuzz Harness Degradation [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-09 Polymer-Shell Coating of Mie-Resonant Silicon Nanospheres for Controlled Fabrication of Self-Assembled Monolayer [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-09 Preferential Attachment Trees with Vertex Death: Persistence of the Maximum Degree [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-09 Context Informed Incremental Learning Improves Myoelectric Control Performance in Virtual Reality Object Manipulation Tasks [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation [pdf] yc4ny/SVAD ⭐️⭐️⭐️
2025-05-08 A Survey [pdf] hzxie/awesome-3d-scene-generation ⭐️⭐️⭐️
2025-05-08 Predicting Structure and Motion via Ray Origin and Endpoint Diffusion [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Generating Physically Stable and Buildable LEGO Designs from Text [pdf] AvaLovelace1/LegoGPT ⭐️⭐️⭐️
2025-05-08 Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Preference Alignment via Comparison Oracles [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Understanding Perception and Reasoning through Model Merging [pdf] shiqichen17/vlm_merging ⭐️⭐️⭐️
2025-05-08 Primordial black-hole formation and heavy r-process element synthesis from the cosmological QCD transition. Two aspects of an inhomogeneous early Universe [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Marsden–Meyer–Weinstein reduction for $k$-contact field theories [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Representation Stability for Marked Graph Complexes [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 A Dataset of Misleading Narratives Surrounding Recent UK General Elections [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Emergence of Spin-Polarized Unconventional Skin Effect in Hatano-Nelson Model [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 A Study on Improvement of Image Quality in Quantum Polarized Microscopy using an Entangled-Photon Source [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Conversational Process Model Redesign [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 The Brownian marble [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Novel Forms of Early Dark Energy [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 todd [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 a cosmic explosion with a complex off-axis jet and cocoon from a massive progenitor [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 implications for the observed abundance of ultra-violet luminous galaxies at z>10 [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Stabilization of Kac polynomials [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Scalable Bernoulli factories for Bayesian inference with intractable likelihoods [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 The effective energy of a lattice metamaterial [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Non-Markovianity in collision models with initial intra-environment correlations [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Boundary Energy-Momentum Tensors for Asymptotically Flat Spacetimes [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Statistical Characterization of Entanglement Degradation Under Markovian Noise in Composite Quantum Systems [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Two-dimensional water waves with constant vorticity and general bottom topography [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Theoretical modeling of approximate universality of tidally deformed neutron stars [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Empowering Scientific Workflows with Federated Agents [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Efficient Data Filtering and Verification for High-Quality LLM Training Data [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 On differentiation of integrals in Lebesgue spaces [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 An efficient second-order cone programming approach for dynamic optimal transport on staggered grid discretization [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 an LLM-based Literary Translation evaluation metric with Professional Question Answering [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Robustly optimal dynamics for active matter reservoir computing [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 A new time-dependent quantum theory based on Tsallis’ distribution [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 A Budget-Constrained Routing Perspective [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Evidence of chiral fermion edge modes through geometric engineering of thermal Hall in $α$-RuCl$_3$ [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Variable Selection for Fixed and Random Effects in Multilevel Functional Mixed Effects Models [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Fermi lune and transdimensional orbital magnetism in rhombohedral multilayer graphene [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Post-Training Compression for Ultra-Low Power Hyperdimensional Computing [pdf] ⚠️ ⭐️⭐️⭐️
2025-05-08 Dynamic injection of a compressible gas into a confined porous layer [pdf] ⚠️ ⭐️⭐️⭐️

📊 统计

最后更新: 2025-08-22