2025-08-20 |
[NoteIt] NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-20 |
Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-19 |
The Social Context of Human-Robot Interactions |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-19 |
[BetaWeb] BetaWeb: Towards a Blockchain-enabled Trustworthy Agentic Web |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-19 |
[STER-VLM] STER-VLM: Spatio-Temporal With Enhanced Reference Vision-Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-18 |
One-Class Intrusion Detection with Dynamic Graphs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-18 |
A Surveillance Based Interactive Robot |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-18 |
[en] Precise Action-to-Video Generation Through Visual Action Prompts |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-16 |
[SimInterview] SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-14 |
A Multimodal Neural Network for Recognizing Subjective Self-Disclosure Towards Social Robots |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-13 |
Whole-Body Bilateral Teleoperation with Multi-Stage Object Parameter Estimation for Wheeled Humanoid Locomanipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-13 |
[HapticGiant] HapticGiant: A Novel Very Large Kinesthetic Haptic Interface with Hierarchical Force Control |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-12 |
Silicon Minds versus Human Hearts: The Wisdom of Crowds Beats the Wisdom of AI in Emotion Recognition |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-12 |
[Imposing AI] Imposing AI: Deceptive design patterns against sustainability |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-12 |
Generative AI for Critical Infrastructure in Smart Grids: A Unified Framework for Synthetic Data Generation and Anomaly Detection |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-12 |
Autonomous Mobile Plant Watering Robot : A Kinematic Approach |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-11 |
[Conversational DNA] Conversational DNA: A New Visual Language for Understanding Dialogue Structure in Human and AI |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
Towards Embodied Agentic AI: Review and Classification of LLM- and VLM-Driven Robot Autonomy and Interaction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
Large Language Models Transform Organic Synthesis From Reaction Prediction to Automation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
Building Effective Safety Guardrails in AI Education Tools |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
Examining the legibility of humanoid robot arm movements in a pointing task |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
[ADSEL] ADSEL: Adaptive dual self-expression learning for EEG feature selection via incomplete multi-dimensional emotional tagging |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
[Multi-Agent] Dancing with a Robot: An Experimental Study of Child-Robot Interaction in a Performative Art Setting |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-07 |
[Multi-Agent] Affecta-Context: The Context-Guided Behavior Adaptation Framework |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-06 |
[StepWrite] StepWrite: Adaptive Planning for Speech-Driven Text Generation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-06 |
Improving Tactile Gesture Recognition with Optical Flow |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-05 |
Decoding and Engineering the Phytobiome Communication for Smart Agriculture |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-04 |
[HyCodePolicy] HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-04 |
Would you let a humanoid play storytelling with your child? A usability study on LLM-powered narrative Humanoid-Robot Interaction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-04 |
Multi-Class Human/Object Detection on Robot Manipulators using Proprioceptive Sensing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-04 |
Dynamic Forgetting and Spatio-Temporal Periodic Interest Modeling for Local-Life Service Recommendation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-04 |
[en] What Is Your AI Agent Buying? Evaluation, Implications and Emerging Questions for Agentic E-Commerce |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-04 |
Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-03 |
[en] Set the Stage: Enabling Storytelling with Multiple Robots through Roleplaying Metaphors |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-02 |
Video-based Vehicle Surveillance in the Wild: License Plate, Make, and Model Recognition with Self Reflective Vision-Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-08-02 |
[VLH] VLH: Vision-Language-Haptics Foundation Model |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-31 |
Human-Exoskeleton Kinematic Calibration to Improve Hand Tracking for Dexterous Teleoperation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-31 |
[Multi-Agent] User Experience Estimation in Human-Robot Interaction Via Multi-Instance Learning of Multimodal Social Signals |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-30 |
[Magentic-UI] Magentic-UI: Towards Human-in-the-loop Agentic Systems |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-30 |
Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-30 |
[en] Beyond Rigid AI: Towards Natural Human-Machine Symbiosis for Interoperative Surgical Assistance |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-30 |
[Viser] Viser: Imperative, Web-based 3D Visualization in Python |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-29 |
Automatic Classification of User Requirements from Online Feedback – A Replication Study |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-29 |
Emergent interactions lead to collective frustration in robotic matter |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-29 |
[Multi-Agent] Sound Source Localization for Human-Robot Interaction in Outdoor Environments |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-28 |
A Human-in-the-loop Approach to Robot Action Replanning through LLM Common-Sense Reasoning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-26 |
[en] Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-26 |
[en] Robot Excavation and Manipulation of Geometrically Cohesive Granular Media |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-25 |
[GEAR] GEAR: Gaze-Enabled Human-Robot Collaborative Assembly |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-25 |
Towards Multimodal Social Conversations with Robots: Using Vision-Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-25 |
[Humanoid] Salsa as a Nonverbal Embodied Language – The CoMPAS3D Dataset and Benchmarks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-24 |
[ViGText] ViGText: Deepfake Image Detection with Vision-Language Model Explanations and Graph Neural Networks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-24 |
MetaMorph – A Metamodelling Approach For Robot Morphology |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-23 |
An Exploratory Study on Human-Robot Interaction using Semantics-based Situational Awareness |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-23 |
Robot-mediated physical Human-Human Interaction in Neurorehabilitation: a position paper |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-22 |
AI or Human? Understanding Perceptions of Embodied Robots with LLMs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-22 |
[en] Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-22 |
[Beyond Algorethics] Beyond Algorethics: Addressing the Ethical and Anthropological Challenges of AI Recommender Systems |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-21 |
Gaze-supported Large Language Model Framework for Bi-directional Human-Robot Interaction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-21 |
Therapist-Exoskeleton-Patient Interaction: An Immersive Gait Therapy |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-21 |
[en] Strong, Accurate, and Low-Cost Robot Manipulator |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-20 |
Digital twin and extended reality for teleoperation of the electric vehicle battery disassembly |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-18 |
A Study of Teleoperation Methods in a Simulated Virtual Eye Surgery Environment |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-17 |
[AnyPos] AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-17 |
[Multi-Agent] ERR@HRI 2.0 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Conversations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-16 |
From Coarse to Nuanced: Cross-Modal Alignment of Fine-Grained Linguistic Cues and Visual Salient Regions for Dynamic Emotion Recognition |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-16 |
[InstructFLIP] InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-16 |
Design and Development of an Automated Contact Angle Tester (ACAT) for Surface Wettability Measurement |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-15 |
[Multi-Agent] Human-Robot collaboration in surgery: Advances and challenges towards autonomous surgical assistants |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-15 |
[en] Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-14 |
[en] Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-13 |
[en] SegVec3D: A Method for Vector Embedding of 3D Objects Oriented Towards Robot manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-10 |
[UniTac] UniTac: Whole-Robot Touch Sensing Without Tactile Sensors |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-10 |
[Multi-Agent] FiDTouch: A 3D Wearable Haptic Display for the Finger Pad |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-10 |
Pluri-perspectivism in Human-robot Co-creativity with Older Adults |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-09 |
[VisualTrap] VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-09 |
Integrating Perceptions: A Human-Centered Physical Safety Model for Human-Robot Interaction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-09 |
Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-09 |
Effects of Wrist-Worn Haptic Feedback on Force Accuracy and Task Speed during a Teleoperated Robotic Surgery Task |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-09 |
[LangNavBench] LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-08 |
[en] Robust Speech-Workload Estimation for Intelligent Human-Robot Systems |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-07 |
[VOTE] VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-07 |
[en] Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-07 |
Fine-Grained Vision-Language Modeling for Multimodal Training Assistants in Augmented Reality |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-05 |
[en] Human-centered AI with focus on Human-robot interaction (Book chapter) |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-03 |
Safe and Socially Aware Multi-Robot Coordination in Multi-Human Social Care Settings |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-03 |
[en] LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-03 |
[en] Personalised Explanations in Long-term Human-Robot Interactions |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-02 |
[VLAD] VLAD: A VLM-Augmented Autonomous Driving Framework with Hierarchical Planning and Interpretable Decision Process |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-02 |
How Do Vision-Language Models Process Conflicting Information Across Modalities? |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-07-02 |
[cVLA] cVLA: Towards Efficient Camera-Space VLAs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-30 |
Passage-traversing optimal path planning with sampling-based algorithms |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-30 |
Towards Universal Shared Control in Teleoperation Without Haptic Feedback |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-30 |
[en] User Concerns Regarding Social Robots for Mood Regulation: A Case Study on the “Sunday Blues” |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-27 |
Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-27 |
Bootstrapping Human-Like Planning via LLMs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-25 |
[Multi-Agent] Personalized Mental State Evaluation in Human-Robot Interaction using Federated Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-25 |
How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction? |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-25 |
[en] Generating and Customizing Robotic Arm Trajectories using Neural Networks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-24 |
[en] The MOTIF Hand: A Robotic Hand for Multimodal Observations with Thermal, Inertial, and Force Sensors |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-23 |
[en] TritonZ: A Remotely Operated Underwater Rover with Manipulator Arm for Exploration and Rescue Operations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-23 |
[Multi-Agent] Situated Haptic Interaction: Exploring the Role of Context in Affective Perception of Robotic Touch |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-19 |
History-Augmented Vision-Language Models for Frontier-Based Zero-Shot Object Navigation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-19 |
On using AI for EEG-based BCI applications: problems, current challenges and future trends |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-18 |
[Designing Intent] Designing Intent: A Multimodal Framework for Human-Robot Cooperation in Industrial Workspaces |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-18 |
[en] Vision in Action: Learning Active Perception from Human Demonstrations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-18 |
I Know You’re Listening: Adaptive Voice for HRI |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-17 |
Design an Editable Speech-to-Sign-Language Transformer System: A Human-Centered AI Approach |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-16 |
Multimodal “Puppeteer”: An Exploration of Robot Teleoperation Via Virtual Counterpart with LLM-Driven Voice and Gesture Interaction in Augmented Reality |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-16 |
[en] A Cooperative Contactless Object Transport with Acoustic Robots |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-13 |
[en] Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-13 |
Robotic System for Chemical Experiment Automation with Dual Demonstration of End-effector and Jig Operations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-12 |
[RT-VC] RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-12 |
Using Vision Language Models to Detect Students’ Academic Emotion through Facial Expressions |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-11 |
Integrating Quantized LLMs into Robotics Systems as Edge AI to Leverage their Natural Language Processing Capabilities |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-11 |
A Navigation Framework Utilizing Vision-Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-11 |
Test-Time Adaptation for Generalizable Task Progress Estimation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-11 |
A Unified Framework for Probabilistic Dynamic-, Trajectory- and Vision-based Virtual Fixtures |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-11 |
Cybernetic Marionette: Channeling Collective Agency Through a Wearable Robot in a Live Dancer-Robot Duet |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-10 |
[Help or Hindrance] Help or Hindrance: Understanding the Impact of Robot Communication in Action Teams |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-10 |
[en] Towards Biosignals-Free Autonomous Prosthetic Hand Control via Imitation Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-09 |
[en] LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-09 |
[en] BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-09 |
Surgeon Style Fingerprinting and Privacy Risk Quantification via Discrete Diffusion Models in a Vision-Language-Action Framework |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-07 |
Active Test-time Vision-Language Navigation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-07 |
[en] Attention-Based Convolutional Neural Network Model for Human Lower Limb Activity Recognition using sEMG |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-06 |
[HMVLM] HMVLM: Multistage Reasoning-Enhanced Vision-Language Model for Long-Tailed Driving Scenarios |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-05 |
[GEX] GEX: Democratizing Dexterity with Fully-Actuated Dexterous Hand and Exoskeleton Glove |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-05 |
[en] Multimodal Limbless Crawling Soft Robot with a Kirigami Skin |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-02 |
EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-30 |
Learning API Functionality from Demonstrations for Tool-based Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-30 |
Towards Tangible Immersion for Cobot Programming-by-Demonstration: Visual, Tactile and Haptic Interfaces for Mixed-Reality Cobot Automation in Semiconductor Manufacturing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-29 |
Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-29 |
Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-28 |
[ForceVLA] ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-26 |
[DiffVLA] DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-26 |
Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-26 |
The Many Challenges of Human-Like Agents in Virtual Game Environments |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-26 |
[en] CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-23 |
[Trajectory] DTRT: Enhancing Human Intent Estimation and Role Allocation for Physical Human-Robot Collaboration |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-23 |
[VideoGameBench] VideoGameBench: Can Vision-Language Models complete popular video games? |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-22 |
[Circle-RoPE] Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Large Vision-Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-22 |
[DriveMoE] DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-21 |
[ClickSight] ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-21 |
Proactive Hierarchical Control Barrier Function-Based Safety Prioritization in Close Human-Robot Interaction Scenarios |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
Sketch Interface for Teleoperation of Mobile Manipulator to Enable Intuitive and Intended Operation: A Proof of Concept |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
Robotic Monitoring of Colorimetric Leaf Sensors for Precision Agriculture |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
[Multi-Agent] UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
[en] Certifiably Safe Manipulation of Deformable Linear Objects via Joint Shape and Tension Prediction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-19 |
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-19 |
[Multi-Agent] Interpretable Robotic Friction Learning via Symbolic Regression |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-16 |
Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-16 |
[en] Open-Source Multi-Viewpoint Surgical Telerobotics |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-15 |
Context-aware collaborative pushing of heavy objects using skeleton-based intention prediction |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-14 |
[Flash-VL 2B] Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-14 |
Grasp EveryThing (GET): 1-DoF, 3-Fingered Gripper with Tactile Sensing for Robust Grasping |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
[CLTP] CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
[en] A Social Robot with Inner Speech for Dietary Guidance |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
WaLLM – Insights from an LLM-Powered Chatbot deployment via WhatsApp |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
Intuitive Human-Robot Interfaces Leveraging on Autonomy Features for the Control of Highly-redundant Robots |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
[Adaptive] Hybrid Control Strategies for Safe and Adaptive Robot-Assisted Dressing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
[AcoustoBots] AcoustoBots: A swarm of robots for acoustophoretic multimodal interactions |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
When Near Becomes Far: From Rayleigh to Optimal Near-Field and Far-Field Boundaries |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
[BodyGPS] BodyGPS: Anatomical Positioning System |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
Circulators based on Coupled Quantum Anomalous Hall Insulators and Resonators |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
[UAV-CodeAgents] UAV-CodeAgents: Scalable UAV Mission Planning via Multi-Agent ReAct and Vision-Language Reasoning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
[TPT-Bench] TPT-Bench: A Large-Scale, Long-Term and Robot-Egocentric Dataset for Benchmarking Target Person Tracking |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
An Empirical Study of Fuzz Harness Degradation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Polymer-Shell Coating of Mie-Resonant Silicon Nanospheres for Controlled Fabrication of Self-Assembled Monolayer |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Preferential Attachment Trees with Vertex Death: Persistence of the Maximum Degree |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Context Informed Incremental Learning Improves Myoelectric Control Performance in Virtual Reality Object Manipulation Tasks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation |
[pdf] |
yc4ny/SVAD |
⭐️⭐️⭐️ |
2025-05-08 |
A Survey |
[pdf] |
hzxie/awesome-3d-scene-generation |
⭐️⭐️⭐️ |
2025-05-08 |
Predicting Structure and Motion via Ray Origin and Endpoint Diffusion |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Generating Physically Stable and Buildable LEGO Designs from Text |
[pdf] |
AvaLovelace1/LegoGPT |
⭐️⭐️⭐️ |
2025-05-08 |
Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Preference Alignment via Comparison Oracles |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Understanding Perception and Reasoning through Model Merging |
[pdf] |
shiqichen17/vlm_merging |
⭐️⭐️⭐️ |
2025-05-08 |
Primordial black-hole formation and heavy r-process element synthesis from the cosmological QCD transition. Two aspects of an inhomogeneous early Universe |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Marsden–Meyer–Weinstein reduction for $k$-contact field theories |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Representation Stability for Marked Graph Complexes |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
A Dataset of Misleading Narratives Surrounding Recent UK General Elections |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Emergence of Spin-Polarized Unconventional Skin Effect in Hatano-Nelson Model |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
A Study on Improvement of Image Quality in Quantum Polarized Microscopy using an Entangled-Photon Source |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Conversational Process Model Redesign |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
The Brownian marble |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Novel Forms of Early Dark Energy |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
todd |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
a cosmic explosion with a complex off-axis jet and cocoon from a massive progenitor |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
implications for the observed abundance of ultra-violet luminous galaxies at z>10 |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Stabilization of Kac polynomials |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Scalable Bernoulli factories for Bayesian inference with intractable likelihoods |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
The effective energy of a lattice metamaterial |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Non-Markovianity in collision models with initial intra-environment correlations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Boundary Energy-Momentum Tensors for Asymptotically Flat Spacetimes |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Statistical Characterization of Entanglement Degradation Under Markovian Noise in Composite Quantum Systems |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Two-dimensional water waves with constant vorticity and general bottom topography |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Theoretical modeling of approximate universality of tidally deformed neutron stars |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Empowering Scientific Workflows with Federated Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Efficient Data Filtering and Verification for High-Quality LLM Training Data |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
On differentiation of integrals in Lebesgue spaces |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
An efficient second-order cone programming approach for dynamic optimal transport on staggered grid discretization |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
an LLM-based Literary Translation evaluation metric with Professional Question Answering |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Robustly optimal dynamics for active matter reservoir computing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
A new time-dependent quantum theory based on Tsallis’ distribution |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
A Budget-Constrained Routing Perspective |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Evidence of chiral fermion edge modes through geometric engineering of thermal Hall in $α$-RuCl$_3$ |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Variable Selection for Fixed and Random Effects in Multilevel Functional Mixed Effects Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Fermi lune and transdimensional orbital magnetism in rhombohedral multilayer graphene |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Post-Training Compression for Ultra-Low Power Hyperdimensional Computing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Dynamic injection of a compressible gas into a confined porous layer |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |