2025-06-26 |
Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-26 |
[STEP Planner] STEP Planner: Constructing cross-hierarchical subgoal tree as an embodied long-horizon task planner |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-25 |
[SPARK] SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-25 |
[PSALM-V] PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-25 |
Generating and Customizing Robotic Arm Trajectories using Neural Networks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-24 |
[Mem4Nav] Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-24 |
[The MOTIF Hand] The MOTIF Hand: A Robotic Hand for Multimodal Observations with Thermal, Inertial, and Force Sensors |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-23 |
[SViP] SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-23 |
Robotic Manipulation of a Rotating Chain with Bottom End Fixed |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-22 |
Adapting Vision-Language Models for Evaluating World Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-21 |
[VLA-OS] VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-21 |
[CLiViS] CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-20 |
Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-19 |
An Optimization-Augmented Control Framework for Single and Coordinated Multi-Arm Robotic Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-18 |
[FindingDory] FindingDory: A Benchmark to Evaluate Memory in Embodied Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-18 |
[HEAL] HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-17 |
Can Pretrained Vision-Language Embeddings Alone Guide Robot Navigation? |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-17 |
[CDP] CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-17 |
[en] TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-17 |
[NetRoller] NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-12 |
[Mirage-1] Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-12 |
An $O(n$)-Algorithm for the Higher-Order Kinematics and Inverse Dynamics of Serial Manipulators using Spatial Representation of Twists |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-12 |
[Gondola] Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-11 |
A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-09 |
[Multi-robot] Language-Grounded Hierarchical Planning and Execution with Multi-Robot 3D Scene Graphs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-09 |
[HiBerNAC] HiBerNAC: Hierarchical Brain-emulated Robotic Neural Agent Collective for Disentangling Complex Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-08 |
[Prime the search] Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-07 |
[RoboCerebra] RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-06 |
[Astra] Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-06 |
[SPRINT] SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-05 |
[Multi-robot] Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-04 |
Zero-Shot Temporal Interaction Localization for Egocentric Videos |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-04 |
Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-03 |
Geometric Visual Servo Via Optimal Transport |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-03 |
Grounded Vision-Language Interpreter for Integrated Task and Motion Planning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-06-02 |
Agentic Episodic Control |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-31 |
[en] LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-29 |
[Agentic Robot] Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-29 |
Toward Memory-Aided World Models: Benchmarking via Spatial Consistency |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-29 |
[OWL] OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-28 |
[DORAEMON] DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-28 |
Reinforced Reasoning for Embodied Planning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-28 |
[LabUtopia] LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-28 |
Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-28 |
Spring-Brake! Handed Shearing Auxetics Improve Efficiency of Hopping and Standing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-27 |
[PartInstruct] PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-26 |
Deep Active Inference Agents for Delayed and Long-Horizon Environments |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-26 |
[Task Memory Engine] Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-23 |
[ComfyMind] ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-23 |
[USTBench] USTBench: Benchmarking and Dissecting Spatiotemporal Reasoning of LLMs as Urban Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-23 |
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-23 |
[en] BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-22 |
Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-21 |
[HCRMP] HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-21 |
[UAV-Flow Colosseo] UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
[APEX] APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
Fast and scalable multi-robot deployment planning under connectivity constraints |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-20 |
[Multi-Agent] Think, Reflect, Create: Metacognitive Learning for Zero-Shot Robotic Planning with LLMs |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-19 |
Granular Loco-Manipulation: Repositioning Rocks Through Strategic Sand Avalanche |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-17 |
[OneTwoVLA] OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-15 |
[SRT-H] SRT-H: A Hierarchical Framework for Autonomous Surgery via Language Conditioned Imitation Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-15 |
[FlowDreamer] FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-15 |
[AORRTC] AORRTC: Almost-Surely Asymptotically Optimal Planning with RRT-Connect |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
Symbolically-Guided Visual Plan Inference from Uncurated Video Data |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-13 |
Multi-step manipulation task and motion planning guided by video demonstration |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
Revisiting the Excess Volatility Puzzle Through the Lens of the Chiarella Model |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
Topological Indices Among Strong Support Vertex |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
Piloting Structure-Based Drug Design via Modality-Specific Optimal Schedule |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
[CHD] CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-12 |
Cooperative Assembly with Autonomous Mobile Manipulators in an Underwater Scenario |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-11 |
Efficient Robotic Policy Learning via Latent Space Backward Planning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Neuro-Symbolic Concepts |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Leakage-resilient Algebraic Manipulation Detection Codes with Optimal Parameters |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Preferential Attachment Trees with Vertex Death: Persistence of the Maximum Degree |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
New Advances in Phonons: From Band Topology to Quasiparticle Chirality |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Alternating Methods for Large-Scale AC Optimal Power Flow with Unit Commitment |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
Above-room-temperature ferromagnetism in large-area epitaxial Fe3GaTe2/graphene van der Waals heterostructures |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-09 |
An Empirical Study of Fuzz Harness Degradation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation |
[pdf] |
yc4ny/SVAD |
⭐️⭐️⭐️ |
2025-05-08 |
A Survey |
[pdf] |
hzxie/awesome-3d-scene-generation |
⭐️⭐️⭐️ |
2025-05-08 |
Predicting Structure and Motion via Ray Origin and Endpoint Diffusion |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
An Omni Foundation Model for Interleaved Multi-Modal Generation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Evaluating Legally Consistent Bias in Machine Learning |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Training Flow Matching Models via Online RL |
[pdf] |
yifan123/flow_grpo |
⭐️⭐️⭐️ |
2025-05-08 |
Generating Physically Stable and Buildable LEGO Designs from Text |
[pdf] |
AvaLovelace1/LegoGPT |
⭐️⭐️⭐️ |
2025-05-08 |
Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Comparison of integral equations used to study $T_{cc}^+$ |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Preference Alignment via Comparison Oracles |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Understanding Perception and Reasoning through Model Merging |
[pdf] |
shiqichen17/vlm_merging |
⭐️⭐️⭐️ |
2025-05-08 |
Primordial black-hole formation and heavy r-process element synthesis from the cosmological QCD transition. Two aspects of an inhomogeneous early Universe |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Marsden–Meyer–Weinstein reduction for $k$-contact field theories |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Representation Stability for Marked Graph Complexes |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Reduced Basis Method for Driven-Dissipative Quantum Systems |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
A Dataset of Misleading Narratives Surrounding Recent UK General Elections |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Emergence of Spin-Polarized Unconventional Skin Effect in Hatano-Nelson Model |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
A Study on Improvement of Image Quality in Quantum Polarized Microscopy using an Entangled-Photon Source |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
towards Spatial Intelligence Thorough Evaluation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Calculation of ground state energy of Lithium and Beryllium based on variational method |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Resolution of the Solar Convective Conundrum? New Results Using the Time-Distance Deep-Focus Method |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Conversational Process Model Redesign |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
The Brownian marble |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Novel Forms of Early Dark Energy |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Quantum effects in rotating thermal states on anti-de Sitter space-time |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
todd |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
a cosmic explosion with a complex off-axis jet and cocoon from a massive progenitor |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
implications for the observed abundance of ultra-violet luminous galaxies at z>10 |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Stabilization of Kac polynomials |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Scalable Bernoulli factories for Bayesian inference with intractable likelihoods |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Cell size heterogeneity controls crystallization of the developing fruit fly wing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
The effective energy of a lattice metamaterial |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Artifact Sharing for Information Retrieval Research |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Non-Markovianity in collision models with initial intra-environment correlations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Boundary Energy-Momentum Tensors for Asymptotically Flat Spacetimes |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Statistical Characterization of Entanglement Degradation Under Markovian Noise in Composite Quantum Systems |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Two-dimensional water waves with constant vorticity and general bottom topography |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Theoretical modeling of approximate universality of tidally deformed neutron stars |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Empowering Scientific Workflows with Federated Agents |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Efficient Data Filtering and Verification for High-Quality LLM Training Data |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
On differentiation of integrals in Lebesgue spaces |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
An efficient second-order cone programming approach for dynamic optimal transport on staggered grid discretization |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
an LLM-based Literary Translation evaluation metric with Professional Question Answering |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Regularization by noise for the energy- and mass-critical nonlinear Schrödinger equations |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |
2025-05-08 |
Robustly optimal dynamics for active matter reservoir computing |
[pdf] |
⚠️ |
⭐️⭐️⭐️ |