Fundamental Theory
English 中文
This directory collects papers and code implementations related to fundamental theory in embodied AI.
Main Contents
- Cognitive Foundations of Embodied Intelligence
- Computational Models of Embodied Intelligence
- Learning Theory in Embodied Intelligence
- Evaluation Methods for Embodied Intelligence
Manually Added Papers
Auto-Updated Papers
Date | Title | Paper | Code | Rating |
---|---|---|---|---|
2025-06-26 | [Agent-RewardBench] Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-26 | Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-25 | The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-25 | Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-24 | Robotic Perception with a Large Tactile-Vision-Language Model for Physical Property Inference | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-24 | [en] Is an object-centric representation beneficial for robotic manipulation ? | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-24 | [UniTac-NV] UniTac-NV: A Unified Tactile Representation For Non-Vision-Based Tactile Sensors | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-24 | Robust Embodied Self-Identification of Morphology in Damaged Multi-Legged Robots | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-24 | [en] Evolutionary Gait Reconfiguration in Damaged Legged Robots | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-23 | Optimization-Induced Dynamics of Lipschitz Continuity in Neural Networks | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-23 | [TAMMs] TAMMs: Temporal-Aware Multimodal Model for Satellite Image Change Understanding and Forecasting | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-23 | [Matrix-Game] Matrix-Game: Interactive World Foundation Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-23 | [en] Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-23 | [Drive-R1] Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-23 | Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-21 | [DRAMA-X] DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-20 | With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-20 | [en] Kinematic Model Optimization via Differentiable Contact Manifold for In-Space Manipulation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-18 | [EmojiVoice] EmojiVoice: Towards long-term controllable expressivity in robot speech | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-18 | [MEM1] MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-18 | [KG-FGNN] KG-FGNN: Knowledge-guided GNN Foundation Model for Fertilisation-oriented Soil GHG Flux Prediction | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-17 | [EVA02-AT] EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-17 | From Points to Places: Towards Human Mobility-Driven Spatiotemporal Foundation Models via Understanding Places | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-16 | Knowledge Graph Fusion with Large Language Models for Accurate, Explainable Manufacturing Process Planning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-16 | Towards a Formal Specification for Self-organized Shape Formation in Swarm Robotics | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-16 | Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-16 | [en] IKDiffuser: Fast and Diverse Inverse Kinematics Solution Generation for Multi-arm Robotic Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-15 | [en] Building Trustworthy AI by Addressing its 16+2 Desiderata with Goal-Directed Commonsense Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-14 | [AgentOrchestra] AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-14 | A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-13 | Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-12 | Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-12 | [LogiPlan] LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-12 | [SlotPi] SlotPi: Physics-informed Object-centric Reasoning Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-12 | [RICE] RICE: Reactive Interaction Controller for Cluttered Canopy Environment | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-12 | [en] An $O(n$)-Algorithm for the Higher-Order Kinematics and Inverse Dynamics of Serial Manipulators using Spatial Representation of Twists | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-12 | [en] TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | [OctoNav] OctoNav: Towards Generalist Embodied Navigation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | [CausalVQA] CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | Know What You Don’t Know: Uncertainty Calibration of Process Reward Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-11 | [HopaDIFF] HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-10 | Hybrid Reasoning for Perception, Explanation, and Autonomous Action in Manufacturing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-10 | ROS-related Robotic Systems Development with V-model-based Application of MeROS Metamodel | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-09 | [en] SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-09 | Reproducibility in the Control of Autonomous Mobility-on-Demand Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-08 | Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-08 | Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-08 | [Mind the Web] Mind the Web: The Security of Web Use Agents | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-08 | [en] Less is More: some Computational Principles based on Parcimony, and Limitations of Natural Intelligence | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-08 | [en] LLM-Enhanced Rapid-Reflex Async-Reflect Embodied Agent for Real-Time Decision-Making in Dynamically Changing Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-08 | [Theorem-of-Thought] Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-06 | [MOGO] MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-06 | Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-06 | [en] Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-05 | Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-05 | [CzechLynx] CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-05 | Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-05 | [MORSE-500] MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-05 | [en] Towards provable probabilistic safety for scalable embodied AI systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-04 | [SemNav] SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-04 | [AssetOpsBench] AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-04 | Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-04 | [“Don’t Do That!”] “Don’t Do That!”: Guiding Embodied Systems through Large Language Model-based Constraint Generation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-04 | A Framework Leveraging Large Language Models for Autonomous UAV Control in Flying Networks | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-03 | Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-03 | [en] Geometric Visual Servo Via Optimal Transport | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-02 | [iQUEST] iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-06-02 | [Fire360] Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360-Degree Firefighting Videos | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-30 | [GridRoute] GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-30 | Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-30 | [en] P: A Universal Measure of Predictive Intelligence | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-30 | Bi-Manual Joint Camera Calibration and Scene Representation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | Conceptual Framework Toward Embodied Collective Adaptive Intelligence | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | [GAM-Agent] GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | [MAPLE] MAPLE: A Mobile Assistant with Persistent Finite State Machines for Recovery Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | Eye-tracking-Driven Shared Control for Robotic Arms:Wizard of Oz Studies to Assess Design Choices | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | [Data-to-Dashboard] Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | Representing local protein environments with atomistic foundation models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-29 | [MAPLE] MAPLE: A Mobile Agent with Persistent Finite State Machines for Structured Task Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-28 | [3DLLM-Mem] 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-28 | [EPiC] EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-28 | [WorkForceAgent-R1] WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-28 | New Tools are Needed for Tracking Adherence to AI Model Behavioral Use Clauses | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-28 | [en] Spring-Brake! Handed Shearing Auxetics Improve Efficiency of Hopping and Standing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-28 | [ASyMOB] ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-27 | Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-27 | [RRO] RRO: LLM Agent Optimization Through Rising Reward Trajectories | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-27 | [CoDA] CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-27 | Prostate Cancer Screening with Artificial Intelligence-Enhanced Micro-Ultrasound: A Comparative Study with Traditional Methods | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-27 | Don’t Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-26 | [ReasonPlan] ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-26 | Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-26 | [RFTF] RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-26 | [SaSi] SaSi: A Self-augmented and Self-interpreted Deep Learning Approach for Few-shot Cryo-ET Particle Detection | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-26 | [DFIR-Metric] DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-25 | [LIMOPro] LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-23 | [CXReasonBench] CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-23 | Controlled Agentic Planning & Reasoning for Mechanism Synthesis | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-23 | [en] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-23 | [en] Knot So Simple: A Minimalistic Environment for Spatial Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-22 | [Date Fragments] Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-22 | Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-22 | [SpatialScore] SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-22 | [SPaRC] SPaRC: A Spatial Pathfinding Reasoning Challenge | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-22 | [en] SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-22 | [Beyond Correlation] Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-21 | [VERDI] VERDI: VLM-Embedded Reasoning for Autonomous Driving | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-20 | [KORGym] KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-20 | [en] Memory-Centric Embodied Question Answer | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-20 | Visual Agentic Reinforcement Fine-Tuning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-19 | [PLAICraft] PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-19 | [MM-PRM] MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-19 | Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-19 | [PEER pressure] PEER pressure: Model-to-Model Regularization for Single Source Domain Generalization | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-19 | [ARIW-Framework] ARIW-Framework: Adaptive Robust Iterative Watermarking Framework | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-18 | [en] BeliefNest: A Joint Action Simulator for Embodied Agents with Theory of Mind | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-16 | [PoE-World] PoE-World: Compositional World Modeling with Products of Programmatic Experts | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-16 | A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron? | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-16 | [en] PARSEC: Preference Adaptation for Robotic Object Rearrangement from Scene Context | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-16 | [en] Multi-Modal Multi-Task (M3T) Federated Foundation Models for Embodied AI: Potentials and Challenges for Edge Integration | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-16 | [en] Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-14 | [EWMBench] EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-13 | [ARC-NCA] ARC-NCA: Towards Developmental Solutions to the Abstraction and Reasoning Corpus | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-12 | [LAMM-ViT] LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-12 | Learning from Peers in Reasoning Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-12 | Learning Dynamics in Continual Pre-Training for Large Language Models | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-12 | Joint Graph Convolution and Sequential Modeling for Scalable Network Traffic Estimation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-12 | Towards user-centered interactive medical image segmentation in VR with an assistive AI agent | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-12 | [en] Cooperative Assembly with Autonomous Mobile Manipulators in an Underwater Scenario | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-10 | [en] STRIVE: Structured Representation Integrating VLM Reasoning for Efficient Object Navigation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Towards Robust Few-Shot Text Classification Using Transformer Architectures and Dual Loss Strategies | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Adapting a Segmentation Foundation Model for Medical Image Classification | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Ohana trees and Taylor expansion for the $λ$I-calculus. No variable gets left behind or forgotten! | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | [en] Neuro-Symbolic Concepts | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Representation of tensor functions using low-order structural tensor set: two-dimensional point group | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Revisiting the connection of baryon number, lepton number, and operator dimension | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Rydberg atomic spectrum analyzer with microwave-dressed-state-locking and multimode Floquet theory | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Preferential Attachment Trees with Vertex Death: Persistence of the Maximum Degree | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-09 | Bi-LSTM based Multi-Agent DRL with Computation-aware Pruning for Agent Twins Migration in Vehicular Embodied AI Networks | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | [pdf] | yc4ny/SVAD | ⭐️⭐️⭐️ |
2025-05-08 | A Survey | [pdf] | hzxie/awesome-3d-scene-generation | ⭐️⭐️⭐️ |
2025-05-08 | Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | An Omni Foundation Model for Interleaved Multi-Modal Generation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Evaluating Legally Consistent Bias in Machine Learning | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Training Flow Matching Models via Online RL | [pdf] | yifan123/flow_grpo | ⭐️⭐️⭐️ |
2025-05-08 | Generating Physically Stable and Buildable LEGO Designs from Text | [pdf] | AvaLovelace1/LegoGPT | ⭐️⭐️⭐️ |
2025-05-08 | A Solovay-Kitaev theorem for quantum signal processing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Comparison of integral equations used to study $T_{cc}^+$ | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Preference Alignment via Comparison Oracles | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Understanding Perception and Reasoning through Model Merging | [pdf] | shiqichen17/vlm_merging | ⭐️⭐️⭐️ |
2025-05-08 | Primordial black-hole formation and heavy r-process element synthesis from the cosmological QCD transition. Two aspects of an inhomogeneous early Universe | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Marsden–Meyer–Weinstein reduction for $k$-contact field theories | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Representation Stability for Marked Graph Complexes | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Reduced Basis Method for Driven-Dissipative Quantum Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | A Dataset of Misleading Narratives Surrounding Recent UK General Elections | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Emergence of Spin-Polarized Unconventional Skin Effect in Hatano-Nelson Model | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | A Study on Improvement of Image Quality in Quantum Polarized Microscopy using an Entangled-Photon Source | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | towards Spatial Intelligence Thorough Evaluation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Calculation of ground state energy of Lithium and Beryllium based on variational method | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Resolution of the Solar Convective Conundrum? New Results Using the Time-Distance Deep-Focus Method | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Conversational Process Model Redesign | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Novel Forms of Early Dark Energy | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Manifest Gauge Invariance for Structure Dependent Radiative Corrections to Processes Involving Atoms and Nuclei | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Quantum effects in rotating thermal states on anti-de Sitter space-time | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | todd | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | a cosmic explosion with a complex off-axis jet and cocoon from a massive progenitor | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | implications for the observed abundance of ultra-violet luminous galaxies at z>10 | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Stabilization of Kac polynomials | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Scalable Bernoulli factories for Bayesian inference with intractable likelihoods | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Cell size heterogeneity controls crystallization of the developing fruit fly wing | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | The effective energy of a lattice metamaterial | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Asymmetric decay of quantum many-body scars in XYZ quantum spin chains | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Artifact Sharing for Information Retrieval Research | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Non-Markovianity in collision models with initial intra-environment correlations | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Boundary Energy-Momentum Tensors for Asymptotically Flat Spacetimes | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Statistical Characterization of Entanglement Degradation Under Markovian Noise in Composite Quantum Systems | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Two-dimensional water waves with constant vorticity and general bottom topography | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Theoretical modeling of approximate universality of tidally deformed neutron stars | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Empowering Scientific Workflows with Federated Agents | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Efficient Data Filtering and Verification for High-Quality LLM Training Data | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | Sideways on the highways | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | On differentiation of integrals in Lebesgue spaces | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | An efficient second-order cone programming approach for dynamic optimal transport on staggered grid discretization | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
2025-05-08 | an LLM-based Literary Translation evaluation metric with Professional Question Answering | [pdf] | ⚠️ | ⭐️⭐️⭐️ |
📊 Statistics
- Total Papers: 195
- Code Implementations: 5
- Last Updated: June 2025
Last updated: 2025-06-28