Weekly Papers
机器人论文速览
按领域分类展示近期机器人、具身智能、运动控制相关论文
数据来源:arXiv / robotics_arXiv_daily · 每日更新
更新于 2026/06/08 15:07
具身大模型
共 12 篇 · 最新 2026-06-05
-
[en] RhinoVLA Technical Report
RhinoVLA 技术报告
-
[en] LARA: Latent Action Representation Alignment for Vision-Language-Action Models
LARA:视觉-语言-动作模型的潜在动作表示对齐
-
[MemDreamer] MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism
MemDreamer:基于层次图记忆和智能体检索机制解耦感知与推理的长视频理解
-
Self-evolving LLM agents with in-distribution Optimization
自进化LLM智能体
-
[Socratic-SWE] Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills
Socratic-SWE:自进化编码智能体
-
[HARP-VLA] HARP-VLA: Human-Robot Aligned Representation Learning for Vision-Language-Action Model
HARP-VLA:面向视觉-语言-动作模型的人机对齐表征学习
-
[DeMaVLA] DeMaVLA: A Vision-Language-Action Foundation Model for Generalizable Deformable Manipulation
DeMaVLA:面向可泛化可变形操作的视觉-语言-动作基础模型
-
[Before Parc Fermé] Before Parc Fermé: RL-Time Pruning for Efficient Embodied LLMs in Autonomous Driving
进站前:面向自动驾驶高效具身大模型的强化学习时剪枝
-
Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization
从细粒度视觉差异中学习:通过上下文视觉对比优化缓解多模态幻觉
-
Multi-Turn Multi-Agent Dialogue for Collaborative Reconstruction Improves VLM Performance on Spatial Reasoning, But Only Barely
协作重建的多轮多智能体对话提升VLM空间推理性能,但效果有限
-
Personalize Your Large Vision-language Models With In-context Prompt Tuning
基于上下文提示微调个性化大型视觉语言模型
-
[Astra] Astra: a generalizable report generation foundation model for 3D computed tomography
Astra:用于3D计算机断层扫描的可泛化报告生成基础模型
强化学习
共 12 篇 · 最新 2026-06-05
-
[en] Affordance-Based Hierarchical Reinforcement Learning for Quadruped Pedipulation
基于可供性的四足机器人足部操作层次强化学习
-
[en] Robotic Policy Adaptation via Weight-Space Meta-Learning
基于权重空间元学习的机器人策略自适应
-
[en] Spline Policy: A Structured Representation for Robot Policies
样条策略:一种机器人策略的结构化表示
-
[en] Learning All-Terrain Locomotion for a Planetary Rover with Actively Articulated Suspension
行星探测车主动铰接悬架的全地形运动学习
-
[Learning] RDGen: Demonstration Generation for High-Quality Robot Learning via Reinforcement Learning
RDGen:基于强化学习的高质量机器人学习演示生成
-
[Don't Fool Me Twice] Don't Fool Me Twice: Adapting to Adversity in the Wild with Experience-Driven Reasoning
别再骗我两次:基于经验驱动推理的野外环境适应性
-
[DRIFT] DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization
DRIFT:用于高效多轮优化的解耦展开与重要性加权微调
-
Skill Reuse as Compression in Agentic RL
智能体强化学习中作为压缩的技能复用
-
Value Functions as Supermartingale Certificates
价值函数作为超鞅证书
-
Answer-Set-Programming-based Abstractions for Reinforcement Learning
基于回答集编程的强化学习抽象
-
[LongTraceRL] LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards
LongTraceRL:从搜索智能体轨迹与评分奖励中学习长上下文推理
-
[Learning to Adapt] Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration
学习适应:通过认知感知探索实现自我改进的Web智能体
运动规划
共 8 篇 · 最新 2026-06-05
-
[en] Predictive Style Matching: Natural and Robust Humanoid Locomotion
预测风格匹配:自然且鲁棒的人形机器人运动
-
[en] Rapid co-design of Buoyancy-assisted robots for Challenging Locomotion using Gaussian Evolutionary Specialists
基于高斯进化专家的浮力辅助机器人快速协同设计
-
[en] Three-dimensional hydro-cluttered locomotion by an undulatory robot
波动机器人的三维水下杂乱环境运动
-
Learning Terrain-Aware Whole-Body Control for Perceptive Legged Loco-Manipulation
学习地形感知的全身控制以用于感知式足式移动操作
-
Actuator-Aware Inverse Kinematics with Joint-Limit Admissibility for Torque-Controlled Redundant Robots
面向扭矩控制冗余机器人的关节极限容许执行器感知逆运动学
-
[IDOL] IDOL: Inverse-Dynamics-Guided Future Prediction for End-to-End Autonomous Driving
IDOL:面向端到端自动驾驶的逆动力学引导未来预测
-
Trajectory Planning for Non-Communicating Mobile Robots using Inverse Optimal Control
基于逆最优控制的非通信移动机器人轨迹规划
-
[en] Adaptive Artificial Time-Delay Control with Barrier Lyapunov Constraints for Euler-Lagrange Robots
欧拉-拉格朗日机器人的带障碍李雅普诺夫约束自适应人工时滞控制
仿真平台
共 6 篇 · 最新 2026-06-05
-
[en] QuadVerse: An Integrated Framework Aligning Visual-Physical Reality for Quadruped Simulation
QuadVerse:对齐四足机器人仿真视觉与物理现实的集成框架
-
Amortized Neural Optimization for Pre-Layout Signal Integrity Design Space Exploration using Differentiable Surrogates
基于可微代理的预布局信号完整性设计空间探索的摊销神经优化
-
Network Recovery from Cascade Data: A Debiased Jacobian-Based Machine Learning Approach
从级联数据恢复网络:基于去偏雅可比的机器学习方法
-
[CoMetaPNS] CoMetaPNS: Continually Meta-learning Personalized Neural Surrogates for Cardiac Electrophysiology Simulations
CoMetaPNS:用于心脏电生理仿真的持续元学习个性化神经代理模型
-
[Learning] Batched Differentiable Rigid Body Dynamics in PyTorch for GPU-Accelerated Robot Learning
PyTorch中面向GPU加速机器人学习的批量可微刚体动力学
-
Simulation of collision avoidance behavior in crowd movement by data-driven approach
基于数据驱动的人群运动避碰行为仿真
具身操作
共 5 篇 · 最新 2026-06-05
-
[en] ActionMap: Robot Policy Learning via Voxel Action Heatmap
ActionMap:基于体素动作热图的机器人策略学习
-
[en] Simulation-Driven Imitation Learning for Biosignals-Free Shared-Autonomy Prosthetic Grasping
仿真驱动的无生物信号共享自主假肢抓取模仿学习
-
[Hyper-Dexterous] Surface Constraint Policy for Learning Surface-Constrained and Dynamically Feasible Robot Skills
超灵巧:面向表面约束与动态可行机器人技能学习的表面约束策略
-
[Haptic Sorter] Haptic Sorter: A Unified Planning Framework for Online Shape Estimation and Real-Time Pose Inference
触觉分拣器:面向在线形状估计与实时位姿推断的统一规划框架
-
Learning Controlled Separation of Small Objects Between Two Fingers with a Tactile Skin
基于触觉皮肤学习双指间小物体的受控分离
机器人安全
共 3 篇 · 最新 2026-06-05
-
[en] Shield-Loco: Shielding Locomotion Policies with Predictive Safety Filtering
Shield-Loco:基于预测安全过滤的运动策略防护
-
[en] Re-imagining ISO 26262 in the Age of Autonomous Vehicles: Enhancing Controllability through Transferability and Predictability
重新审视自动驾驶时代的ISO 26262
-
[Collision] Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration
探究视觉-语言模型在安全人机协作中的碰撞基础
任务规划
共 5 篇 · 最新 2026-06-05
-
[en] CAPE: Contrastive Action-conditioned Parallel Encoding for Embodied Planning
CAPE:用于具身规划的对比动作条件并行编码
-
[en] Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints
复杂逻辑约束下长时序任务规划的神经符号学习
-
On-Device Robotic Planning: Eliminating Inference Redundancy for Efficient Decision-Making
端侧机器人规划:消除推理冗余以实现高效决策
-
[HypoAgent] HypoAgent: An Agentic Framework for Interactive Abductive Hypothesis Generation over Knowledge Graphs
HypoAgent:基于知识图谱的交互式溯因假设生成智能体框架
-
[AutoSci] AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle
AutoSci:面向完整科学研究生命周期的以内存为中心的智能体系统
具身交互
共 5 篇 · 最新 2026-06-05
-
[en] A Multi-Operator Mixed-Reality Interface for Multi-Robot Control and Coordination: Co-Located and Private Workspace Collaboration
多操作员混合现实接口用于多机器人控制与协调
-
[en] What Is My Robot Thinking? Design Considerations for Transparent and Trustworthy Shared Autonomy
我的机器人在想什么?透明可信共享自主性设计考量
-
A study on a Real-Time VR-Based Teleoperation Framework for Manipulator in Dynamic Environment
动态环境下基于VR的机械臂实时遥操作框架研究
-
[en] Shaft-integrated Force Sensing with Transformer-based Dynamics Compensation for Telesurgery
面向远程手术的基于Transformer动力学补偿的轴集成力传感
-
Toward Accessible Mobile Money: A Voice-Driven, Biometrically Secured USSD Automation Framework for Visually Impaired Users
面向无障碍移动支付:面向视障用户的语音驱动生物识别安全USSD自动化框架
环境感知
共 7 篇 · 最新 2026-06-05
-
[en] Does Appearance Help? A Systematic Study of Image-Based Re-Identification in Online 3D Multi-Pedestrian Tracking
外观助益吗?基于图像的在线3D多行人跟踪重识别系统研究
-
Impact of Synthetic Lesional MR Images in Automated Focal Cortical Dysplasia Detection in Low-Data Scenarios
合成病变MR图像影响
-
Video-Based Prediction of In-Flight Particle Characteristics in Atmospheric Plasma Spraying
基于视频的大气等离子喷涂飞行粒子特性预测
-
Mitosis Detection in the Wild: Multi-Tumor and Context-Aware Generalization in the MIDOG 2025 Challenge
野外有丝分裂检测
-
Triangle Splatting SLAM
三角形泼溅SLAM
-
[VolFill] VolFill: Single-View Amodal 3D Scene Reconstruction with Volumetric Flow Matching
VolFill:基于体积流匹配的单视图非模态3D场景重建
-
Learning Global Motion with Compact Gaussians for Feed-Forward 4D Reconstruction
基于紧凑高斯的全局运动学习用于前馈4D重建
基础理论
共 17 篇 · 最新 2026-06-05
-
Supervision versus Demonstration-Based In-Context Learning for Multiword Expression Classification
多词表达分类的上下文学习
-
Automatic, Debiased, and Invariant Counterfactual Generation under General Interventions
一般干预下的自动、去偏与不变反事实生成
-
How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope
AI智能体如何重塑知识工作
-
Unsupervised Continual Clustering via Forward-Backward Knowledge Distillation
无监督连续聚类
-
Covariance Shrinkage via Stochastic Interpolation
通过随机插值的协方差收缩
-
Vision-Language Models Suppress Female Representations Under Ambiguous Input
视觉语言模型在模糊输入下抑制女性表征
-
Skill Availability and Presentation Granularity in Large-Language-Model Agents: A Controlled SkillsBench Study
大语言模型智能体中的技能可用性与呈现粒度:受控SkillsBench研究
-
Positional versus Symbolic Attention Heads: Learning Dynamics, RoPE Geometry, and Length Generalization
位置与符号注意力头:学习动态、RoPE几何与长度泛化
-
[dashi] dashi: A Python library for Dataset Shift Characterization to Support Trustworthy AI Development and Deployment
dashi:支持可信AI开发与部署的数据集偏移表征Python库
-
How can embedding models bind concepts?
嵌入模型如何绑定概念?
-
[Assign and Add] Assign and Add: A Mechanistic Study of Compositional Arithmetic
Assign and Add:组合算术的机制研究
-
Language Models Learn Constructional Semantics, Not To Mention Syntax: Investigating LM Understanding of Paired-Focus Constructions
语言模型学习构式语义而非句法:探究LM对配对焦点构式的理解
-
Effective Biological Representation Learning by Masking Gene Expression
通过掩蔽基因表达进行有效的生物表征学习
-
Separating Secrets from Placeholders: A Hybrid CNN-CodeBERT Framework for Three-Class Credential Leakage Detection
分离秘密与占位符:用于三类凭证泄露检测的混合CNN-CodeBERT框架
-
[DOA] DOA: Training-Free Decoder-Only Attention Policy for Long-Form Simultaneous Translation with SpeechLLMs
DOA:基于SpeechLLM长篇同声传译的免训练仅解码器注意力策略
-
Can Generative AI help people navigate Radical Moral Disagreements? The CONSIDER prototype
生成式AI能帮助人们应对极端道德分歧吗?CONSIDER原型
-
Self-Tuning Regularization for Image Scanning Microscopy
图像扫描显微镜的自调整正则化
世界模型
共 1 篇 · 最新 2026-05-29
-
[AR Forcing] AR Forcing: Towards Long-Horizon Robot Navigation World Model
AR Forcing:迈向长视野机器人导航世界模型
具身导航
共 3 篇 · 最新 2026-05-29
-
[TARIC] TARIC: Memory-Augmented Traversability-Aware Outdoor VLN under Interrupted Semantic Cues
TARIC:中断语义线索下记忆增强的可穿越性感知户外视觉语言导航
-
[LiftNav] LiftNav: Path Planning via Semantic Lifting in TSDF-Guided Gaussian Splatting
LiftNav:基于TSDF引导高斯泼溅语义提升的路径规划
-
The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning
剑、盾与阿喀琉斯之踵:刻画大语言模型在导航规划中空间推理的语言归纳偏置
数据集评测
共 6 篇 · 最新 2026-05-29
-
[en] nuReasoning: A Reasoning-Centric Dataset and Benchmark for Long-Tail Autonomous Driving
nuReasoning:面向长尾自动驾驶的以推理为中心的数据集与基准
-
[SOCO] SOCO: Benchmarking Semantic Object Correspondence in Vision Foundation Models
SOCO:视觉基础模型中的语义对象对应基准测试
-
[FAM-Bench] FAM-Bench: A Multimodal Benchmark for Condition-Aware Food-as-Medicine Reasoning
FAM-Bench:面向条件感知食疗推理的多模态基准
-
[FBHM] FBHM: Functional Benchmarking and Steering of VLMs for Hateful Meme Detection
FBHM:面向仇恨模因检测的VLM功能基准测试与引导
-
Scaling Conversational Hungarian ASR: The BEA-Dialogue+ Corpus
扩展对话式匈牙利语ASR:BEA-Dialogue+语料库
-
[Chem-PerturBridge] Chem-PerturBridge: a harmonized compendium of small molecule perturbation transcriptomic effects
Chem-PerturBridge:小分子扰动转录组效应的协调概要
群体智能
共 1 篇 · 最新 2026-05-29
-
[Dreaming Of Others] Dreaming Of Others: Latent Teammate Modeling In World Models For Multi-Agent Reinforcement Learning
梦见他人:多智能体强化学习中基于世界模型的潜在队友建模