HybridLeg robots Olaf and Snogie use impact-safe design and self-recovery to enable scalable, real-world hardware ...
Abstract: Efficient multi-agent path finding (MAPF) is essential for large-scale warehousing and logistics systems. Despite the potential of reinforcement learning (RL) methods, current approaches ...
Abstract: Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in ...
An overview of our research on agentic RL. In this work, we systematically investigate three dimensions of agentic RL: data, algorithms, and reasoning modes. Our findings reveal: Real end-to-end ...