Rapid Locomotion via Reinforcement Learning
Gabriel Margolis* ¹, Ge Yang* ¹ ², Kartik Paigwar ¹, Tao Chen ¹, Pulkit Agrawal ¹ ²
¹ MIT Improbable AI Lab ² NSF AI Institute for Artificial Intelligence and Fundamental Interactions
Abstract: Agile maneuvers such as sprinting and high-speed turning in the wild are challenging for legged robots. We present an end-to-end learned controller that achieves record agility for the MIT Mini Cheetah, sustaining speeds up to 3.9 m/s. This system runs and turns fast on natural terrains like grass, ice, and gravel and responds robustly to disturbances. Our controller is a neural network trained in simulation via reinforcement learning and transferred to the real world. The two key components are (i) an adaptive curriculum on velocity commands and (ii) an online system identification strategy for sim-to-real transfer leveraged from prior work.
Indoor Sprint (3.9 m/s)
Outdoor 10-Meter Dash (3.4 m/s)
Indoor Spin Test (5.7 rad/s)
Spinning on Icy Patch
Climbing a Gravel Hill
Recovery from Tripping
High-Speed Controlled Failure
Operating with Stuck Joint
Media Coverage
Wired | Popular Science | Gizmodo | The Verge | MIT News
Bibtex
Our Related Projects
Gabriel Margolis, Tao Chen, Kartik Paigwar, Xiang Fu, Donghyun Kim, Sangbae Kim, Pulkit Agrawal
External Related Projects
Acknowledgment
The authors thank the members of the Improbable AI Lab and the Biomimetic Robotics Laboratory for providing valuable feedback on the project direction and the manuscript. We are grateful to MIT Supercloud and the Lincoln Laboratory Supercomputing Center for providing HPC resources. The Mini Cheetah robot used in this work was donated by the MIT Biomimetic Robotics Laboratory and NAVER. The Biomimetic Robotics Laboratory also provided hardware support for the robot. This research was supported by the DARPA Machine Common Sense Program and in part by the MIT-IBM Watson AI Lab, and the National Science Foundation under Cooperative Agreement PHY-2019786 (The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, http://iaifi.org/). This research was also sponsored by the United States Air Force Research Laboratory and the United States Air Force Artificial Intelligence Accelerator and was accomplished under Cooperative Agreement Number FA8750-19-2-1000.