Versatile Loco-Manipulation through Flexible Interlimb Coordination
Xinghao Zhu*1, Yuxin Chen*12, Lingfeng Sun*1, Farzad Niroui1, Simon Le Cleac'h1, Jiuguang Wang1, Kuan Fang13
1RAI Institute   2University of California, Berkeley   3Cornell University
*These authors contributed equally to this work
Abstract

The ability to flexibly leverage limbs for loco-manipulation is essential for enabling autonomous robots to operate in unstructured environments. Yet, prior work on loco-manipulation is often constrained to specific tasks or predetermined limb configurations. In this work, we present ReLIC, an approach that enables versatile loco-manipulation through flexible interlimb coordination. The key to our approach is an adaptive controller that seamlessly bridges the execution of manipulation motions and the generation of stable gaits based on task demands. Through the interplay between two controller modules, ReLIC dynamically assigns each limb for manipulation or locomotion and robustly coordinates them to achieve the task success. Using efficient reinforcement learning in simulation, ReLIC learns to perform stable gaits in accordance with the manipulation goals in the real world. To solve diverse and complex tasks, we further propose to interface the learned controller with different types of task specifications, including target trajectories, contact points, and natural language instructions. Evaluated on 12 real-world tasks that require diverse and complex coordination patterns, ReLIC demonstrates its versatility and robustness by achieving a success rate of 78.9% on average.

Methods
Methods overview diagram

Reinforcement Learning for Interlimb Coordination (ReLIC): Based on various types of task specifications, ReLIC enables the robot to perform versatile loco-manipulation. Conditioned on the dynamic assignment of limb functions, the ReLIC controller generates the actions through the interplay between a model-based module that prioritizes task success, and an RL policy that robustly maintain stable gaits in accordance with the manipulation behaviors. This design decouples the two challenging subproblems without relying on rigid heuristics or sacrificing inter-module coordination.

Ablations
Failures