Loco-Manipulation for Quadruped Robot: Deploying RL Policies on Unitree Go2

by Zhizhen Zhang, Steph Li

This project develops and deploys a reinforcement-learning control stack for unified locomotion and manipulation (“loco-manipulation”) on a Unitree Go2 quadruped with arm. Traditional pipelines split gait and arm control; this work targets full-body, learned control for agile mobility and interaction.
PPO policies are adapted with Unitree SDK2 and executed on Jetson Orin. A VR/WebRTC layer supplies high-level operator intent alongside autonomous behaviours. The controller uses a hybrid architecture that combines a learned policy with safety guards, synchronises arm and leg motion, and fuses proprioceptive observations with vision. Performance is benchmarked across terrains and manipulation tasks, and Sim2Real strategies such as latency modelling and domain randomisation narrow the gap between simulation and hardware. The system demonstrates real-time VR parallel control with end to end deployment from policy execution to hardware actuation.

🖼️View Poster

Themes

Most Innovative Use of XR

Technologies and Skills

Reinforcement Learning
Unitree Go2
Full-body loco-manipulation control
Jetson Orin deployment
VR teleoperation
Real-time control & synchronisation
Sim2Real
Robotics kinematics & dynamics
Python
Benchmarking & evaluation

School

School of Electrical Engineering and Computer Science

Supervisors

Dr Yadan Luo, Prof Helen Huang

Project Source

Winter Research Program 2025 - UQ Employability