This project implements a hybrid control strategy combining Reinforcement Learning (RL) with Model Predictive Control (MPC) for robust and adaptive bipedal walking on uneven terrain. The RL component learns a policy for nominal walking, while MPC provides real-time corrections for disturbances.
The hybrid architecture consists of two main components:
The hybrid controller showed significant improvement over standalone RL or MPC approaches, demonstrating: