Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion

Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion | IEEE Journals & Magazine | IEEE Xplore