Not logged in.

Contribution Details

Type Conference or Workshop Paper
Scope Discipline-based scholarship
Published in Proceedings Yes
Title User-Conditioned Neural Control Policies for Mobile Robotics
Organization Unit
Authors
  • Leonard Bauersfeld
  • Elia Kaufmann
  • Davide Scaramuzza
Presentation Type paper
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
ISBN 979-8-3503-2365-8
ISSN 1050-4729
Page Range 1342 - 1348
Event Title 2023 IEEE International Conference on Robotics and Automation, ICRA 2023
Event Type conference
Event Location London, United Kingdom of Great Britain and Northern Ireland
Event Start Date May 29 - 2023
Event End Date June 2 - 2023
Series Name IEEE International Conference on Robotics and Automation. Proceedings
Publisher Institute of Electrical and Electronics Engineers
Abstract Text Recently, learning-based controllers have been shown to push mobile robotic systems to their limits and provide the robustness needed for many real-world applications. However, only classical optimization-based control frameworks offer the inherent flexibility to be dynamically adjusted during execution by, for example, setting target speeds or actuator limits. We present a framework to overcome this shortcoming of neural controllers by conditioning them on an auxiliary input. This advance is enabled by including a feature-wise linear modulation layer (FiLM). We use model-free reinforcement-learning to train quadrotor control policies for the task of navigating through a sequence of waypoints in minimum time. By conditioning the policy on the maximum available thrust or the viewing direction relative to the next waypoint, a user can regulate the aggressiveness of the quadrotor's flight during deployment. We demonstrate in simulation and in real-world experiments that a single control policy can achieve close to time-optimal flight performance across the entire performance envelope of the robot, reaching up to 60 km/h and 4.5 g in acceleration. The ability to guide a learned controller during task execution has implications beyond agile quadrotor flight, as conditioning the control policy on human intent helps safely bringing learning based systems out of the well-defined laboratory environment into the wild.
Digital Object Identifier 10.1109/ICRA48891.2023.10160851
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)