The quantum cartpole: A benchmark environment for non-linear reinforcement learning
Kai Meinerz, Simon Trebst, Mark Rudner, Evert van Nieuwenburg
SciPost Phys. Core 7, 026 (2024) · published 7 May 2024
- doi: 10.21468/SciPostPhysCore.7.2.026
Abstract
Feedback-based control is the de facto standard for controlling classical stochastic systems and processes. However, standard feedback-based control methods are challenged by quantum systems due to measurement-induced backaction and partial observability. Here we remedy this by using weak quantum measurements and model-free reinforcement learning agents to perform quantum control. By comparing control algorithms with and without state estimators on the task of stabilizing a quantum particle in an unstable state near a local potential energy maximum, we show how a trade-off between state estimation and controllability arises. For the scenario where the classical analogue is highly non-linear, the reinforcement-learned controller has an advantage over the standard controller. Additionally, we demonstrate the feasibility of using transfer learning to develop a quantum control agent trained via reinforcement learning on a classical surrogate of the quantum control problem. Finally, we present results showing how the reinforcement learning control strategy differs from the classical controller in the non-linear scenarios.
Authors / Affiliations
- 1 Kai Meinerz,
- 1 Simon Trebst,
- 2 Mark Rudner,
- 3 Evert van Nieuwenburg
- 1 Universität zu Köln / University of Cologne [UoC]
- 2 University of Washington [UW]
- 3 Instituut Lorentz / Lorentz Institute