Policy Optimization as Predictable Online Learning Problems: Imitation Learning and Beyond
Efficient policy optimization is fundamental to solving real-world reinforcement learning problems, where agent-environment interactions can be costly. In this talk, I will discuss my recent research toward improving policy optimization efficiency from the perspective of…