Smooth Imitation Learning via Smooth Costs and Smooth Policies

reinforcement_learning