Learning time-dependent Feedback Policies with Model-Based Policy Search R. Lioutikov Tue, 01.01.2013 PDF Cite