Robot Learning from Demonstration and Interaction (CS309)

  • semester
    spring 2021
  • time
    tue 15:30-17:00
    thu 15:30-17:00
  • lecture room
    Zoom (via Canvas)
  • lab
    AHG
  • instructor
    Rudolf Lioutikov
  • e-mail

Course Description

Currently, most robots are programmed meticulously by robotics experts in a controlled laboratory setting—a model that is obviously not scalable to large-scale deployment. Robot learning from demonstration has emerged as an alternative paradigm for teaching robots new tasks by simply showing them what to do, rather than by writing code. Thus, learning from demonstration research tries to answer the question: “How can robots learn to interpret and generalize human demonstrations?” Solving this core research problem will enable the next generation of personal robots to revolutionize the home and workplace in coming years.

This research stream will place students at the cutting-edge of robot learning from demonstration research, working with robots to perform complex manipulation tasks, such as autonomously building IKEA furniture. Students will be given instruction in three core areas of robotics: manipulation, perception, and human-robot interaction. Additionally, students will learn and practice programming skills via hands-on mini-projects in each of these areas. After these key competencies have been acquired, students will devise and implement research projects in the area of learning from demonstration.

Organization

Class sessions will be held in PAR 208 on Tuesdays and Thursdays from 15:30 to 17:00 if not announced otherwise. Attendance is mandatory. Students are expected to email the instructor in advance to inform of any potential absences.

Prerequisites

None, except some intrinsic motivation and a healthy thirst for knowledge.

Programming Language

Course programming assignments will be in Python 3. We do not assume that students have previous experience with the language, but the students are expected to learn the basics very rapidly.

Coursework

There are six types of assignments that the students are expected to complete.

Homework Exercises

Throughout the semester, problem sets will be assigned and graded. The goal of these problems is to get the students comfortable with the material and prepared for the final project.

Lecture Notes

Students will be randomly selected to present a summary of the last two lectures at the next lecture.

Final Project

In the final project the students apply learned methods on a robot learning task performed on a collaborative robot platform. The students report their acquired knowledge in a short research paper as well as a short project presentation.

Reading Assignments

In addition, the students are expected to read through literature related to the course and the final project. In particular, there will be 4 nongraded reading assignments corresponding to different sections of learning from demonstration survey by Argall et al.

Grading

The overall grade will consist of five of the coursework assignments. Grades will be assigned using both plus and minus grades.

Lecture Schedule

The schedule is subject to change due to pace of class, see website for updates.

Assignment dates will be announced shortly!

Inverse Reinforcement Learning
Day Topic Readings Assignments
Tue 01/19 Motivation, Organization    
Thu 01/21 Workspace Setup    
Tue 01/26 Python Introduction    
Thu 01/28 Linear Algebra Refresher    
Tue 02/02 Regression: OLS, Ridge Regression    
Thu 02/04 Regression: Lasso, Elastic Nets    
Tue 02/09 Regression: Features, Polynomial Regression    
Tue 02/11 Robotics: basics, position, orientation    
Tue 02/16 Robotics: homogeneous transformation    
Thu 02/18 Robotics: Inverse Kinematics, Singularities, Dynamics    
Tue 02/23 Teaching Types: Kinesthetic, Tele-Operation, Observation
Designing (Collaborative) Tasks: goals, requirements, challenges
1. Introduction
2. Design choices
 
Thu 02/25 Trajectory Representation    
Tue 03/02 Classification: FLD, Perceptron    
Thu 03/04 Classification: SVM    
Tue 03/09 Clustering: KDE, KNN  
Thu 03/11 Clustering: Mean-Shift, K-means    
Tue 03/23 Evaluation: Model Selection, Cross Validation    
Thu 03/25 Reinforcement Learning: MDP, Features    
Tue 03/30 Pytorch Introduction    
Thu 04/01 Pytorch Continued    
Tue 04/06 Reinforcement Learning: Bellman Optimality, Exploration vs Exploitation    
Thu 04/08 Imitation Learning: Behavioral Cloning 4. Deriving a policy:
The source of the
state to action mapping
 
Tue 04/13 Imitation Learning: Inverse Reinforcement Learning    
Thu 04/15 Imitation Learning: Inverse Reinforcement Learning    
Tue 04/20 Project Meetings   Final Project
Thu 04/22 Project Meetings  
Tue 04/27 Project Meetings  
Thu 04/29 Project Meetings  
Tue 05/04 Project Meetings  
Thu 05/06 Project Meetings 5. Limitations of the
demonstration dataset
6. Future directions
7. Conclusion
Tue 05/14
(subject to change)
Final Project Presentations    

Office Hours

Several office hours are offered by various mentors. The office hours are held virtually via zoom. Always show up at the beginning of the office. The mentors are instructed to end the office hours if there are no students in the first 20 minutes.

Monday Tuesday Wednesday Thursday Friday
09:30
10:00 AditiAditi
10:30
11:00 SydneySteveSydney
11:30
12:00
12:30
13:00
13:30 IlhamYian
14:00 Zachary
14:30
15:00 Ilham
15:30 LectureLecture
16:00 Zachary
16:30
17:00 YianSteve
17:30
18:00

Academic Honesty Policy

You are encouraged to discuss the readings and concepts with classmates, but all written work must be your own. Programming assignments must be your own, except for 2-person teams when teams are authorized. You may NOT look online for existing implementations of algorithms related to the programming assignments, even as a reference. Your code will be analyzed by automatic tools that detect plagiarism to ensure that it is original.
Students caught cheating will automatically fail the course and will be reported to the university. If in doubt about the ethics of any particular action, look at the departmental guidelines and/or ask — ignorance of the rules will not shield you from potential consequences.

Notice About Students With Disabilities

The University of Texas at Austin provides upon request appropriate academic accommodations for qualified students with disabilities. For more information, contact the Division of Diversity and Community Engagement — Services for Students with Disabilities at 512-471-6529; 512-471-4641 TTY.

Notice About Missed Work Due To Religious Holy Days

A student who misses an examination, work assignment, or other project due to the observance of a religious holy day will be given an opportunity to complete the work missed within a reasonable time after the absence, provided that he or she has properly notified the instructor. It is the policy of the University of Texas at Austin that the student must notify the instructor at least fourteen days prior to the classes scheduled on dates he or she will be absent to observe a religious holy day. For religious holy days that fall within the first two weeks of the semester, the notice should be given on the first day of the semester. The student will not be penalized for these excused absences, but the instructor may appropriately respond if the student fails to complete satisfactorily the missed assignment or examination within a reasonable time after the excused absence.

Safety and In-Person Participation/Masks

The lecture will be completely online, but in case students volunteer to work with a real robot platform and the lab facilities allow for the student presence special precautions are required.

We will all need to make some adjustments in order to benefit from in-person classroom interactions in a safe and healthy manner. Our best protections against spreading COVID-19 on campus are masks (defined as cloth face coverings) and staying home if you are showing symptoms. Therefore, for the benefit of everyone, this is means that all students are required to follow these important rules.

  • Every student must wear a cloth face-covering properly in class and in all campus buildings at all times
     
  • Students are encouraged to participate in documented daily symptom screening.  This means that each class day in which on-campus activities occur, students must upload certification from the symptom tracking app and confirm that they completed their symptom screening for that day to Canvas.  Students should not upload the results of that screening, just the certificate that they completed it. If the symptom tracking app recommends that the student isolate rather than coming to class, then students must not return to class until cleared by a medical professional.
     
  • Information regarding safety protocols with and without symptoms can be found here.

If a student is not wearing a cloth face-covering properly in the classroom (or any UT building), that student must leave the classroom (and building). If the student refuses to wear a cloth face covering, class will be dismissed for the remainder of the period, and the student will be subject to disciplinary action as set forth in the university’s Institutional Rules/General Conduct 11-404(a)(3). Students who have a condition that precludes the wearing of a cloth face covering must follow the procedures for obtaining an accommodation working with Services for Students with Disabilities.