Data Analysis for Computer Engineering [EI7503]
| Dozent: | Hao Shen |
| Assistenten: | Domink Meyer (Übung), Martin Knopp (Praktikum) |
| Zielgruppe: | Wahlfach, Ergänzungsvorlesung (Master) |
| ECTS: | 6 |
| Umfang: | 2/1/1 (SWS Vorlesung/Übung/Praktikum) |
| Turnus: | Sommersemester |
| Anmeldung: | TUMonline |
| Zeit & Ort: | Vorlesung: Di. 13.30-15.00 Übung: Di. 15.00-16.00, Z995 Praktikum: Mi. 11.30-13.00, 0999 |
| Beginn: | erste Vorlesung am 16.04.2013 |
Inhalt
One simple prescription for problem solving consists of the following two steps:
- State the problem, in the simplest way possible.
- Find solutions to the stated problem.
Reinforcement learning (RL) is arguably the most general and challenging approach considering such a succinct paradigm. A RL agent interacts with its environment and uses its experience to make decisions towards solving the problem. This lecture will give an overview of basic concepts, techniques, and programming tools used in RL. We will focus on the algorithmic techniques in RL, and cover the following topics (not exclusively):
- Dynamic programming
- Monte Carlo methods
- Temporal difference learning
- Eligibility traces
- Function approximation
- Least-squares temporal difference learning (LSTD) and its generalizations
- Policy gradient methods
- Partially observable Markov decision process
After participating the lecture, students are expected to understand basic concepts in RL and be able to develop several state-of-the-art RL algorithms in real engineering applications.
Course Materials
Week 01: lecture, tutorial (Textbook: Ch. 01, 03)
Week 02: lecture, tutorial (Textbook: Ch. 02, 03, 04)
Week 03: tutorial (Textbook: Ch. 04)
Week 04: lecture (Textbook: Ch. 05)
Week 05: lecture (Textbook: Ch. 06)
Week 06: lecture (Textbook: Ch. 07)
Week 07: lecture (Textbook: Ch. 08)
Week 08: lecture
Week 09: lecture
Assignments
Assignment 0.5: here,
Assignment 1: here
Assignment 2: here, supplementary material: 01, 02, 03
Assignment 3: here
Assignment 4: here
Assignment 5: here
Programming Assignment 01: here, python code
Programming Assignment 02: here, NIPS08, ICML09, python code
Poster 1: Autonomous reinforcement learning on raw visual input data in a real world application
Deep Auto-Encoder Neural Networks in Reinforcement Learning
Poster 2: A Theoretical and Empirical Analysis of Expected Sarsa
Prüfung
- Homework including two programming tasks and one poster presentation of a recent research article in the area (33%)
- Oral examination (66%)
Empfohlene Literatur
- Sutton, R. S. & Barto, A. G., Reinforcement Learning: An Introduction. The MIT Press, 1998 (version 2 in progress)
- Bertsekas, D. P. & Tsitsiklis, J., Neuro-dynamic programming. Athena Scientific, 1996
- Szepesvári, S., Algorithms for Reinforcement Learning. Morgan & Claypool, 2010 (a draft)