Information
Code | CENG0049 |
Name | Deep Reinforcement Learning |
Term | 2024-2025 Academic Year |
Semester | . Semester |
Duration (T+A) | 3-0 (T-A) (17 Week) |
ECTS | 6 ECTS |
National Credit | 3 National Credit |
Teaching Language | Türkçe |
Level | Doktora Dersi |
Type | Normal |
Mode of study | Yüz Yüze Öğretim |
Catalog Information Coordinator | Mehmet SARIGÜL |
Course Goal / Objective
The goal of a Deep Reinforcement Learning course is to provide students with a comprehensive understanding of the principles, algorithms, and applications of reinforcement learning (RL) in combination with deep learning techniques. The course aims to equip students with the knowledge and skills necessary to design, implement, and deploy deep RL algorithms to solve complex sequential decision-making problems.
Course Content
This course covers the Introduction to Reinforcement Learning, Basics of Deep Learning, Deep Q-Networks (DQN), Q-Learning and value-based methods, Experience replay and target networks, Extensions and improvements to DQN (e.g., Double DQN, Dueling DQN), Policy Gradient Methods, Proximal Policy Optimization (PPO), Trust Region Policy Optimization (TRPO), Proximal Policy Optimization algorithms, Exploration and Exploitation, Exploration in deep reinforcement learning, Model-Based Reinforcement Learning, Advanced Topics in Deep RL, Deep RL for Robotics and Control, Deep RL in Natural Language Processing, Deep RL in Game Playing, Deep RL agents for game playing (e.g., AlphaGo, AlphaZero), Techniques for value function estimation in games, Monte Carlo methods and tree search algorithms, Ethical and Societal Considerations, Research Papers and Recent Advancements.
Course Precondition
Knowledge of basic programming, linear algebra, and probability theory.
Resources
Sewak, M. (2019). Deep reinforcement learning. Singapore: Springer Singapore.
Notes
Sutton, Richard S., and Andrew G. Barto. Reinforcement learning: An introduction. MIT press, 2018.
Course Learning Outcomes
Order | Course Learning Outcomes |
---|---|
LO01 | Understanding of the fundamentals of deep reinforcement learning |
LO02 | Proficiency in Deep Learning Techniques |
LO03 | Knowledge of Deep Reinforcement Learning Algorithms |
LO04 | Ability to Implement Deep RL Algorithms |
Relation with Program Learning Outcome
Order | Type | Program Learning Outcomes | Level |
---|---|---|---|
PLO01 | Bilgi - Kuramsal, Olgusal | On the basis of the competencies gained at the undergraduate level, it has an advanced level of knowledge and understanding that provides the basis for original studies in the field of Computer Engineering. | 3 |
PLO02 | Bilgi - Kuramsal, Olgusal | By reaching scientific knowledge in the field of engineering, he/she reaches the knowledge in depth and depth, evaluates, interprets and applies the information. | 3 |
PLO03 | Yetkinlikler - Öğrenme Yetkinliği | Being aware of the new and developing practices of his / her profession and examining and learning when necessary. | 3 |
PLO04 | Yetkinlikler - Öğrenme Yetkinliği | Constructs engineering problems, develops methods to solve them and applies innovative methods in solutions. | 2 |
PLO05 | Yetkinlikler - Öğrenme Yetkinliği | Designs and applies analytical, modeling and experimental based researches, analyzes and interprets complex situations encountered in this process. | 3 |
PLO06 | Yetkinlikler - Öğrenme Yetkinliği | Develops new and / or original ideas and methods, develops innovative solutions in system, part or process design. | 2 |
PLO07 | Beceriler - Bilişsel, Uygulamalı | Has the skills of learning. | 2 |
PLO08 | Beceriler - Bilişsel, Uygulamalı | Being aware of new and emerging applications of Computer Engineering examines and learns them if necessary. | 3 |
PLO09 | Beceriler - Bilişsel, Uygulamalı | Transmits the processes and results of their studies in written or oral form in the national and international environments outside or outside the field of Computer Engineering. | |
PLO10 | Beceriler - Bilişsel, Uygulamalı | Has comprehensive knowledge about current techniques and methods and their limitations in Computer Engineering. | 1 |
PLO11 | Beceriler - Bilişsel, Uygulamalı | Uses information and communication technologies at an advanced level interactively with computer software required by Computer Engineering. | 2 |
PLO12 | Bilgi - Kuramsal, Olgusal | Observes social, scientific and ethical values in all professional activities. | 2 |
Week Plan
Week | Topic | Preparation | Methods |
---|---|---|---|
1 | Introduction to reinforcement learning principles and terminology. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
2 | Basics of deep learning, including artificial neural networks and training techniques. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
3 | Q-Learning and value-based methods in reinforcement learning. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
4 | Deep Q-Networks (DQN) and experience replay. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
5 | Policy optimization and the REINFORCE algorithm. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
6 | Actor-Critic methods and advantage functions. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
7 | Proximal Policy Optimization (PPO) and policy gradients. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
8 | Mid-Term Exam | Ölçme Yöntemleri: Yazılı Sınav |
|
9 | Exploration and exploitation trade-offs in deep RL. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
10 | Model-based reinforcement learning and model learning techniques. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
11 | Monte Carlo Tree Search and its applications. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
12 | Advanced topics in deep RL, such as hierarchical RL or meta-learning. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
13 | Deep RL for robotics and control systems. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
14 | Deep RL for game playing and game theory. | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
15 | Review | Reading the lecture notes | Öğretim Yöntemleri: Anlatım |
16 | Term Exams | Ölçme Yöntemleri: Yazılı Sınav |
|
17 | Term Exams | Ölçme Yöntemleri: Yazılı Sınav |
Student Workload - ECTS
Works | Number | Time (Hour) | Workload (Hour) |
---|---|---|---|
Course Related Works | |||
Class Time (Exam weeks are excluded) | 14 | 3 | 42 |
Out of Class Study (Preliminary Work, Practice) | 14 | 5 | 70 |
Assesment Related Works | |||
Homeworks, Projects, Others | 0 | 0 | 0 |
Mid-term Exams (Written, Oral, etc.) | 1 | 14 | 14 |
Final Exam | 1 | 28 | 28 |
Total Workload (Hour) | 154 | ||
Total Workload / 25 (h) | 6,16 | ||
ECTS | 6 ECTS |