國立中興大學教學大綱

課程名稱	(中) 強化式學習(6784)
	(Eng.) Reinforcement Learning
開課單位	電機系
課程類別	選修	學分	3	授課教師	LINDOR ERIC HENRICKSON
選課單位	電機控產專 / 產專班	授課使用語言	英文	英文/EMI	N	開課學期	1122
課程簡述	Introduction to theory and algorithms for reinforcement learning (RL), a fundamental area of machine learning & AI where agents learn to optimally make decisions and achieve goals by interacting with their environment. This class will involve a combination of formal lectures as well as homework projects using the Python programming language.
先修課程名稱						課程含自主學習	N

課程與核心能力關聯配比(%)

課程目標之教學方法與評量方法

課程目標

核心能力

配比(%)

教學方法

評量方法

(1) Learn the mathematical & algorithmic foundations of Reinforcement Learning
(2) Explore & understand advanced approaches to RL (e.g., Deep Q-learning, Actor-Critic, Policy Gradient)
(3) Use Python to solve hands-on RL programming problems

習作

講授

口頭報告

作業

授課內容(單元名稱與內容、習作/每週授課、考試進度-共18週)

週次	授課內容
第1週	Class Introduction & Reinforcement Learning Overview
第2週	Python Review & Software Development Best Practices
第3週	Markov Decision Processes (MDPs)
第4週	Dynamic Programming - Prediction & Control
第5週	Monte Carlo Methods
第6週	Temporal Difference Learning
第7週	n-Step Temporal Difference Methods
第8週	Supervised learning, Neural networks & PyTorch
第9週	On-policy Prediction with Function Approximation
第10週	Control with Value Function Approximation
第11週	Policy Gradient Methods
第12週	Actor-Critic Methods
第13週	Evolutionary Algorithms
第14週	Rollout Algorithms, Off-policy AC, Multiagent
第15週	Class Review
第16週	Final presentation preparation
第17週	Final project presentations
第18週	Final project presentations (continued)

學習評量方式

Homework 50%, Final Project 50%

教科書＆參考書目(書名、作者、書局、代理商、說明)

Reinforcement Learning: An Introduction, 2nd Ed., R. Sutton & A. Barto (MIT Press, 2018)

課程教材（教師個人網址請列在本校內之網址）

課程輔導時間

聯合國全球永續發展目標

提供體驗課程：N

請尊重智慧財產權及性別平等意識，不得非法影印他人著作。
更新日期西元年/月/日：2024/02/23 10:07:20	列印日期西元年/月/日：2024 / 5 / 03
MyTB教科書訂購平台：http://www.mytb.com.tw/