國立中興大學教學大綱

課程名稱	(中) 強化式學習(6784)
	(Eng.) Reinforcement Learning
開課單位	電機系
課程類別	選修	學分	3	授課教師	LINDOR ERIC HENRICKSON
選課單位	電系統產 / 產專班	授課使用語言	英文	開課學期	1142
課程簡述	Introduction to theory and algorithms for reinforcement learning (RL), a fundamental area of machine learning & AI where agents learn to optimally make decisions and achieve goals by interacting with their environment. This class will involve a combination of formal lectures as well as homework projects using the Python programming language.
先修課程名稱

課程與核心能力關聯配比(%)

課程目標之教學方法與評量方法

課程目標

核心能力

配比(%)

教學方法

評量方法

(1) Learn the mathematical & algorithmic foundations of Reinforcement Learning
(2) Explore & understand advanced approaches to RL (e.g., Deep Q-learning, Actor-Critic, Policy Gradient)
(3) Use Python to solve hands-on RL programming problems

習作

講授

出席狀況

口頭報告

作業

授課內容(單元名稱與內容、習作/每週授課、考試進度-共16週加自主學習)

週次	授課內容
第1週	Class Introduction & Reinforcement Learning Overview
第2週	Markov Decision Processes (MDPs)
第3週	Dynamic Programming - Prediction & Control
第4週	Monte Carlo Methods
第5週	Temporal Difference Learning
第6週	n-Step Temporal Difference Methods
第7週	Supervised learning, Neural networks & PyTorch
第8週	On-policy Prediction with Function Approximation
第9週	Control with Value Function Approximation
第10週	Policy Gradient Methods
第11週	Actor-Critic Methods
第12週	Evolutionary Algorithms
第13週	Continuing tasks, Rollout Algorithms, Off-policy AC, Multi-agent
第14週	Class Review
第15週	Final project presentations
第16週	Final project presentations (continued)
自主學習內容	Supplemental lecture materials & code examples (self-study)

學習評量方式

Homework 20%, Final Project 30%, Attendance 50%

教科書＆參考書目(書名、作者、書局、代理商、說明)

Reinforcement Learning: An Introduction, 2nd Ed., R. Sutton & A. Barto (MIT Press, 2018)

課程教材（教師個人網址請列在本校內之網址）

課程輔導時間

聯合國全球永續發展目標(連結網址)

提供體驗課程：N

請尊重智慧財產權及性別平等意識，不得非法影印他人著作。
更新日期西元年/月/日：2026/01/08 15:10:58	列印日期西元年/月/日：2026 / 5 / 31
MyTB教科書訂購平台：http://www.mytb.com.tw/