東海課程-哲學系價值與文化：AI對齊問題的哲學探討(甘偵蓉老師1122-0378)

0378 - 價值與文化：AI對齊問題的哲學探討 The Alignment Problem: A Philosophical Investigation

教育目標 Course Target

本課程從不同哲學角度探討機器學習AI系統的對齊問題，其中包括知識論、形上學、認知道德哲學、倫理學、語言哲學，檢視AI開發者擬設計能理解並依循人類價值觀、道德和意向的AI系統可能面臨的挑戰和考量。This course explores the alignment of machine learning AI systems from different philosophical perspectives, including epistemology, metaphysics, cognitive moral philosophy, ethics, and language philosophy, and examines AI developers' plans to design systems that can understand and follow human values, morals, and intentions. Challenges and considerations that AI systems may face.

參考書目 Reference Books

1. Russell, S., & Norvig, P. (2021). Chap. 1 Introduction. In Artificial intelligence: A modern approach (4th ed.). University of California, Berkeley.
2. Christian, B. R. (2021). Introduction. In The alignment problem: Machine learning and human values. Norton, W. W. & Company, Inc.
3. Gabriel, I. (2020). Artificial intelligence, values, and alignment. Minds & Machines, 30, 411–437. https://doi.org/10.1007/s11023-020-09539-2
4. Christian, B. R. (2021). Chap. 1 Representation. In The alignment problem: Machine learning and human values. Norton, W. W. & Company, Inc.
5. Ratoff, W. (2021). Can the predictive processing model of the mind ameliorate the value-alignment problem? Ethics and Information Technology, 23, 739–750. https://doi.org/10.1007/s10676-021-09611-0
6. Russell, S. (2019). Chap. 7 AI: A different approach & Chapter 8 Provably beneficial AI & Chapter 10 Problem solved? In Human compatible: Artificial intelligence and the problem of control. Penguin.
7. Cruz, J. M. (2019). Shared moral foundations of embodied artificial intelligence. https://sites.williams.edu/jcruz/files/2019/04/AIEthics.pdf
8. Aligned with Whom? Direct and social goals for AI systems" by Anton Korinek, Avital Balwit (2022)
9. Kasirzadeh, A., & Gabriel, I. (2023). In conversation with artificial intelligence: Aligning language models with human values. Philosophy & Technology, 36(1), 27.https://doi.org/10.1007/s13347-023-00606-x

1. Russell, S., & Norvig, P. (2021). Chap. 1 Introduction. In Artificial intelligence: A modern approach (4th ed.). University of California, Berkeley.
2. Christian, B. R. (2021). Introduction. In The alignment problem: Machine learning and human values. Norton, W. W. & Company, Inc.
3. Gabriel, I. (2020). Artificial intelligence, values, and alignment. Minds & Machines, 30, 411–437. https://doi.org/10.1007/s11023-020-09539-2
4. Christian, B. R. (2021). Chap. 1 Representation. In The alignment problem: Machine learning and human values. Norton, W. W. & Company, Inc.
5. Ratoff, W. (2021). Can the predictive processing model of the mind ameliorate the value-alignment problem? Ethics and Information Technology, 23, 739–750. https://doi.org/10.1007/s10676-021- 09611-0
6. Russell, S. (2019). Chap. 7 AI: A different approach & Chapter 8 Provably beneficial AI & Chapter 10 Problem solved? In Human compatible: Artificial intelligence and the problem of control. Penguin.
7. Cruz, J. M. (2019). Shared moral foundations of embodied artificial intelligence. https://sites.williams.edu/jcruz/files/2019/04/AIEthics.pdf
8. Aligned with Whom? Direct and social goals for AI systems" by Anton Korinek, Avital Balwit (2022)
9. Kasirzadeh, A., & Gabriel, I. (2023). In conversation with artificial intelligence: Aligning language models with human values. Philosophy & Technology, 36(1), 27. https://doi.org/10.1007/ s13347-023-00606-x

評分方式 Grading

評分項目 Grading Method	配分比例 Grading percentage	說明 Description
閱讀筆記與提問單閱讀筆記與提問單 Reading notes and question sheets	60	在每次上課前必須繳交閱讀文章重點與問題500字
個人/分組期末口頭報告個人/分組期末口頭報告 Individual/group final oral presentation	10	同儕互評與老師評分加總
參與和討論參與和討論 Participate and discuss	10	含出席及參與課程討論

交換生/外籍生選課登記 - 請點選下方按鈕加入登記清單，再列印出選課申請表給任課教師簽名
Add this class to your wishlist by click the button below.

請先登入才能進行選課登記 Please login first

相似課程 Related Course

很抱歉，沒有符合條件的課程。

Description

學分 Credit：0-1
上課時間 Course Time：Friday/3,4[H207]
授課教師 Teacher：甘偵蓉
修課班級 Class：哲學系2-4
選課備註 Memo：密集授課，隔週授課，共9次，每次2小時

選課狀態 Attendance

There're now 15 person in the class.
目前選課人數為 15 人。

請先登入才能進行選課登記 Please login first

Home

哲學系

course information of 112 - 2 | 0378 The Alignment Problem: A Philosophical Investigation(價值與文化：AI對齊問題的哲學探討)