2023
Active Information Gathering Agent
June 11, 2023
RL
Project
Final project in Deep Reinforcement Learning class; collaborated with Botian Xu.
文献综述:AlphaGo系列文章调研
June 10, 2023
RL
Survey
本文系统梳理了AlphaGo家族算法的原理、变革演进及扩展应用,涵盖AlphaGo到MuZero的发展主线及落地案例。
A Survey of Model-Based Reinforcement Learning
May 28, 2023
RL
Survey
This survey reviews recent advances in model-based reinforcement learning (MBRL), focusing on model learning and policy optimization frameworks.
2022
2021
乐理之和声学(1)
November 12, 2021
Chinese
Music
Notes
谢鹏老师《即兴伴奏》前半学期课程内容整理。部分参考bilibili中BV14x411s7KZ教程。第二期施工可能会引入《调性和声》一书的内容。多图预警。