Update README.md

dev
wsy182 2024-12-01 18:35:49 +08:00
parent 9cfd4f88af
commit 3ff448b15d
1 changed files with 2 additions and 9 deletions

View File

@ -255,16 +255,9 @@
**示例方法**
- Imitation Learning + Reinforcement Learning
:
- Imitation Learning + Reinforcement Learning:
- 先使用监督学习模仿玩家风格,再用强化学习微调策略。
- AlphaZero-like Framework
:
- AlphaZero-like Framework:
- 结合深度强化学习和搜索(如 MCTS强化对局策略。
**适用场景**