From 4a9f45b2dff8f621612bbdacfb46c4ad10c99583 Mon Sep 17 00:00:00 2001
From: wsy182 <2392948297@qq.com>
Date: Mon, 2 Dec 2024 13:20:57 +0800
Subject: [PATCH] Update README.md

---
 README.md | 12 +++++++++++-
 1 file changed, 11 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 448b990..28f2958 100644
--- a/README.md
+++ b/README.md
@@ -205,4 +205,14 @@ TensorBoard typically records and visualizes a variety of training metrics. The ones you mentioned
 - **`train/entropy_loss`**: Entropy loss; reflects how much the policy explores.
 - **`train/clip_range`**: Clipping range; reflects the limit placed on policy updates.
 - **`train/clip_fraction`**: Fraction of samples that were clipped; reflects the stability of policy updates.
-- **`train/approx_kl`**: Approximate KL divergence; reflects the magnitude and stability of policy updates.
\ No newline at end of file
+- **`train/approx_kl`**: Approximate KL divergence; reflects the magnitude and stability of policy updates.
+
+
+
+
+
+## References
+
+https://github.com/mangenotwork/CLI-Sichuan-Mahjong // Golang command-line Mahjong
+
+https://github.com/lauyikfung/SichuaMahjongAI // SichuaMahjongAI
\ No newline at end of file