随笔档案「2025年7月11日」：[PaperReading] DeepSeekMath: Pushing the... - fariver

2025年7月11日

[PaperReading] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

摘要：目录DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language ModelsTL;DRMethodData CollectionDeepSeekMath-Base 7B训练与评估Reinforcement 阅读全文

posted @ 2025-07-11 20:08 fariver 阅读(168) 评论(0) 推荐(0)

fariver

公告