Summary: Contents: DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models | TL;DR | Method | Data Collection | DeepSeekMath-Base 7B Training and Evaluation | Reinforcement ...
posted @ 2025-07-11 20:08 fariver