Some Unsloth documentation. It is quite good and worth reading carefully.
https://unsloth.ai/docs/get-started/unsloth-notebooks
Notebooks provided by Unsloth
GitHub repository for the notebooks:
https://github.com/unslothai/notebooks/
https://unsloth.ai/docs/zh/kai-shi-shi-yong/reinforcement-learning-rl-guide
Reinforcement learning (RL) guide
https://unsloth.ai/docs/zh/kai-shi-shi-yong/reinforcement-learning-rl-guide/tutorial-train-your-own-reasoning-model-with-grpo
GRPO tutorial: train your own reasoning model
https://unsloth.ai/docs/zh
The full documentation (Chinese version)
