摘要: https://www.modelscope.cn/models/unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF/summary 下载llama-cli https://github.com/ggerganov/llama.cpp/releases 利用model 阅读全文
posted @ 2025-02-05 21:21 bregman 阅读(372) 评论(0) 推荐(1)
摘要: token生成 代码 transformers.generation.GenerationMixin.generate 文档资料 机器如何生成文本? https://cloud.tencent.com/developer/article/1620772 NLP的巨人肩膀 https://zhuanl 阅读全文
posted @ 2025-02-05 15:37 bregman 阅读(83) 评论(0) 推荐(0)