胸中有泰勒 - 博客园

March 8, 2026

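Summary: 1. What is Tool Calling? Core definition: Tool Calling means giving a large language model (LLM) the ability to use external tools. If the LLM is a well-read "brain", Tool Calling gives it "hands" and "eyes". 1.1 Why does an LLM need tools? (The Why) …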

posted @ 2026-03-08 09:16 by 胸中有泰勒

October 10, 2025

Seed1.5 LLM Technical Report

Summary: Abstract. We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning. …

posted @ 2025-10-10 17:09 by 胸中有泰勒

April 15, 2025

DeepSeek's Core Algorithm GRPO and the Traditional Algorithm PPO

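Summary: The post first presents the PPO algorithm, then walks through GRPO:
1: policy model π_θ ← π_{θ_init}
2: for iteration = 1, ..., I do
3: reference model π_ref ← π_θ
The initial policy model can be a language model that has not been trained yet. This model is taken as the current policy model …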
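As a rough illustration of the three pseudocode steps quoted in the summary, the sketch below shows only the outer loop: the policy starts from the initial parameters, and at the start of every iteration the reference model is reset to a frozen copy of the current policy. The function name `grpo_outer_loop`, the dict-based "parameters", and the `update_policy` callback are hypothetical stand-ins for illustration; the excerpt does not show the inner steps of the algorithm, so they are left as a placeholder here.

```python
import copy


def grpo_outer_loop(theta_init, num_iterations, update_policy):
    """Sketch of steps 1-3 only; the inner training steps are a placeholder."""
    # Step 1: policy model pi_theta <- pi_theta_init
    policy = copy.deepcopy(theta_init)
    references = []  # kept only so the snapshots can be inspected afterwards

    # Step 2: for iteration = 1, ..., I
    for _ in range(num_iterations):
        # Step 3: reference model pi_ref <- pi_theta (a frozen snapshot)
        reference = copy.deepcopy(policy)
        references.append(reference)

        # Placeholder (assumption): whatever updates the policy within
        # one iteration, given the current policy and the frozen reference.
        policy = update_policy(policy, reference)

    return policy, references


if __name__ == "__main__":
    # Toy "model" with a single weight; each update adds 1.0 to it.
    final, refs = grpo_outer_loop(
        {"w": 0.0}, 3, lambda p, ref: {"w": p["w"] + 1.0}
    )
    print(final, [r["w"] for r in refs])
```

Copying the policy rather than aliasing it matters: the reference must stay fixed while the policy changes during the iteration.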

posted @ 2025-04-15 21:29 by 胸中有泰勒

