Summary: Efficient Training of Diffusion Transformers
posted @ 2025-04-01 16:54 KeanShi Views (169) Comments (0) Recommendations (0)
Summary: An Explanation of the Principles of Knowledge Distillation (KD)
posted @ 2025-02-05 15:24 KeanShi Views (441) Comments (0) Recommendations (0)
Summary: A Detailed Code-Level Walkthrough of LLaVA
posted @ 2024-12-20 14:48 KeanShi Views (1770) Comments (0) Recommendations (0)
Summary: A Troubleshooting Guide to the LLaVA & LLaVolta Code
posted @ 2024-12-12 15:52 KeanShi Views (573) Comments (0) Recommendations (0)
Summary: LLaVA (Large Language and Vision Assistant), proposed by Haotian Liu (UWM), et al.
posted @ 2024-11-20 16:22 KeanShi Views (978) Comments (0) Recommendations (0)
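Summary: Inference Acceleration for Large Models: KV-cache Explained in Detail with Diagrams and Formula Derivations
posted @ 2024-11-13 20:47 KeanShi Views (699) Comments (0) Recommendations (0)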
Summary: Positional Encoding (PE) in LLMs
posted @ 2024-11-11 23:22 KeanShi Views (769) Comments (0) Recommendations (0)
Summary: Transformer by Google Brain
posted @ 2024-11-08 19:58 KeanShi Views (188) Comments (0) Recommendations (0)
Summary: Qwen Team, Alibaba Group
posted @ 2024-10-27 21:44 KeanShi Views (4912) Comments (0) Recommendations (0)
Summary: FastV, a plug-and-play method proposed by Liang Chen (ICL, Peking University), et al.
posted @ 2024-10-21 15:47 KeanShi Views (428) Comments (0) Recommendations (0)