一文读懂多模态大模型:强化学习技术全面解读 SFT、RLHF、RLAIF、DPO
posted @ 2025-04-15 18:03 ExplorerMan 阅读(2349) 评论(0) 推荐(1)
posted @ 2025-04-15 18:03 ExplorerMan 阅读(2349) 评论(0) 推荐(1)
posted @ 2025-04-09 20:24 ExplorerMan 阅读(131) 评论(0) 推荐(0)
posted @ 2025-04-09 19:43 ExplorerMan 阅读(171) 评论(0) 推荐(0)
posted @ 2025-04-09 19:40 ExplorerMan 阅读(26) 评论(0) 推荐(0)
posted @ 2025-04-01 17:41 ExplorerMan 阅读(302) 评论(0) 推荐(0)
posted @ 2025-03-31 20:59 ExplorerMan 阅读(1270) 评论(0) 推荐(0)
posted @ 2025-03-25 15:08 ExplorerMan 阅读(110) 评论(0) 推荐(0)
posted @ 2025-03-21 15:56 ExplorerMan 阅读(1093) 评论(0) 推荐(0)
posted @ 2025-03-20 16:23 ExplorerMan 阅读(170) 评论(0) 推荐(0)
posted @ 2025-03-19 19:57 ExplorerMan 阅读(320) 评论(0) 推荐(0)