摘要: 原始LLaVA论文: 标题: "Visual Instruction Tuning" arXiv链接: https://arxiv.org/abs/2304.08485 会议: NeurIPS 2023 LLaVA-1.5 论文: 标题: "Improved Baselines with Visua 阅读全文
posted @ 2025-09-15 16:47 jack-chen666 阅读(17) 评论(0) 推荐(0)
摘要: https://github.com/ByteVisionLab/TokenFlow https://arxiv.org/abs/2412.03069 阅读全文
posted @ 2025-09-15 09:52 jack-chen666 阅读(10) 评论(0) 推荐(0)
摘要: https://arxiv.org/abs/2503.09573 阅读全文
posted @ 2025-09-15 09:46 jack-chen666 阅读(15) 评论(0) 推荐(0)
摘要: https://arxiv.org/abs/2411.07975 https://github.com/deepseek-ai/Janus 阅读全文
posted @ 2025-09-15 09:43 jack-chen666 阅读(21) 评论(0) 推荐(0)
摘要: https://janusai.pro/ https://huggingface.co/deepseek-ai/Janus-Pro-7B https://arxiv.org/abs/2501.17811 https://github.com/deepseek-ai/Janus 阅读全文
posted @ 2025-09-15 09:33 jack-chen666 阅读(12) 评论(0) 推荐(0)
摘要: https://arxiv.org/pdf/2505.00703 https://github.com/CaraJ7/T2I-R1 阅读全文
posted @ 2025-09-15 09:29 jack-chen666 阅读(13) 评论(0) 推荐(0)
摘要: https://arxiv.org/abs/2509.08827 https://huggingface.co/papers/2509.08827 阅读全文
posted @ 2025-09-15 09:07 jack-chen666 阅读(26) 评论(0) 推荐(0)
摘要: The Landscape of Agentic Reinforcement__Learning for LLMs.pdf https://medium.com/data-science-in-your-pocket/the-landscape-of-agentic-reinforcement-le 阅读全文
posted @ 2025-09-15 09:06 jack-chen666 阅读(56) 评论(0) 推荐(0)
摘要: https://www.physicalintelligence.company/download/pi05.pdf https://github.com/Physical-Intelligence/openpi https://mp.weixin.qq.com/s/4FwNUULBzMrqEOm9 阅读全文
posted @ 2025-09-15 09:04 jack-chen666 阅读(27) 评论(0) 推荐(0)
摘要: https://mp.weixin.qq.com/s/fwOGuKy2Wtz_xXx3nCT28w 论文题目:LLaDA-VLA: Vision Language Diffusion Action Models 论文链接:https://arxiv.org/abs/2509.06932 项目主页:h 阅读全文
posted @ 2025-09-15 09:00 jack-chen666 阅读(26) 评论(0) 推荐(0)