霜尘FrostDust - 博客园 (cnblogs)

[Pinned] Paper Quick Reads | 2026

Summary: Dec. 22-Dec. 28 reading list: mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs; Unified Vision-Language-Action Model; Large Vid Read more

posted @ 2025-12-22 10:32 霜尘FrostDust Views (20) Comments (0) Recommendations (0)

[Pinned] Paper Quick Reads | October 2025

Summary: Mastering the game of Go with deep neural networks and tree search (AlphaGo, 2016): networks trained on human data -> self-play reinforcement learning -> MCTS (PUCT). Mastering the game of Go without hu Read more

posted @ 2025-10-09 11:06 霜尘FrostDust Views (39) Comments (0) Recommendations (1)

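[Pinned] Paper Quick Reads | September 2025

Summary: What Can RL Bring to VLA Generalization? An Empirical Study. arXiv. An MLP is attached to the last layer of the VLA model to obtain a Q-value, so that reinforcement learning algorithms such as PPO can be used for fine-tuning. PPO outperforms DPO, GRPO, and others. RL fine-tuning improves the VLA's generalization. Sho Read more

posted @ 2025-09-03 21:52 霜尘FrostDust Views (43) Comments (0) Recommendations (0)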

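[Pinned] Server operation commands

Summary: Research group server operation guide 1 (document); research group server operation guide 24; server management guide 21; configuring an intranet Linux server to access the external network; connecting PyCharm and Jupyter over SSH; Docker containers; setting up a VNC remote desktop: vncserver -kill :1 (ends the session) vncserver -localhost no :1 -geo Read more

posted @ 2025-02-21 15:00 霜尘FrostDust Views (50) Comments (0) Recommendations (0)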

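[Pinned] Common commands

Summary: nano: Linux commands: nano. vim: Linux commands: vim. Installing and switching between multiple GCC compiler versions on Ubuntu: [reference guide](https://www.sysgeek.cn/ubuntu-install-gcc-compiler/) Read more

posted @ 2025-02-18 17:20 霜尘FrostDust Views (21) Comments (0) Recommendations (0)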

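February 23, 2026

Trying out Codex

Summary: Codex API source (right code): https://right.codes/register?aff=7de4258d (referral link; the author receives a rebate). codex-cli installation tutorial. Usage log: using Codex to create a simplified version of Diffusion Policy; interactive dialogue; adding code visualization; the created files; running the code. Read more

posted @ 2026-02-23 22:25 霜尘FrostDust Views (24) Comments (0) Recommendations (0)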

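February 2, 2026

Fixing Wi-Fi hardware not being detected after rebooting Ubuntu 24 / Windows 11

Summary: System configuration: Windows 11 + Ubuntu 24.04 dual boot; motherboard: MSI Pro Z790-A Max WiFi. Problem description: I first ran code overnight on Ubuntu 24, then left the machine on for two more days without the screen turning off. Today, shortly after I started working in the lab, the computer froze, so I performed a safe Ubuntu reboot (hold Ctrl+Alt without releasing and press, in order, PrtSc, R, E, Read more

posted @ 2026-02-02 15:31 霜尘FrostDust Views (85) Comments (0) Recommendations (0)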

November 17, 2025

Paper Quick Reads | November 2025

Summary: FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies (CoRL 2025, project). Object-Centric Latent Action Lea Read more

posted @ 2025-11-17 21:23 霜尘FrostDust Views (29) Comments (0) Recommendations (0)

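August 22, 2025

Paper Reading Notes | August 2025

Summary: XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning. Keywords: ICRL; ICLR 2025; dunnolab work. Necessity: ICRL needs training data that is both large enough and reasonably complex, Read more

posted @ 2025-08-22 21:15 霜尘FrostDust Views (28) Comments (0) Recommendations (1)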

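July 21, 2025

How I fixed the Wi-Fi and GPU drivers breaking after a kernel update on Ubuntu 24.04

Summary: Background: I installed Ubuntu 24 as a dual-boot system myself. After half a month of normal use, the Ubuntu desktop popped up a prompt saying a restart was needed to finish an update. After restarting, the external monitor showed nothing, and running nvidia-smi in a terminal printed “NVIDIA-SMI has failed because it couldn’t communicate with the Read more

posted @ 2025-07-21 19:46 霜尘FrostDust Views (2816) Comments (0) Recommendations (1)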

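July 16, 2025

Pitfalls of installing an Ubuntu 24 dual boot on a Windows PC with a 50-series GPU

Summary: My setup: i9-14900K + RTX 5060 Ti, with Windows 11 already installed. Note!! To dual-boot Ubuntu on a Windows PC with a 50-series GPU, the only Ubuntu version you can choose is Ubuntu 24.04 (as of this writing, July 2025); trying to install Ubuntu 22 produces the following error: `What I’ve Tried: 1、a Read more

posted @ 2025-07-16 11:20 霜尘FrostDust Views (590) Comments (1) Recommendations (0)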

July 3, 2025

Paper Quick-Read Log | July 2025

Summary: Decision Transformret-action space. In-Context Reinforcement Learning for Variable Action Spaces. Source: ICML 2024, arXiv, OpenReview. Motivation: classic ICRL architectures such as AD and D Read more

posted @ 2025-07-03 10:25 霜尘FrostDust Views (33) Comments (0) Recommendations (0)
