12 2024 档案

摘要:LB.4 推理框架 主求解函数(Solver Function) # Solver Function # 主求解函数,循环生成多轮样本并提取最终答案 from tqdm import tqdm num_generations = 2 # 每轮生成 128 个样本,设置总轮次数 def solve(q 阅读全文
posted @ 2024-12-30 13:35 HaibaraYuki 阅读(40) 评论(0) 推荐(0)
摘要:【第一章】绪论 1.数据结构的基本概念 (1)数据结构 \(DataStructure=(D,R)\) \(D\)是\(\underline{数据元素}\)的有限集合,\(R\)是\(D\)上关系的结合 (2)抽象数据结构 \(ADT\) 为整数定义一个抽象数据类型,包含整数的常见运算,每个运算对应 阅读全文
posted @ 2024-12-29 16:53 HaibaraYuki 阅读(132) 评论(0) 推荐(0)
摘要: 阅读全文
posted @ 2024-12-29 15:34 HaibaraYuki 阅读(8) 评论(0) 推荐(0)
摘要:Q-learning(Notebook) environment !apt-get update !apt install -y python3.9 !pip install virtualenv %cd /kaggle/working !virtualenv venv -p $(which pyt 阅读全文
posted @ 2024-12-28 18:51 HaibaraYuki 阅读(19) 评论(0) 推荐(0)
摘要:export KAGGLE_CONFIG_DIR=/home/user/kaggle_config 阅读全文
posted @ 2024-12-28 18:40 HaibaraYuki 阅读(32) 评论(0) 推荐(0)
摘要:各版本python的Notebook !apt-get update !apt install -y python3.9 !pip install virtualenv %cd /kaggle/working !virtualenv venv -p $(which python3.9) # !vir 阅读全文
posted @ 2024-12-28 17:20 HaibaraYuki 阅读(100) 评论(0) 推荐(0)
摘要:LoRA (Low-Rank Adaptation) LoRA官方文档 Qwen2.5-0.5B微调Notebook Data preprocess pip&&import !pip config !pip install modelscope==1.18.0 !pip install transf 阅读全文
posted @ 2024-12-26 13:14 HaibaraYuki 阅读(35) 评论(0) 推荐(0)
摘要:https://zhuanlan.zhihu.com/p/681353195?utm_campaign=shareopn&utm_medium=social&utm_psn=1854692762752008192&utm_source=wechat_session 阅读全文
posted @ 2024-12-24 14:42 HaibaraYuki 阅读(9) 评论(0) 推荐(0)
摘要:https://www.latexlive.com/ https://zhuanlan.zhihu.com/p/702423411 阅读全文
posted @ 2024-12-24 14:41 HaibaraYuki 阅读(6) 评论(0) 推荐(0)
摘要:VMware虚拟机突然连接不上网络 https://blog.csdn.net/dong__ge/article/details/123581117 VMware 虚拟机克隆详细教程 https://blog.csdn.net/weixin_36665875/article/details/1063 阅读全文
posted @ 2024-12-24 13:08 HaibaraYuki 阅读(10) 评论(0) 推荐(0)
摘要:https://github.com/Linjunjie99/RL-LLM-DT 阅读全文
posted @ 2024-12-23 13:54 HaibaraYuki 阅读(14) 评论(0) 推荐(0)
摘要:https://github.com/GAIR-NLP/O1-Journey#about-the-team 阅读全文
posted @ 2024-12-23 13:53 HaibaraYuki 阅读(12) 评论(0) 推荐(0)
摘要:exploring the GPT-2 (124M) OpenAI checkpoint pipeline # https://www.bilibili.com/video/BV12s421u7sZ?spm_id_from=333.788.videopod.sections&vd_source=06 阅读全文
posted @ 2024-12-21 22:33 HaibaraYuki 阅读(91) 评论(0) 推荐(0)
摘要:全概率公式: $$ \quad P(B)=\sum_{i = 1}^{n} P(A_{i}) P(B | A_{i} $$ 阅读全文
posted @ 2024-12-21 00:05 HaibaraYuki 阅读(9) 评论(0) 推荐(0)