2025 年 8月 20 日随笔档案 - Luna-Evelyn

2025年8月20日

摘要： GPT-2 文章中指出监督学习的核心弱点：脆弱性与敏感性，监督学习在训练数据分布上表现优异，但是数据分布一旦稍有变化，则性能急剧下降，这样训练出来的系统称为Narrow Expert，单任务单领域的训练范式无法进行举一反三的泛化功能。因此，文章主要宣传的是下游任务中Zero-shot的思想文章中指阅读全文

posted @ 2025-08-20 22:43 Luna-Evelyn 阅读(16) 评论(0) 推荐(0)

GPT-1技术报告

摘要： GPT-1(Generative Pre-Training) 1、模型结构：OpenAI由2018年介绍了一种名为“生成式预训练”（Generative Pre-Training，简称GPT）的新型语言模型，该模型通过在大规模语料库上进行训练，能够学习自然语言的模式和规律，从而实现更好的语言理解 G 阅读全文

posted @ 2025-08-20 00:03 Luna-Evelyn 阅读(17) 评论(0) 推荐(0)

The Blog

Do not go gentle into that good night.
Old age should burn and rave at close of day.
Rage, rage against the dying light.

公告

The Blog

Do not go gentle into that good night. Old age should burn and rave at close of day. Rage, rage against the dying light.

公告

Do not go gentle into that good night.
Old age should burn and rave at close of day.
Rage, rage against the dying light.