MiraMira - cnblogs (博客园)

March 15, 2024

【Coursera GenAI with LLM】 Week 3 Reinforcement Learning from Human Feedback Class Notes

Summary: Helpful? Honest? Harmless? Make sure the AI responds in those 3 ways. If not, we need RLHF to reduce the toxicity of the LLM. Reinforcement learning: is a ... Read more

posted @ 2024-03-15 12:15 MiraMira Views (51) Comments (0) Recommendations (0)

March 14, 2024

【Coursera GenAI with LLM】 Week 2 PEFT Class Notes

Summary: With PEFT, we only train a small portion of the parameters! What uses memory while training a model? Trainable weights, optimizer states, gradients, Forwar... Read more

posted @ 2024-03-14 11:04 MiraMira Views (61) Comments (0) Recommendations (0)

March 13, 2024

【Coursera GenAI with LLM】 Week 2 Fine-tuning LLMs with instruction Class Notes

Summary: GenAI project lifecycle: after picking a pre-trained model, we can fine-tune! In-context learning (ICL): zero-shot, one-shot, and few-shot inference. Including a f... Read more

posted @ 2024-03-13 17:04 MiraMira Views (60) Comments (0) Recommendations (0)

Mira's Blog
