chenfengshijie - 博客园 (cnblogs)

September 4, 2024

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model(2024,8)

Summary: Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model(2024,8) Paper TODO: No open-source code has been released yet; keep an eye out for the official code, since Meta's work is usually open-sourced. This post gives a…

posted @ 2024-09-04 23:02 chenfengshijie Views (363) Comments (0) Recommendations (1)

Towards Robust Blind Face Restoration with Codebook Lookup Transformer(NeurIPS 2022) | Codeformer

Summary: Towards Robust Blind Face Restoration with Codebook Lookup Transformer(NeurIPS 2022) This paper tackles blind face restoration, a highly ill-posed task that typically requires auxiliary guidance to…

posted @ 2024-09-04 22:58 chenfengshijie Views (336) Comments (0) Recommendations (0)

RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs(IEEE,2023,8)

Summary: RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs(IEEE,2023,8) Paper GitHub Motivation: the authors argue that earlier models attend only to image texture information while overlooking facial detail…

posted @ 2024-09-04 22:55 chenfengshijie Views (106) Comments (0) Recommendations (0)

August 19, 2024

ControlNeXt: Powerful and Efficient Control for Image and Video Generation(2024,8)

Summary: ControlNeXt: Powerful and Efficient Control for Image and Video Generation(2024,8) Paper GitHub A further improvement on ControlNet, mainly targeting the following two points: adding a Zero-Conv to every module also takes up…

posted @ 2024-08-19 21:22 chenfengshijie Views (143) Comments (0) Recommendations (0)

July 17, 2024

ChatGLM

Summary: Paper reading: ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.

posted @ 2024-07-17 15:40 chenfengshijie Views (348) Comments (0) Recommendations (0)

June 13, 2024

VideoGeneration

Summary: Some papers on video generation that I have read.

posted @ 2024-06-13 22:08 chenfengshijie Views (295) Comments (0) Recommendations (0)

January 23, 2024

A Quick Overview of Papers on Generative Modeling

Summary: Notes on the generative-modeling papers I have read.