Summary: Pre title: Replacing softmax with ReLU in Vision Transformers accepted: arXiv 2023 paper: https://export.arxiv.org/abs/2309.08586 code: None Keywords: atten Read more
posted @ 2023-12-12 10:52 NoNoe Views (369) Comments (0) Recommendations (0)
Summary: tl;dr: PyTorch's torch.optim.lr_scheduler.OneCycleLR works well: it covers both warmup and cosine learning-rate annealing, and needs no extra package. import torch from torch.optim.lr_scheduler import CosineAnnealingLR Read more
posted @ 2023-12-04 15:57 NoNoe Views (1600) Comments (0) Recommendations (0)
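The claim that OneCycleLR covers both warmup and cosine decay can be checked with a short sketch. This is not the post's own code (the summary is truncated before it appears); the toy model, the SGD optimizer, and the values `max_lr=0.1`, `total_steps=100`, and `pct_start=0.3` are assumptions chosen for illustration.

```python
import torch
from torch.optim.lr_scheduler import OneCycleLR

# Toy model and optimizer (assumed for illustration only).
model = torch.nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

total_steps = 100
scheduler = OneCycleLR(
    optimizer,
    max_lr=0.1,             # peak learning rate reached at the end of warmup
    total_steps=total_steps,
    pct_start=0.3,          # first 30% of the steps ramp the rate up
    anneal_strategy="cos",  # cosine shape for both the ramp and the decay
)

lrs = []
for _ in range(total_steps):
    optimizer.zero_grad()
    loss = model(torch.randn(8, 4)).pow(2).mean()
    loss.backward()
    optimizer.step()
    lrs.append(optimizer.param_groups[0]["lr"])  # rate used for this step
    scheduler.step()                             # step once per batch
```

With the default `div_factor=25`, the schedule starts at `max_lr / 25`, climbs to `max_lr` over the warmup portion, and then anneals to a value well below the starting rate. The scheduler is stepped once per batch rather than once per epoch, so `total_steps` should equal the number of batches in the whole run.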
Summary: Pre title: Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning accepted: ICCV 2023 paper: https://arxiv.org/abs Read more
posted @ 2023-12-04 15:55 NoNoe Views (497) Comments (0) Recommendations (0)
Summary: Pre title: R-Drop: Regularized Dropout for Neural Networks accepted: NeurIPS 2021 paper: https://arxiv.org/abs/2106.14448 code: https://github.com/dro Read more
posted @ 2023-12-01 10:59 NoNoe Views (397) Comments (0) Recommendations (0)
Summary: 1. Pre title: Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference accepted: arXiv 2023 (ICLR 2024 Submission) paper Read more
posted @ 2023-11-12 10:24 NoNoe Views (3021) Comments (0) Recommendations (1)
Summary: Pre title: EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling accepted: arXiv 2023 paper: https://arxiv.org/abs/2310.04691 co Read more
posted @ 2023-11-07 18:28 NoNoe Views (428) Comments (0) Recommendations (0)
Summary: Pre title: Painterly Image Harmonization using Diffusion Model accepted: AAAI 2023 paper: https://arxiv.org/abs/2212.08846 code: https://github.com/bcm Read more
posted @ 2023-11-03 18:07 NoNoe Views (450) Comments (0) Recommendations (0)
Summary: Pre title: Disentangling Writer and Character Styles for Handwriting Generation accepted: CVPR 2023 paper: https://arxiv.org/abs/2303.14736 code: https Read more
posted @ 2023-10-25 18:35 NoNoe Views (1035) Comments (0) Recommendations (1)
Summary: Pre title: SimCSE: Simple Contrastive Learning of Sentence Embeddings accepted: EMNLP 2021 paper: https://arxiv.org/abs/2104.08821 code: https://githu Read more
posted @ 2023-10-23 10:45 NoNoe Views (318) Comments (0) Recommendations (0)
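Summary: prologue title: [pytorch] Freezing part of a model's parameters during training: module.requires_grad_(False) The code uses a decoder \(dec\). The goal is to have it predict the counting encode of the generated result \(g\) and use that prediction to compute a loss, which constrains the generator to produce reasonable results (ones that decode to the correct Read more
posted @ 2023-10-17 19:59 NoNoe Views (716) Comments (0) Recommendations (0)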
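A minimal sketch of the freezing pattern this post describes, assuming stand-in modules: `gen` and `dec` here are hypothetical `nn.Linear` layers and the MSE target is random, since the summary is cut off before the post's real generator, decoder, and loss are shown. Only `module.requires_grad_(False)` comes from the post title.

```python
import torch
from torch import nn

gen = nn.Linear(4, 4)  # hypothetical stand-in for the generator
dec = nn.Linear(4, 2)  # hypothetical stand-in for the frozen decoder

# Freeze every parameter of the decoder in place.
dec.requires_grad_(False)
dec_weight_before = dec.weight.clone()

# Only the generator's parameters are handed to the optimizer.
optimizer = torch.optim.SGD(gen.parameters(), lr=0.1)

x = torch.randn(8, 4)
target = torch.randn(8, 2)  # placeholder for the expected encoding

g = gen(x)                                     # generated result
loss = nn.functional.mse_loss(dec(g), target)  # loss computed through dec

optimizer.zero_grad()
loss.backward()  # gradients flow through dec back into gen
optimizer.step()
```

The decoder still takes part in the forward and backward pass, so the generator receives gradients through it, but the decoder's own parameters accumulate no gradient and stay unchanged. If the decoder contains dropout or batch-norm layers, calling `dec.eval()` as well is probably wanted, because `requires_grad_(False)` does not change their train-mode behavior.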