摘要:
Transformer in Computer Vision 2020-12-03 19:18:25 Survey 1: A Survey on Visual Transformer, Kai Han, et al. [Paper] Survey 2: Transformers in Vision:
阅读全文
posted @ 2020-12-03 19:45
AHU-WangXiao
阅读(1975)
推荐(1)
posted @ 2019-06-13 17:17
AHU-WangXiao
阅读(110)
推荐(0)
摘要:
Summary on deep learning framework PyTorch Updated on 2018-07-22 21:25:42 import osos.environ["CUDA_VISIBLE_DEVICES"]="4" export CUDA_VISIBLE_DEVICES=
阅读全文
posted @ 2017-08-13 16:07
AHU-WangXiao
阅读(6170)
推荐(2)
posted @ 2017-06-06 15:51
AHU-WangXiao
阅读(11)
推荐(0)
摘要:
Latex 经常见到的问题和解决方法 2017-04-10 22:05:48 【资源下载】 1). Texlive 2021 下载地址:https://mirrors.sjtug.sjtu.edu.cn/ctan/systems/texlive/Images/ 2). AweSome LaTex:
阅读全文
posted @ 2017-04-10 22:07
AHU-WangXiao
阅读(12072)
推荐(2)
摘要:
Conclusions about Deep Learning with Python file_path = '{}/{}_ep{:04d}.pth.tar'.format(directory, net_type, self.epoch) 0. Install the specific versi
阅读全文
posted @ 2017-03-01 16:17
AHU-WangXiao
阅读(1541)
推荐(0)
摘要:
Matlab 进阶学习记录 Error: Invalid MEX-file '/media/wangxiao/Acer/dataset/LDES/utility/mexfiles/mpolar.mexa64': /usr/local/MATLAB/R2017a/bin/glnxa64/../../s
阅读全文
posted @ 2016-07-31 16:32
AHU-WangXiao
阅读(3386)
推荐(0)
posted @ 2016-06-27 17:28
AHU-WangXiao
阅读(25)
推荐(0)
摘要:
Ubuntu + Deep Learning (Caffe, PyTorch) 配置参考文献 sudo apt install nvidia-cuda-toolkit pip install gpustat watch --color -n1 gpustat -cpu [Note]: the RTX
阅读全文
posted @ 2016-04-13 10:11
AHU-WangXiao
阅读(28216)
推荐(0)
posted @ 2022-10-04 10:31
AHU-WangXiao
阅读(34)
推荐(0)
摘要:
Weakly Alignment-Free RGBT Salient Object Detection With Deep Correlation Network 2022-07-23 19:27:08 Paper: IEEE Xplore Full-Text PDF: 1. Background
阅读全文
posted @ 2022-07-23 19:28
AHU-WangXiao
阅读(254)
推荐(0)
摘要:
Visual Prompt Tuning 2022-07-16 19:13:50 Paper: [2203.12119] Visual Prompt Tuning (arxiv.org) Code: KMnP/vpt: 🔥 Visual Prompt Tuning [ECCV 2022] http
阅读全文
posted @ 2022-07-16 20:40
AHU-WangXiao
阅读(1185)
推荐(0)
摘要:
ActionCLIP: A New Paradigm for Video Action Recognition 2022-07-02 17:38:37 Paper: 2109.08472.pdf (arxiv.org) Code: https://github.com/sallymmx/Action
阅读全文
posted @ 2022-07-02 17:39
AHU-WangXiao
阅读(468)
推荐(0)
摘要:
opencv4.6.0 + rtx2070 + ubuntu16.04 install tutorial ref-1: https://blog.csdn.net/qvodgg/article/details/108410549 ref-2: https://zhuanlan.zhihu.com/p
阅读全文
posted @ 2022-06-25 10:37
AHU-WangXiao
阅读(181)
推荐(0)
摘要:
AEGNN: Asynchronous Event-based Graph Neural Networks 2022-06-07 17:01:45 Paper: https://rpg.ifi.uzh.ch/docs/CVPR22_Schaefer.pdf Code: https://uzh-rpg
阅读全文
posted @ 2022-06-07 17:03
AHU-WangXiao
阅读(144)
推荐(0)
posted @ 2022-06-03 20:29
AHU-WangXiao
阅读(2)
推荐(0)
posted @ 2022-04-12 20:50
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2022-04-06 14:46
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2022-04-02 10:41
AHU-WangXiao
阅读(1)
推荐(0)
摘要:
VL-BERT: PRE-TRAINING OF GENERIC VISUALLINGUISTIC REPRESENTATIONS 2022-03-30 20:35:13 Paper: https://openreview.net/forum?id=SygXPaEYvH Code: https://
阅读全文
posted @ 2022-03-30 20:37
AHU-WangXiao
阅读(78)
推荐(0)
摘要:
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training 2022-03-22 14:22:12 Paper: https://ojs.aaai.org/index.php/AAAI/ar
阅读全文
posted @ 2022-03-22 14:23
AHU-WangXiao
阅读(357)
推荐(0)
摘要:
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions 2022-03-20 17:34:51 Paper: https://arxiv.org/pdf/2010.12831.pdf Cod
阅读全文
posted @ 2022-03-20 17:38
AHU-WangXiao
阅读(140)
推荐(0)
摘要:
Visualbert: A simple and performant baseline for vision and language 2022-03-20 15:19:04 Paper: https://arxiv.org/pdf/1908.03557 1. Background and Mot
阅读全文
posted @ 2022-03-20 15:27
AHU-WangXiao
阅读(427)
推荐(0)
摘要:
Fusion of Detected Objects in Text for Visual Question Answering 2022-03-18 16:29:58 Paper: https://aclanthology.org/D19-1219/ Code: https://github.co
阅读全文
posted @ 2022-03-18 16:31
AHU-WangXiao
阅读(89)
推荐(0)
摘要:
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation 2022-03-18 10:04:06 Paper: https://proceedings.neurips.cc/pa
阅读全文
posted @ 2022-03-18 10:13
AHU-WangXiao
阅读(1263)
推荐(0)
摘要:
ActBERT: Learning Global-Local Video-Text Representations 2022-03-17 16:41:43 Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Zhu_ActBERT
阅读全文
posted @ 2022-03-17 16:51
AHU-WangXiao
阅读(160)
推荐(0)
摘要:
12-in-1: Multi-Task Vision and Language Representation Learning 2022-03-17 09:45:41 Paper: https://openaccess.thecvf.com/content_CVPR_2020/papers/Lu_1
阅读全文
posted @ 2022-03-17 14:28
AHU-WangXiao
阅读(318)
推荐(0)
摘要:
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision 2022-03-16 21:02:21 Paper: http://proceedings.mlr.press/v139
阅读全文
posted @ 2022-03-16 21:20
AHU-WangXiao
阅读(643)
推荐(0)
posted @ 2022-03-07 19:55
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-12-19 19:16
AHU-WangXiao
阅读(0)
推荐(0)
摘要:
Capsule-based Object Tracking with Natural Language Specification 2021-12-18 19:28:39 Paper: https://dl.acm.org/doi/abs/10.1145/3474085.3475349 1. Bac
阅读全文
posted @ 2021-12-18 19:31
AHU-WangXiao
阅读(222)
推荐(0)
摘要:
CLIP: Learning Transferable Visual Models From Natural Language Supervision 2021-11-25 21:29:02 Paper: https://arxiv.org/pdf/2103.00020.pdf Code: http
阅读全文
posted @ 2021-11-25 21:30
AHU-WangXiao
阅读(145)
推荐(0)
posted @ 2021-11-02 18:42
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-10-28 17:43
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-10-20 22:02
AHU-WangXiao
阅读(0)
推荐(0)
摘要:
CPT: COLORFUL PROMPT TUNING FOR PRE-TRAINED VISION-LANGUAGE MODELS 2021-09-28 11:41:22 Paper: https://arxiv.org/pdf/2109.11797.pdf Other blog: https:/
阅读全文
posted @ 2021-09-28 11:43
AHU-WangXiao
阅读(1068)
推荐(0)
posted @ 2021-09-12 11:16
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-08-31 10:36
AHU-WangXiao
阅读(0)
推荐(0)
摘要:
Representation Learning for Event-based Visuomotor Policies 2021-08-30 10:15:07 Paper: https://arxiv.org/pdf/2103.00806.pdf Code: https://github.com/m
阅读全文
posted @ 2021-08-30 10:19
AHU-WangXiao
阅读(159)
推荐(0)
posted @ 2021-08-18 09:38
AHU-WangXiao
阅读(3)
推荐(0)
posted @ 2021-08-11 20:31
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-08-10 21:02
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-08-07 20:31
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-08-04 21:45
AHU-WangXiao
阅读(0)
推荐(0)
posted @ 2021-07-29 11:10
AHU-WangXiao
阅读(5)
推荐(0)
摘要:
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text 2021-07-22 08:54:20 Paper: https://arxiv.org/pdf/2104.11178.
阅读全文
posted @ 2021-07-22 11:38
AHU-WangXiao
阅读(1170)
推荐(0)
摘要:
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation 2021-07-21 20:23:07 Paper: https://arxiv.org/pdf/2107.00249.pdf Code: No
阅读全文
posted @ 2021-07-21 20:34
AHU-WangXiao
阅读(1105)
推荐(0)
摘要:
AST: Audio Spectrogram Transformer 2021-07-21 19:38:36 Paper: https://arxiv.org/pdf/2104.01778.pdf Code: https://github.com/YuanGongND/ast 1. Backgrou
阅读全文
posted @ 2021-07-21 20:14
AHU-WangXiao
阅读(1542)
推荐(0)
摘要:
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts 2021-07-20 08:58:37 Paper: cvpr2021 Code: https://git
阅读全文
posted @ 2021-07-20 09:50
AHU-WangXiao
阅读(391)
推荐(0)