Coursera, Deep Learning 5, Sequence Models, week4, Transformer Network

 self-attention
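
 The self-attention step is usually written as Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V. Below is a minimal NumPy sketch of that formula for a single sequence; it is an illustration, not code from the course. The names `self_attention`, `W_q`, `W_k`, `W_v` and the toy sizes are assumptions chosen for the example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max along the axis for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, W_q, W_k, W_v):
    """Scaled dot-product self-attention over one sequence.

    X:        (seq_len, d_model) input word representations
    W_q, W_k: (d_model, d_k) learned projections (random here, for illustration)
    W_v:      (d_model, d_v)
    """
    Q = X @ W_q                          # queries, (seq_len, d_k)
    K = X @ W_k                          # keys,    (seq_len, d_k)
    V = X @ W_v                          # values,  (seq_len, d_v)
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)      # (seq_len, seq_len)
    weights = softmax(scores, axis=-1)   # each row sums to 1
    return weights @ V, weights

# Toy sizes, assumed for the example only.
rng = np.random.default_rng(0)
seq_len, d_model, d_k = 5, 8, 4
X = rng.normal(size=(seq_len, d_model))
W_q, W_k, W_v = (rng.normal(size=(d_model, d_k)) for _ in range(3))

A, weights = self_attention(X, W_q, W_k, W_v)
print(A.shape)  # (5, 4)
```

 Each row of `weights` says how much one position attends to every other position, and the matching row of `A` is the resulting weighted sum of value vectors.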

 multi-head attention
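
 Multi-head attention runs several attention computations in parallel, concatenates the heads, and applies an output projection: MultiHead(Q, K, V) = concat(head_1, ..., head_h) W_o. The sketch below is one common way to implement this in NumPy, by splitting a single d_model-wide projection into `num_heads` slices. The function name, the weight names, and the toy sizes are assumptions for illustration rather than the course's own code.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max along the axis for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(X, W_q, W_k, W_v, W_o, num_heads):
    """Multi-head self-attention over one sequence.

    X:                  (seq_len, d_model)
    W_q, W_k, W_v, W_o: (d_model, d_model); d_model must divide by num_heads
    """
    seq_len, d_model = X.shape
    assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
    d_head = d_model // num_heads

    def split_heads(M):
        # (seq_len, d_model) -> (num_heads, seq_len, d_head)
        return M.reshape(seq_len, num_heads, d_head).transpose(1, 0, 2)

    Q = split_heads(X @ W_q)
    K = split_heads(X @ W_k)
    V = split_heads(X @ W_v)

    # Scaled dot-product attention, computed for all heads at once.
    scores = Q @ K.transpose(0, 2, 1) / np.sqrt(d_head)  # (h, seq, seq)
    weights = softmax(scores, axis=-1)
    heads = weights @ V                                   # (h, seq, d_head)

    # Concatenate the heads back to (seq_len, d_model), then project.
    concat = heads.transpose(1, 0, 2).reshape(seq_len, d_model)
    return concat @ W_o

# Toy sizes, assumed for the example only.
rng = np.random.default_rng(0)
seq_len, d_model, num_heads = 5, 8, 2
X = rng.normal(size=(seq_len, d_model))
W_q, W_k, W_v, W_o = (rng.normal(size=(d_model, d_model)) for _ in range(4))

out = multi_head_attention(X, W_q, W_k, W_v, W_o, num_heads)
print(out.shape)  # (5, 8)
```

 Because every head has its own slice of the projections, each one can learn to answer a different "question" about the sequence, while the output keeps the same shape as the input.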

posted @ 2021-11-10 11:16  mashuai_191