2023年4月6日

segment anything

摘要: What is the structure of the model? A ViT-H image encoder that runs once per image and outputs an image embedding A prompt encoder that embeds input p 阅读全文

posted @ 2023-04-06 22:51 MissSimple 阅读(218) 评论(0) 推荐(0) 编辑

导航