[10] CrossViT: Cross-Attention Multi-Scale Vision Transformer for ImageClassification
·
Paper Review/etc
[Paper] https://openaccess.thecvf.com//content/ICCV2021/papers/Chen_CrossViT_Cross-Attention_Multi-Scale_Vision_Transformer_for_Image_Classification_ICCV_2021_paper.pdf [Github] https://github.com/IBM/CrossViT GitHub - IBM/CrossViT: Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 - GitHub - IBM/CrossViT: ..
[9] Supervised Contrastive Learning
·
Paper Review/etc
[Paper] https://arxiv.org/pdf/2004.11362.pdf [Github] https://github.com/HobbitLong/SupContrast
[8] MOBILEVIT: LIGHT-WEIGHT, GENERAL-PURPOSE,AND MOBILE-FRIENDLY VISION TRANSFORMER
·
Paper Review/etc
[Paper] https://arxiv.org/pdf/2110.02178.pdf [Github] https://github.com/apple/ml-cvnets GitHub - apple/ml-cvnets: CVNets: A library for training computer vision networks CVNets: A library for training computer vision networks - GitHub - apple/ml-cvnets: CVNets: A library for training computer vision networks github.com 이번 포스팅에서는 mobileViT 논문에 대해 리뷰하겠습니다 :) Abstract 비전 분야에서 많이 쓰이는 대표적인 신경망 중 하나가..
[7] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
·
Paper Review/etc
[Paper] https://arxiv.org/pdf/2010.11929.pdf [Github] https://github.com/google-research/vision_transformer GitHub - google-research/vision_transformer Contribute to google-research/vision_transformer development by creating an account on GitHub. github.com 이번 포스팅에서는 자연어처리 분야에서 눈부신 성능 발전을 가져온 Transformer 구조를 비전 분야에도 적용한 Vision Transformer(ViT) 에 대해 설명하겠습니다. Transformer의 전체적인 구조와 작동과정은 아래 두 글을 참고..