标签: ViT
- 05 Jan 2023 DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
- 28 Dec 2022 What do Vision Transformers Learn? A Visual Exploration
- 27 Dec 2022 Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
- 27 Dec 2022 Training data-efficient image transformers & distillation through attention