SwinTransformer
The SwinTransformer models are based on the Swin Transformer: Hierarchical Vision Transformer using Shifted Windows paper. SwinTransformer V2 models are based on the Swin Transformer V2: Scaling Up Capacity and Resolution paper.
Model builders
The following model builders can be used to instantiate a SwinTransformer model (original or V2), with or without pre-trained weights.
All the model builders internally rely on the torchvision.models.swin_transformer.SwinTransformer
base class. Please refer to the source code for
more details about this class.
| swin_t | Constructs a swin_tiny architecture from Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. |
| swin_s | Constructs a swin_small architecture from Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. |
| swin_b | Constructs a swin_base architecture from Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. |
| swin_v2_t | Constructs a swin_v2_tiny architecture from Swin Transformer V2: Scaling Up Capacity and Resolution. |
| swin_v2_s | Constructs a swin_v2_small architecture from Swin Transformer V2: Scaling Up Capacity and Resolution. |
| swin_v2_b | Constructs a swin_v2_base architecture from Swin Transformer V2: Scaling Up Capacity and Resolution. |