vit_b_16
-
torchvision.models.
vit_b_16
(pretrained: bool = False, progress: bool = True, **kwargs: Any) → torchvision.models.vision_transformer.VisionTransformer[source] Constructs a vit_b_16 architecture from “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale”.