
PyTorch Hub For Researchers

Explore and extend models from the latest cutting-edge research.

Discover and publish models to a pre-trained model repository designed for research exploration. Check out the models for Researchers, learn How It Works, or Contribute Models.

This is a beta release: we will be collecting feedback and improving the PyTorch Hub over the coming months.
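Every entry below is loaded through the same torch.hub API. A minimal sketch of the workflow, using torchvision's hub entrypoints as an example (the 'pretrained' keyword applies to older torchvision releases; newer ones use a 'weights' argument instead):

    import torch

    # List the entrypoints a repository exposes through its hubconf.py.
    entrypoints = torch.hub.list('pytorch/vision')
    print(entrypoints[:5])

    # Load a pre-trained model by repository and entrypoint name.
    # 'pretrained=True' is the older torchvision keyword; recent releases
    # accept a 'weights' argument instead.
    model = torch.hub.load('pytorch/vision', 'resnet50', pretrained=True)
    model.eval()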


PyTorch-Transformers

PyTorch implementations of popular NLP Transformers

144.7k
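A hedged sketch of loading a BERT checkpoint through the Hub entrypoints this repository exposes; the 'tokenizer' and 'model' entrypoint names and the checkpoint identifier follow the pytorch-transformers hub listing and may differ between releases:

    import torch

    # Entrypoint and checkpoint names follow the repository's hubconf;
    # they may change between releases.
    tokenizer = torch.hub.load('huggingface/pytorch-transformers', 'tokenizer', 'bert-base-uncased')
    model = torch.hub.load('huggingface/pytorch-transformers', 'model', 'bert-base-uncased')
    model.eval()

    # Encode a sentence and extract the hidden states.
    input_ids = torch.tensor([tokenizer.encode("Hello, PyTorch Hub!")])
    with torch.no_grad():
        outputs = model(input_ids)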

YOLOv5

Ultralytics YOLOv5 🚀 for object detection, instance segmentation and image classification.

53.9k
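A minimal detection sketch, assuming the 'yolov5s' entrypoint from the ultralytics/yolov5 hubconf; the image path is a placeholder:

    import torch

    # 'yolov5s' is the small pre-trained detection model; other sizes
    # ('yolov5m', 'yolov5l', ...) follow the same pattern.
    model = torch.hub.load('ultralytics/yolov5', 'yolov5s')

    # Inference accepts file paths, URLs, PIL images or numpy arrays.
    results = model('path/to/image.jpg')  # placeholder image path
    results.print()  # print a summary of the detections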

Transformer (NMT)

Transformer models for English-French and English-German translation.

31.5k
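A hedged sketch of English-French translation through fairseq's hub interface; the checkpoint name and tokenizer/BPE options follow the fairseq hub listing and may vary by release:

    import torch

    # Checkpoint name, tokenizer and BPE settings follow the fairseq hub
    # listing; other English-French / English-German checkpoints exist.
    en2fr = torch.hub.load('pytorch/fairseq', 'transformer.wmt14.en-fr',
                           tokenizer='moses', bpe='subword_nmt')
    en2fr.eval()

    print(en2fr.translate('Hello world!'))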

RoBERTa

A Robustly Optimized BERT Pretraining Approach

31.5k
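A minimal sketch of feature extraction with the fairseq RoBERTa hub interface, assuming the 'roberta.large' checkpoint name:

    import torch

    # 'roberta.large' and 'roberta.base' are the checkpoint names used by
    # the fairseq hub entrypoints.
    roberta = torch.hub.load('pytorch/fairseq', 'roberta.large')
    roberta.eval()

    # Encode a sentence with RoBERTa's BPE and extract features.
    tokens = roberta.encode('Hello world!')
    features = roberta.extract_features(tokens)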

Deeplabv3

DeepLabV3 models with ResNet-50, ResNet-101 and MobileNet-V3 backbones

16.8k
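A minimal segmentation sketch, assuming the 'deeplabv3_resnet50' entrypoint from the torchvision hubconf; the input batch here is a random placeholder:

    import torch

    # Segmentation entrypoints in the torchvision hubconf include
    # 'deeplabv3_resnet50' and 'deeplabv3_resnet101'.
    model = torch.hub.load('pytorch/vision', 'deeplabv3_resnet50', pretrained=True)
    model.eval()

    # Torchvision segmentation models return a dict; 'out' holds the
    # per-pixel class scores.
    batch = torch.rand(1, 3, 520, 520)  # placeholder input batch
    with torch.no_grad():
        output = model(batch)['out']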

AlexNet

The 2012 ImageNet winner achieved a top-5 error of 15.3%, more than 10.8 percentage points lower than that of the runner-up.

16.8k

SqueezeNet

AlexNet-level accuracy with 50x fewer parameters.

16.8k

Densenet

Dense Convolutional Network (DenseNet) connects each layer to every other layer in a feed-forward fashion.

16.8k

vgg-nets

Award-winning ConvNets from the 2014 ImageNet ILSVRC challenge

16.8k

FCN

Fully-Convolutional Network model with ResNet-50 and ResNet-101 backbones

16.8k

Wide ResNet

Wide Residual Networks

16.8k

GoogLeNet

GoogLeNet was based on a deep convolutional neural network architecture codenamed “Inception” which won ImageNet 2014.

16.8k

Inception_v3

Also called GoogLeNet v3, a well-known ConvNet from 2015 trained on ImageNet

16.8k

MobileNet v2

Efficient networks optimized for speed and memory, with residual blocks

16.8k

ResNet

Deep residual networks pre-trained on ImageNet

16.8k

ResNext

Next generation ResNets, more efficient and accurate

16.8k

ShuffleNet v2

An efficient ConvNet optimized for speed and memory, pre-trained on ImageNet

16.8k
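The torchvision ImageNet classifiers listed above share one loading and preprocessing pattern. A minimal sketch with ResNet-18 and the standard ImageNet normalization constants; the image path is a placeholder and the 'pretrained' keyword applies to older torchvision releases:

    import torch
    from PIL import Image
    from torchvision import transforms

    model = torch.hub.load('pytorch/vision', 'resnet18', pretrained=True)
    model.eval()

    # Standard ImageNet preprocessing used by the torchvision classifiers.
    preprocess = transforms.Compose([
        transforms.Resize(256),
        transforms.CenterCrop(224),
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])

    img = Image.open('path/to/image.jpg')  # placeholder image path
    batch = preprocess(img).unsqueeze(0)

    with torch.no_grad():
        probs = torch.nn.functional.softmax(model(batch)[0], dim=0)
    print(probs.topk(5))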

FastPitch 2

The FastPitch model for generating mel spectrograms from text

14.3k

GPUNet

GPUNet is a new family of convolutional neural networks designed to maximize the performance of NVIDIA GPUs and TensorRT.

14.3k

EfficientNet

EfficientNets are a family of image classification models that achieve state-of-the-art accuracy while being an order of magnitude smaller and faster. Trained with mixed precision using Tensor Cores.

14.3k

HiFi GAN

The HiFi GAN model for generating waveforms from mel spectrograms

14.3k

ResNet50

ResNet50 model trained with mixed precision using Tensor Cores.

14.3k

ResNeXt101

ResNet with the bottleneck 3×3 convolutions replaced by 3×3 grouped convolutions, trained with mixed precision using Tensor Cores.

14.3k

SE-ResNeXt101

ResNeXt with Squeeze-and-Excitation module added, trained with mixed precision using Tensor Cores.

14.3k

SSD

Single Shot MultiBox Detector model for object detection

14.3k

Tacotron 2

The Tacotron 2 model for generating mel spectrograms from text

14.3k

WaveGlow

WaveGlow model for generating speech from mel spectrograms (generated by Tacotron 2)

14.3k
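The Tacotron 2 and WaveGlow entries above form a text-to-speech pipeline: text to mel spectrogram, then mel spectrogram to waveform. A hedged sketch; the entrypoint names follow NVIDIA's hub listing, and the 'nvidia_tts_utils' helper and its prepare_input_sequence function are assumptions that may differ between releases:

    import torch

    # Entrypoint names follow NVIDIA's Hub listing; 'nvidia_tts_utils' and
    # its prepare_input_sequence helper are assumptions and may differ
    # between releases of the DeepLearningExamples repository.
    hub_repo = 'NVIDIA/DeepLearningExamples:torchhub'
    tacotron2 = torch.hub.load(hub_repo, 'nvidia_tacotron2')
    waveglow = torch.hub.load(hub_repo, 'nvidia_waveglow')
    utils = torch.hub.load(hub_repo, 'nvidia_tts_utils')

    tacotron2.eval()
    waveglow.eval()

    # Text -> mel spectrogram (Tacotron 2) -> waveform (WaveGlow).
    sequences, lengths = utils.prepare_input_sequence(["Hello from PyTorch Hub."])
    with torch.no_grad():
        mel, _, _ = tacotron2.infer(sequences, lengths)
        audio = waveglow.infer(mel)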

Silero Voice Activity Detector

Pre-trained Voice Activity Detector

5.9k
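A hedged sketch of running the voice activity detector; the helper tuple layout follows the silero-vad README and may change, and the audio path is a placeholder:

    import torch

    # The entrypoint returns the model together with helper functions;
    # the exact tuple layout follows the silero-vad README.
    model, utils = torch.hub.load('snakers4/silero-vad', 'silero_vad')
    (get_speech_timestamps, save_audio, read_audio,
     VADIterator, collect_chunks) = utils

    wav = read_audio('path/to/speech.wav', sampling_rate=16000)  # placeholder path
    speech_timestamps = get_speech_timestamps(wav, model, sampling_rate=16000)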

Silero Speech-To-Text Models

A set of compact enterprise-grade pre-trained STT Models for multiple languages.

5.3k

Silero Text-To-Speech Models

A set of compact enterprise-grade pre-trained TTS Models for multiple languages

5.3k

MiDaS

MiDaS models for computing relative depth from a single image.

4.9k
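A minimal sketch, assuming the 'MiDaS_small' and 'transforms' entrypoints from the MiDaS hubconf; the larger 'DPT_Large' and 'DPT_Hybrid' variants load the same way:

    import torch

    # 'MiDaS_small' trades accuracy for speed; larger variants are listed
    # in the MiDaS hubconf.
    midas = torch.hub.load('intel-isl/MiDaS', 'MiDaS_small')
    midas.eval()

    # The repository also exposes matching input transforms through the Hub.
    midas_transforms = torch.hub.load('intel-isl/MiDaS', 'transforms')
    transform = midas_transforms.small_transform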

GhostNet

Efficient networks by generating more features from cheap operations

4.2k

SNNMLP

Brain-inspired Multilayer Perceptron with Spiking Neurons

4.2k

3D ResNet

ResNet-style video classification networks pretrained on the Kinetics 400 dataset

3.4k

SlowFast

SlowFast networks pretrained on the Kinetics 400 dataset

3.4k

X3D

X3D networks pretrained on the Kinetics 400 dataset

3.4k
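The three video entries above (3D ResNet, SlowFast, X3D) are all served from the pytorchvideo hubconf. A minimal sketch, assuming the 'slowfast_r50' entrypoint; 'slow_r50' and 'x3d_s' load the same way:

    import torch

    # Video classification entrypoints pre-trained on Kinetics 400.
    model = torch.hub.load('facebookresearch/pytorchvideo', 'slowfast_r50',
                           pretrained=True)
    model.eval()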

ResNeSt

A ResNet variant built on split-attention blocks.

3.3k

YOLOP

YOLOP pretrained on the BDD100K dataset

2.0k

Once-for-All

Once-for-all (OFA) decouples training and search, and achieves efficient inference across various edge devices and resource constraints.

1.9k

DCGAN on FashionGen

A simple generative image model for 64×64 images

1.6k

Progressive Growing of GANs (PGAN)

High-quality image generation of fashion items and celebrity faces

1.6k

ProxylessNAS

Proxylessly specialize CNN architectures for different hardware platforms.

1.4k

Open-Unmix

Reference implementation for music source separation

1.4k

IBN-Net

Networks with domain/appearance invariance

803

U-Net for brain MRI

U-Net with batch normalization for biomedical image segmentation with pretrained weights for abnormality segmentation in brain MRI

753
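A minimal sketch, assuming the keyword arguments shown on the repository's hub listing; the input batch is a random placeholder standing in for a 3-channel MRI slice:

    import torch

    # Keyword arguments mirror the repository's hubconf defaults.
    model = torch.hub.load('mateuszbuda/brain-segmentation-pytorch', 'unet',
                           in_channels=3, out_channels=1, init_features=32,
                           pretrained=True)
    model.eval()

    # The pre-trained weights expect 3-channel MRI slices; the output is a
    # single-channel abnormality probability map.
    batch = torch.rand(1, 3, 256, 256)  # placeholder input
    with torch.no_grad():
        mask = model(batch)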

MEAL_V2

Boosting Tiny and Efficient Models using Knowledge Distillation.

694

HybridNets

HybridNets – End2End Perception Network

634

ResNext WSL

ResNeXt models trained with billion-scale weakly-supervised data.

602

HarDNet

Harmonic DenseNet pre-trained on ImageNet

370

Semi-supervised and semi-weakly supervised ImageNet Models

ResNet and ResNeXt models introduced in the “Billion-scale semi-supervised learning for image classification” paper

243

SimpleNet

Let's keep it simple: using simple architectures to outperform deeper and more complex architectures

52

ntsnet

Classify birds using this fine-grained image classifier

34