The Mask R-CNN model is based on the Mask R-CNN paper.
The detection module is in Beta stage, and backward compatibility is not guaranteed.
The following model builders can be used to instantiate a Mask R-CNN model, with or
without pre-trained weights. All the model builders internally rely on the
torchvision.models.detection.mask_rcnn.MaskRCNN base class. Please refer to the source
more details about this class.
Mask R-CNN model with a ResNet-50-FPN backbone from the Mask R-CNN paper.
Improved Mask R-CNN model with a ResNet-50-FPN backbone from the Benchmarking Detection Transfer Learning with Vision Transformers paper.