torchaudio.prototype.pipelines

The pipelines subpackage provides access to models with pretrained weights, along with related utilities.

RNN-T Streaming/Non-Streaming ASR

`EMFORMER_RNNT_BASE_MUSTC`	Pre-trained Emformer-RNNT-based ASR pipeline capable of performing both streaming and non-streaming inference.
`EMFORMER_RNNT_BASE_TEDLIUM3`	Pre-trained Emformer-RNNT-based ASR pipeline capable of performing both streaming and non-streaming inference.
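
As a rough illustration of non-streaming inference with one of these bundles, the sketch below assumes they expose the same interface as `torchaudio.pipelines.RNNTBundle` (`get_feature_extractor()`, `get_decoder()`, `get_token_processor()`, `sample_rate`). The helper names `transcribe` and `transcribe_file` are hypothetical, and taking the first returned hypothesis as the result follows the pattern in the torchaudio examples rather than anything stated above. Streaming inference is not shown.

```python
def transcribe(bundle, waveform, beam_width=10):
    """Run non-streaming recognition on one mono waveform.

    Assumes `bundle` behaves like torchaudio's RNNTBundle and that
    `waveform` is a 1-D tensor already sampled at bundle.sample_rate.
    """
    feature_extractor = bundle.get_feature_extractor()
    decoder = bundle.get_decoder()
    token_processor = bundle.get_token_processor()

    features, length = feature_extractor(waveform)
    hypotheses = decoder(features, length, beam_width)
    # Element 0 of a hypothesis holds its token IDs; use the first hypothesis.
    return token_processor(hypotheses[0][0])


def transcribe_file(path):
    """Hypothetical helper: load an audio file, resample it, transcribe it."""
    # Imports are local so that `transcribe` itself has no hard dependencies.
    import torch
    import torchaudio
    from torchaudio.prototype.pipelines import EMFORMER_RNNT_BASE_TEDLIUM3 as bundle

    waveform, sample_rate = torchaudio.load(path)
    waveform = torchaudio.functional.resample(waveform, sample_rate, bundle.sample_rate)
    with torch.no_grad():
        return transcribe(bundle, waveform[0])  # first channel only
```

Passing the bundle in as an argument keeps the same function usable with either `EMFORMER_RNNT_BASE_MUSTC` or `EMFORMER_RNNT_BASE_TEDLIUM3`.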

`HiFiGANVocoderBundle` defines a HiFiGAN Vocoder pipeline that transforms mel spectrograms into waveforms.

`HiFiGANVocoderBundle`	Data class that bundles associated information to use pretrained HiFiGANVocoder.

HiFiGAN Vocoder pipeline, trained on The LJ Speech Dataset [Ito and Johnson, 2017].
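
To make the mel-spectrogram-to-waveform round trip concrete, here is a minimal sketch. It assumes the bundle offers `get_mel_transform()`, `get_vocoder()` and `sample_rate`, and that the pretrained bundle is importable as `HIFIGAN_VOCODER_V3_LJSPEECH`; neither the method names nor that identifier appear in the text above, so check them against the installed torchaudio version. The helper names are hypothetical.

```python
def resynthesize(bundle, waveform):
    """Convert a waveform to a mel spectrogram and back with the vocoder.

    Assumes `bundle` behaves like HiFiGANVocoderBundle and that
    `waveform` is already sampled at bundle.sample_rate.
    """
    mel_transform = bundle.get_mel_transform()  # waveform -> mel spectrogram
    vocoder = bundle.get_vocoder()              # mel spectrogram -> waveform
    mel_spec = mel_transform(waveform)
    return vocoder(mel_spec)


def resynthesize_file(path):
    """Hypothetical helper: load a file and reconstruct it through the vocoder."""
    import torch
    import torchaudio
    # Assumed identifier for the LJ Speech bundle; not named in the text above.
    from torchaudio.prototype.pipelines import HIFIGAN_VOCODER_V3_LJSPEECH as bundle

    waveform, sample_rate = torchaudio.load(path)
    waveform = torchaudio.functional.resample(waveform, sample_rate, bundle.sample_rate)
    with torch.no_grad():
        return resynthesize(bundle, waveform), bundle.sample_rate
```

Using the bundle's own mel transform, rather than a hand-configured one, keeps the spectrogram parameters consistent with what the vocoder was trained on.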

`VGGishBundle`	VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models.
`VGGishBundle.VGGish`	Implementation of VGGish model [Hershey et al., 2017].
`VGGishBundle.VGGishInputProcessor`	Converts raw waveforms to batches of examples to use as inputs to VGGish.

Pre-trained VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models.
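
The two nested classes listed above suggest a two-stage flow: the input processor turns a raw waveform into a batch of examples, and the model maps those examples to embeddings. The sketch below assumes the bundle provides `get_input_processor()`, `get_model()` and `sample_rate`, and that the pretrained bundle is importable as `VGGISH`; these names are not given in the text above and should be verified. The helper names are hypothetical.

```python
def embed(bundle, waveform):
    """Compute VGGish outputs for one mono waveform.

    Assumes `bundle` behaves like VGGishBundle and that `waveform`
    is a 1-D tensor already sampled at bundle.sample_rate.
    """
    input_processor = bundle.get_input_processor()  # VGGishInputProcessor
    model = bundle.get_model()                      # VGGish
    examples = input_processor(waveform)            # waveform -> batch of examples
    return model(examples)


def embed_file(path):
    """Hypothetical helper: load a file, downmix to mono, and embed it."""
    import torch
    import torchaudio
    # Assumed identifier for the pretrained bundle; not named in the text above.
    from torchaudio.prototype.pipelines import VGGISH as bundle

    waveform, sample_rate = torchaudio.load(path)
    waveform = torchaudio.functional.resample(waveform, sample_rate, bundle.sample_rate)
    mono = waveform.mean(dim=0)  # average channels into a single 1-D signal
    with torch.no_grad():
        return embed(bundle, mono)
```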