torchaudio.prototype.pipelines¶
The pipelines subpackage contains APIs for models with pretrained weights, along with related utilities.
RNN-T Streaming/Non-Streaming ASR¶
Pretrained Models¶
Two pre-trained Emformer-RNNT-based ASR pipelines are listed here; each is capable of performing both streaming and non-streaming inference.
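The non-streaming path of such a pipeline can be sketched as follows. This is a minimal sketch, not the library's own code: it assumes the pretrained pipeline object exposes the same accessor methods as `torchaudio.pipelines.RNNTBundle` (`get_feature_extractor`, `get_decoder`, `get_token_processor`), and the helper name `transcribe` and its `beam_width` default are illustrative choices.

```python
def transcribe(bundle, waveform, beam_width=10):
    """Run non-streaming ASR with an RNN-T pipeline bundle.

    Assumptions: `bundle` exposes RNNTBundle-style accessors, and
    `waveform` is a 1-D tensor at the sample rate the bundle expects.
    """
    feature_extractor = bundle.get_feature_extractor()
    decoder = bundle.get_decoder()
    token_processor = bundle.get_token_processor()

    # Convert the raw waveform into acoustic features and their length.
    features, length = feature_extractor(waveform)

    # Beam search returns a list of hypotheses; the first element of a
    # hypothesis holds its token ids.
    hypotheses = decoder(features, length, beam_width)

    # Map the token ids of the first returned hypothesis back to text.
    return token_processor(hypotheses[0][0])
```

With a torchaudio build that still ships the prototype subpackage, one of the pretrained pipeline objects from this section would be passed as `bundle`. For streaming inference, `RNNTBundle` also provides `get_streaming_feature_extractor`, and the decoder is then driven segment by segment instead of over the whole utterance.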
HiFiGAN Vocoder¶
Interface¶
HiFiGANVocoderBundle defines a HiFiGAN Vocoder pipeline capable of transforming mel spectrograms into waveforms.
It is a data class that bundles the information needed to use a pretrained HiFiGAN vocoder model.
Pretrained Models¶
HiFiGAN Vocoder pipeline, trained on The LJ Speech Dataset [Ito and Johnson, 2017].
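A mel spectrogram round trip through the vocoder might look like the sketch below. It assumes the bundle exposes `sample_rate`, `get_mel_transform()` and `get_vocoder()`, as `HiFiGANVocoderBundle` does to the best of our knowledge; the helper name `mel_round_trip` is hypothetical, and the method names should be checked against the installed torchaudio version.

```python
def mel_round_trip(bundle, waveform, sample_rate):
    """Convert a waveform to a mel spectrogram and back to a waveform.

    Assumptions: `bundle` is a HiFiGANVocoderBundle-like object and
    `waveform` is a tensor shaped (batch, time).
    """
    if sample_rate != bundle.sample_rate:
        # Imported lazily so the helper can be defined without torchaudio.
        import torchaudio.functional as F

        # The vocoder expects audio at the rate it was trained on.
        waveform = F.resample(waveform, sample_rate, bundle.sample_rate)

    mel_transform = bundle.get_mel_transform()
    vocoder = bundle.get_vocoder()

    # Waveform -> mel spectrogram, with the transform matching the vocoder.
    mel_spec = mel_transform(waveform)
    # Mel spectrogram -> reconstructed waveform.
    return vocoder(mel_spec)
```

Using the transform returned by the bundle, rather than a hand-configured mel spectrogram, keeps the spectrogram parameters consistent with the ones the pretrained vocoder was trained on.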
VGGish¶
Interface¶
VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models.
Implementation of the VGGish model [Hershey et al., 2017].
Converts raw waveforms to batches of examples to use as inputs to VGGish.
Pretrained Models¶
Pre-trained VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models.
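The two VGGish components described under Interface (the input processor and the model) are typically chained. The following is a minimal sketch under the assumption that the pipeline object exposes `get_input_processor()` and `get_model()`; the helper name `extract_embeddings` is hypothetical.

```python
def extract_embeddings(bundle, waveform):
    """Compute VGGish embeddings for a raw waveform.

    Assumptions: `bundle` is a VGGish pipeline object and `waveform`
    is a 1-D tensor already sampled at `bundle.sample_rate`.
    """
    input_processor = bundle.get_input_processor()
    model = bundle.get_model()

    # Raw waveform -> batch of fixed-size examples for the model.
    examples = input_processor(waveform)
    # Batch of examples -> one embedding vector per example.
    return model(examples)
```

If the audio was recorded at a different rate, it would need to be resampled to `bundle.sample_rate` before being passed to this helper.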