EMFORMER_RNNT_BASE_LIBRISPEECH
- torchaudio.pipelines.EMFORMER_RNNT_BASE_LIBRISPEECH
ASR pipeline based on Emformer-RNNT, pretrained on the LibriSpeech dataset [Panayotov et al., 2015], capable of performing both streaming and non-streaming inference.
The underlying model is constructed by torchaudio.models.emformer_rnnt_base() and utilizes weights trained on LibriSpeech using the training script train.py here with default arguments.
Please refer to RNNTBundle for usage instructions.
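As a rough illustration of how the bundle might be used, the sketch below follows the non-streaming flow described for RNNTBundle: build the feature extractor, decoder, and token processor, then decode a waveform. The function names `transcribe` and `streaming_chunk_sizes` are hypothetical helpers introduced for this example, not part of torchaudio. The sketch assumes the input is a 1-D waveform already at the bundle's sample rate, and that the pretrained files can be downloaded on first use.

```python
def streaming_chunk_sizes(hop_length, segment_length, right_context_length):
    """Return (samples per segment, samples per segment plus right context).

    Hypothetical helper: converts the bundle's frame counts into sample counts,
    which is how audio would be sliced for streaming inference.
    """
    segment = segment_length * hop_length
    return segment, segment + right_context_length * hop_length


def transcribe(waveform, beam_width=10):
    """Non-streaming transcription of a 1-D waveform tensor (hypothetical helper).

    Assumes the waveform is already sampled at the bundle's sample rate.
    """
    # Imported here so the helper above stays usable without torchaudio installed.
    import torch
    from torchaudio.pipelines import EMFORMER_RNNT_BASE_LIBRISPEECH as bundle

    # Each getter builds one pipeline stage from the pretrained bundle.
    feature_extractor = bundle.get_feature_extractor()
    decoder = bundle.get_decoder()
    token_processor = bundle.get_token_processor()

    with torch.no_grad():
        # Produce features and their length, then run beam search.
        features, length = feature_extractor(waveform)
        hypotheses = decoder(features, length, beam_width)

    # hypotheses[0] is the top hypothesis; its first element holds the token IDs.
    return token_processor(hypotheses[0][0])
```

For streaming use, `streaming_chunk_sizes(bundle.hop_length, bundle.segment_length, bundle.right_context_length)` would give the number of samples to feed per step; the streaming feature extractor and the decoder's incremental inference are covered in the RNNTBundle usage instructions.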