torchaudio.pipelines ==================== .. currentmodule:: torchaudio.pipelines .. py:module:: torchaudio.pipelines The pipelines subpackage contains API to access the models with pretrained weights, and information/helper functions associated the pretrained weights. RNN-T Streaming/Non-Streaming ASR --------------------------------- RNNTBundle ~~~~~~~~~~ .. autoclass:: RNNTBundle :members: sample_rate, n_fft, n_mels, hop_length, segment_length, right_context_length .. automethod:: get_decoder() -> torchaudio.models.RNNTBeamSearch .. automethod:: get_feature_extractor() -> RNNTBundle.FeatureExtractor .. automethod:: get_streaming_feature_extractor() -> RNNTBundle.FeatureExtractor .. automethod:: get_token_processor() -> RNNTBundle.TokenProcessor RNNTBundle - FeatureExtractor ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. autoclass:: torchaudio.pipelines::RNNTBundle.FeatureExtractor :special-members: __call__ RNNTBundle - TokenProcessor ~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. autoclass:: torchaudio.pipelines::RNNTBundle.TokenProcessor :special-members: __call__ EMFORMER_RNNT_BASE_LIBRISPEECH ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: EMFORMER_RNNT_BASE_LIBRISPEECH :no-value: wav2vec 2.0 / HuBERT - Representation Learning ---------------------------------------------- .. autoclass:: Wav2Vec2Bundle :members: sample_rate .. automethod:: get_model WAV2VEC2_BASE ~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_BASE :no-value: WAV2VEC2_LARGE ~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_LARGE :no-value: WAV2VEC2_LARGE_LV60K ~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_LARGE_LV60K :no-value: WAV2VEC2_XLSR53 ~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_XLSR53 :no-value: HUBERT_BASE ~~~~~~~~~~~ .. container:: py attribute .. autodata:: HUBERT_BASE :no-value: HUBERT_LARGE ~~~~~~~~~~~~ .. container:: py attribute .. autodata:: HUBERT_LARGE :no-value: HUBERT_XLARGE ~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: HUBERT_XLARGE :no-value: wav2vec 2.0 / HuBERT - Fine-tuned ASR ------------------------------------- Wav2Vec2ASRBundle ~~~~~~~~~~~~~~~~~ .. autoclass:: Wav2Vec2ASRBundle :members: sample_rate .. automethod:: get_model .. automethod:: get_labels WAV2VEC2_ASR_BASE_10M ~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_BASE_10M :no-value: WAV2VEC2_ASR_BASE_100H ~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_BASE_100H :no-value: WAV2VEC2_ASR_BASE_960H ~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_BASE_960H :no-value: WAV2VEC2_ASR_LARGE_10M ~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_LARGE_10M :no-value: WAV2VEC2_ASR_LARGE_100H ~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_LARGE_100H :no-value: WAV2VEC2_ASR_LARGE_960H ~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_LARGE_960H :no-value: WAV2VEC2_ASR_LARGE_LV60K_10M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_LARGE_LV60K_10M :no-value: WAV2VEC2_ASR_LARGE_LV60K_100H ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_LARGE_LV60K_100H :no-value: WAV2VEC2_ASR_LARGE_LV60K_960H ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: WAV2VEC2_ASR_LARGE_LV60K_960H :no-value: VOXPOPULI_ASR_BASE_10K_DE ~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: VOXPOPULI_ASR_BASE_10K_DE :no-value: VOXPOPULI_ASR_BASE_10K_EN ~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: VOXPOPULI_ASR_BASE_10K_EN :no-value: VOXPOPULI_ASR_BASE_10K_ES ~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: VOXPOPULI_ASR_BASE_10K_ES :no-value: VOXPOPULI_ASR_BASE_10K_FR ~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: VOXPOPULI_ASR_BASE_10K_FR :no-value: VOXPOPULI_ASR_BASE_10K_IT ~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: VOXPOPULI_ASR_BASE_10K_IT :no-value: HUBERT_ASR_LARGE ~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: HUBERT_ASR_LARGE :no-value: HUBERT_ASR_XLARGE ~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: HUBERT_ASR_XLARGE :no-value: Tacotron2 Text-To-Speech ------------------------ Tacotron2TTSBundle ~~~~~~~~~~~~~~~~~~ .. autoclass:: Tacotron2TTSBundle .. automethod:: get_text_processor .. automethod:: get_tacotron2 .. automethod:: get_vocoder Tacotron2TTSBundle - TextProcessor ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. autoclass:: torchaudio.pipelines::Tacotron2TTSBundle.TextProcessor :members: tokens :special-members: __call__ Tacotron2TTSBundle - Vocoder ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. autoclass:: torchaudio.pipelines::Tacotron2TTSBundle.Vocoder :members: sample_rate :special-members: __call__ TACOTRON2_WAVERNN_PHONE_LJSPEECH ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: TACOTRON2_WAVERNN_PHONE_LJSPEECH :no-value: TACOTRON2_WAVERNN_CHAR_LJSPEECH ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: TACOTRON2_WAVERNN_CHAR_LJSPEECH :no-value: TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH :no-value: TACOTRON2_GRIFFINLIM_CHAR_LJSPEECH ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .. container:: py attribute .. autodata:: TACOTRON2_GRIFFINLIM_CHAR_LJSPEECH :no-value: References ---------- .. footbibliography::