Tacotron2TTSBundle.Vocoder

class torchaudio.pipelines.Tacotron2TTSBundle.Vocoder

Interface of the vocoder part of the Tacotron2TTS pipeline.

Properties

abstract property Vocoder.sample_rate

The sample rate of the resulting waveform.
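As a small illustration of what the sample rate is used for, the sketch below converts a number of waveform samples into a duration in seconds. The helper `duration_seconds` is hypothetical and not part of torchaudio, and 22050 Hz is only an assumed example value; in practice the rate should be read from the vocoder's `sample_rate` property.

```python
def duration_seconds(num_samples: int, sample_rate: int) -> float:
    """Convert a sample count into seconds (hypothetical helper, not a torchaudio API)."""
    return num_samples / sample_rate


# Assumed example value; read the real one from `vocoder.sample_rate`.
sample_rate = 22050
print(duration_seconds(44100, sample_rate))  # 2.0
```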

Methods

abstract Vocoder.__call__(specgrams: Tensor, lengths: Optional[Tensor] = None) → Tuple[Tensor, Optional[Tensor]]

Generate a waveform from the given input, such as a spectrogram.

Parameters:

specgrams (Tensor) – The input spectrogram. Typical shape: (batch, frequency bins, time); the exact expected shape depends on the implementation.
lengths (Tensor or None, optional) – The valid length of each sample in the batch. Shape: (batch, ). (Default: None)

Returns:

Tensor: The generated waveform. Shape: (batch, max length)
Tensor or None: The valid length of each sample in the batch. Shape: (batch, ).

Return type:

(Tensor, Optional[Tensor])
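The call contract above can be sketched with a toy stand-in. `ToyVocoder` below is hypothetical: it is not a torchaudio class and does not subclass the real interface; it only mimics the documented signature and shapes. The default sample rate (22050), the hop length (256), and the 80 frequency bins are assumed illustrative values, and the "synthesis" simply repeats a per-frame average, so the output is not meaningful audio.

```python
from typing import Optional, Tuple

import torch
from torch import Tensor


class ToyVocoder:
    """Hypothetical stand-in that mimics the Vocoder interface; not a real vocoder."""

    def __init__(self, sample_rate: int = 22050, hop_length: int = 256) -> None:
        # Both defaults are assumed values chosen for illustration only.
        self._sample_rate = sample_rate
        self._hop_length = hop_length

    @property
    def sample_rate(self) -> int:
        return self._sample_rate

    def __call__(
        self, specgrams: Tensor, lengths: Optional[Tensor] = None
    ) -> Tuple[Tensor, Optional[Tensor]]:
        # Collapse the frequency bins to one value per frame: (batch, time).
        frames = specgrams.mean(dim=1)
        # Repeat every frame hop_length times: (batch, time * hop_length).
        waveform = frames.repeat_interleave(self._hop_length, dim=1)
        # Valid lengths scale by the same factor; None is passed through.
        out_lengths = None if lengths is None else lengths * self._hop_length
        return waveform, out_lengths


vocoder = ToyVocoder()
specgrams = torch.rand(2, 80, 100)   # (batch, frequency bins, time)
lengths = torch.tensor([100, 60])    # valid frames per sample

waveforms, wave_lengths = vocoder(specgrams, lengths)

# Drop the padded tail of each waveform using the returned valid lengths.
trimmed = [w[:n] for w, n in zip(waveforms, wave_lengths.tolist())]
```

The returned lengths matter for batched input: the waveform tensor is padded to the longest sample, so each item should be trimmed to its valid length before it is saved or played back at `vocoder.sample_rate`.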