Wav2Letter
- class torchaudio.models.Wav2Letter(num_classes: int = 40, input_type: str = 'waveform', num_features: int = 1)[source]
Wav2Letter model architecture from Wav2Letter: an End-to-End ConvNet-based Speech Recognition System [Collobert et al., 2016].
See also
- Parameters:
Methods
forward
- Wav2Letter.forward(x: Tensor) Tensor [source]
- Parameters:
x (torch.Tensor) – Tensor of dimension (batch_size, num_features, input_length).
- Returns:
Predictor tensor of dimension (batch_size, number_of_classes, input_length).
- Return type:
Tensor