Wav2Letter¶
- class torchaudio.models.Wav2Letter(num_classes: int = 40, input_type: str = 'waveform', num_features: int = 1)[source]¶
Wav2Letter model architecture from Wav2Letter: an End-to-End ConvNet-based Speech Recognition System [Collobert et al., 2016].
See also
- Parameters:
forward¶
- Wav2Letter.forward(x: Tensor) Tensor [source]¶
- Parameters:
x (torch.Tensor) – Tensor of dimension (batch_size, num_features, input_length).
- Returns:
Predictor tensor of dimension (batch_size, number_of_classes, input_length).
- Return type:
Tensor