torchaudio.models.hubert_base¶
- torchaudio.models.hubert_base(encoder_projection_dropout: float = 0.1, encoder_attention_dropout: float = 0.1, encoder_ff_interm_dropout: float = 0.0, encoder_dropout: float = 0.1, encoder_layer_drop: float = 0.05, aux_num_out: Optional[int] = None) Wav2Vec2Model [source]¶
Builds “base”
HuBERT
from HuBERT [Hsu et al., 2021]- Parameters:
encoder_projection_dropout (float) – See
wav2vec2_model()
.encoder_attention_dropout (float) – See
wav2vec2_model()
.encoder_ff_interm_dropout (float) – See
wav2vec2_model()
.encoder_dropout (float) – See
wav2vec2_model()
.encoder_layer_drop (float) – See
wav2vec2_model()
.aux_num_out (int or None, optional) – See
wav2vec2_model()
.
- Returns:
The resulting model.
- Return type: