Shortcuts

torchaudio.models.wavlm_model

torchaudio.models.wavlm_model(extractor_mode: str, extractor_conv_layer_config: Optional[List[Tuple[int, int, int]]], extractor_conv_bias: bool, encoder_embed_dim: int, encoder_projection_dropout: float, encoder_pos_conv_kernel: int, encoder_pos_conv_groups: int, encoder_num_layers: int, encoder_num_heads: int, encoder_num_buckets: int, encoder_max_distance: int, encoder_attention_dropout: float, encoder_ff_interm_features: int, encoder_ff_interm_dropout: float, encoder_dropout: float, encoder_layer_norm_first: bool, encoder_layer_drop: float, aux_num_out: Optional[int]) Wav2Vec2Model[source]

Builds custom WaveLM model [Chen et al., 2022]. The architecture is compatible with Wav2Vec2 model [Baevski et al., 2020], and so the output object is Wav2Vec2Model. Most of the arguments have the same meaning as in wav2vec2_model() so please refer there for documentation.

Parameters:
Returns:

The resulting model.

Return type:

Wav2Vec2Model

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources