HDemucs¶

class torchaudio.models.HDemucs(sources: List[str], audio_channels: int = 2, channels: int = 48, growth: int = 2, nfft: int = 4096, depth: int = 6, freq_emb: float = 0.2, emb_scale: int = 10, emb_smooth: bool = True, kernel_size: int = 8, time_stride: int = 2, stride: int = 4, context: int = 1, context_enc: int = 0, norm_starts: int = 4, norm_groups: int = 4, dconv_depth: int = 2, dconv_comp: int = 4, dconv_attn: int = 4, dconv_lstm: int = 4, dconv_init: float = 0.0001)[source]¶

Hybrid Demucs model from Hybrid Spectrogram and Waveform Source Separation [Défossez, 2021].

Methods¶

forward¶

HDemucs.forward(input: Tensor)[source]¶

HDemucs forward call

Parameters:

input (torch.Tensor) – input mixed tensor of shape (batch_size, channel, num_frames)

Returns:

Tensor: output tensor split into sources of shape (batch_size, num_sources, channel, num_frames)

Factory Functions¶

`hdemucs_low`	Builds low nfft (1024) version of `HDemucs`, suitable for sample rates around 8 kHz.
`hdemucs_medium`	Builds medium nfft (2048) version of `HDemucs`, suitable for sample rates of 16-32 kHz.
`hdemucs_high`	Builds medium nfft (4096) version of `HDemucs`, suitable for sample rates of 44.1-48 kHz.

HDemucs¶

Methods¶

forward¶

Factory Functions¶

Docs

Tutorials

Resources