HDemucs

class torchaudio.models.HDemucs(sources: List[str], audio_channels: int = 2, channels: int = 48, growth: int = 2, nfft: int = 4096, depth: int = 6, freq_emb: float = 0.2, emb_scale: int = 10, emb_smooth: bool = True, kernel_size: int = 8, time_stride: int = 2, stride: int = 4, context: int = 1, context_enc: int = 0, norm_starts: int = 4, norm_groups: int = 4, dconv_depth: int = 2, dconv_comp: int = 4, dconv_attn: int = 4, dconv_lstm: int = 4, dconv_init: float = 0.0001)

Hybrid Demucs model from "Hybrid Spectrogram and Waveform Source Separation" [Défossez, 2021].

forward

HDemucs.forward(input: Tensor)

Runs the model on a mixed input and returns the separated sources.

Parameters:

input (torch.Tensor) – input mixed tensor of shape (batch_size, channel, num_frames)

Returns:

Tensor: output tensor split into sources of shape (batch_size, num_sources, channel, num_frames)
