DR_VCTK
- class torchaudio.datasets.DR_VCTK(root: Union[str, Path], subset: str = 'train', *, download: bool = False, url: str = 'https://datashare.ed.ac.uk/bitstream/handle/10283/3038/DR-VCTK.zip')
Device Recorded VCTK (Small subset version) [Sarfjoo and Yamagishi, 2018] dataset.
- Parameters:
root (str or Path) – Root directory where the dataset’s top level directory is found.
subset (str) – The subset to use. Can be one of "train" and "test". (default: "train")
download (bool) – Whether to download the dataset if it is not found at root path. (default: False)
url (str) – The URL to download the dataset from. (default: "https://datashare.ed.ac.uk/bitstream/handle/10283/3038/DR-VCTK.zip")
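The parameters above can be exercised with a minimal sketch like the following. It assumes torchaudio is installed and that the archive has already been extracted under a hypothetical `./data` directory. The helper `open_dr_vctk` and the constant `VALID_SUBSETS` are illustrative names, not part of torchaudio.

```python
from pathlib import Path
from typing import Union

# The two subset names documented above.
VALID_SUBSETS = ("train", "test")


def open_dr_vctk(root: Union[str, Path], subset: str = "train", download: bool = False):
    """Hypothetical helper: check `subset` early, then construct the dataset."""
    if subset not in VALID_SUBSETS:
        raise ValueError(f"subset must be one of {VALID_SUBSETS}, got {subset!r}")
    # Imported lazily so the check above runs even where torchaudio is absent.
    from torchaudio.datasets import DR_VCTK

    # `download` is keyword-only in the signature shown above.
    return DR_VCTK(root, subset=subset, download=download)


# Example call (needs the extracted dataset under ./data, an assumed path):
# dataset = open_dr_vctk("./data", subset="test")
```

Passing `download=True` instead would fetch the archive from the default `url` when it is not found under `root`.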
__getitem__
- DR_VCTK.__getitem__(n: int) → Tuple[Tensor, int, Tensor, int, str, str, str, int]
Load the n-th sample from the dataset.
- Parameters:
n (int) – The index of the sample to be loaded
- Returns:
Tuple of the following items, in order:
- Tensor:
Clean waveform
- int:
Sample rate of the clean waveform
- Tensor:
Noisy waveform
- int:
Sample rate of the noisy waveform
- str:
Speaker ID
- str:
Utterance ID
- str:
Source
- int:
Channel ID
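Because the returned tuple has eight positional fields, it can help to give them names. The sketch below is one way to do that. `DRVCTKSample` and `name_fields` are hypothetical names, not part of torchaudio, and the stand-in item uses made-up placeholder values in place of real tensors and metadata.

```python
from typing import Any, NamedTuple, Tuple


class DRVCTKSample(NamedTuple):
    """Hypothetical wrapper naming the eight fields in their documented order."""

    clean_waveform: Any  # Tensor in a real sample
    clean_sample_rate: int
    noisy_waveform: Any  # Tensor in a real sample
    noisy_sample_rate: int
    speaker_id: str
    utterance_id: str
    source: str
    channel_id: int


def name_fields(item: Tuple) -> DRVCTKSample:
    """Wrap a raw 8-tuple, such as the result of dataset[n], in DRVCTKSample."""
    return DRVCTKSample(*item)


# Stand-in item with placeholder values only; a real item comes from dataset[n].
fake_item = ([0.0, 0.1], 1, [0.0, 0.2], 1, "spk", "utt", "src", 0)
sample = name_fields(fake_item)
print(sample.speaker_id, sample.utterance_id, sample.channel_id)
```

With a real dataset, `name_fields(dataset[0])` would give attribute access to the same fields while remaining an ordinary tuple.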