COMMONVOICE

class torchaudio.datasets.COMMONVOICE(root: Union[str, Path], tsv: str = 'train.tsv')

CommonVoice [Ardila et al., 2020] dataset.

Parameters:

root (str or Path) – Path to the directory where the dataset is located, i.e. the directory that contains the tsv file.
tsv (str, optional) – The name of the tsv file used to construct the metadata, such as "train.tsv", "test.tsv", "dev.tsv", "invalidated.tsv", "validated.tsv" and "other.tsv". (default: "train.tsv")
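
As a minimal sketch of how the two parameters fit together, the helper `open_commonvoice` below builds the dataset for one TSV file, and `summarize_sample` unpacks the (waveform, sample rate, metadata) tuple that indexing the dataset returns. Both helper names are hypothetical and not part of torchaudio. The sketch assumes torchaudio is installed and that `root` already contains the extracted corpus.

```python
from pathlib import Path
from typing import Any, Dict, Tuple


def open_commonvoice(root: str, tsv: str = "train.tsv"):
    """Build the dataset for one TSV file located under `root` (hypothetical helper)."""
    # Imported lazily so that summarize_sample can be used without torchaudio installed.
    import torchaudio

    return torchaudio.datasets.COMMONVOICE(root=Path(root), tsv=tsv)


def summarize_sample(sample: Tuple[Any, int, Dict[str, str]]) -> str:
    """Format one (waveform, sample_rate, metadata) tuple as a one-line summary."""
    waveform, sample_rate, metadata = sample
    # Only the keys are listed; which keys exist depends on the TSV file.
    fields = ", ".join(sorted(metadata))
    return f"sample_rate={sample_rate} Hz, metadata fields: {fields}"
```

With a corpus on disk, `summarize_sample(open_commonvoice(root, tsv="validated.tsv")[0])` would describe the first sample listed in `validated.tsv`, where `root` is a placeholder for the directory that holds that file.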

__getitem__

COMMONVOICE.__getitem__(n: int) → Tuple[Tensor, int, Dict[str, str]]

Load the n-th sample from the dataset.

Parameters:

n (int) – The index of the sample to be loaded.

Returns:

Tuple of the following items:

Tensor:

Waveform

int:

Sample rate

Dict[str, str]:

Dictionary containing the following items from the corresponding TSV file;