Shortcuts

COMMONVOICE

class torchaudio.datasets.COMMONVOICE(root: Union[str, Path], tsv: str = 'train.tsv')[source]

CommonVoice [Ardila et al., 2020] dataset.

Parameters:
  • root (str or Path) – Path to the directory where the dataset is located. (Where the tsv file is present.)

  • tsv (str, optional) – The name of the tsv file used to construct the metadata, such as "train.tsv", "test.tsv", "dev.tsv", "invalidated.tsv", "validated.tsv" and "other.tsv". (default: "train.tsv")

__getitem__

COMMONVOICE.__getitem__(n: int) Tuple[Tensor, int, Dict[str, str]][source]

Load the n-th sample from the dataset.

Parameters:

n (int) – The index of the sample to be loaded

Returns:

Tuple of the following items;

Tensor:

Waveform

int:

Sample rate

Dict[str, str]:

Dictionary containing the following items from the corresponding TSV file;

  • "client_id"

  • "path"

  • "sentence"

  • "up_votes"

  • "down_votes"

  • "age"

  • "gender"

  • "accent"

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources