LibriLightLimited

class torchaudio.datasets.LibriLightLimited(root: Union[str, Path], subset: str = '10min', download: bool = False)

A subset of the Libri-light dataset [Kahn et al., 2020], which was used in HuBERT [Hsu et al., 2021] for supervised fine-tuning.

Parameters:
  • root (str or Path) – Path to the directory where the dataset is found or downloaded.

  • subset (str, optional) – The subset to use. Options: ["10min", "1h", "10h"] (Default: "10min").

  • download (bool, optional) – Whether to download the dataset if it is not found at the root path (Default: False).
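The constructor parameters above can be exercised with a small wrapper. This is a minimal sketch, not part of torchaudio: the helper name `open_librilight_limited` and the constant `VALID_SUBSETS` are hypothetical, and the subset check simply mirrors the options listed in the parameter description.

```python
from pathlib import Path
from typing import Union

# Options copied from the parameter description above.
VALID_SUBSETS = ("10min", "1h", "10h")


def open_librilight_limited(
    root: Union[str, Path], subset: str = "10min", download: bool = False
):
    """Hypothetical convenience wrapper around torchaudio.datasets.LibriLightLimited."""
    if subset not in VALID_SUBSETS:
        raise ValueError(f"subset must be one of {VALID_SUBSETS}, got {subset!r}")
    # Imported lazily so the argument check above works without torchaudio installed.
    from torchaudio.datasets import LibriLightLimited

    return LibriLightLimited(root, subset=subset, download=download)
```

Calling `open_librilight_limited("./data", subset="1h", download=True)` would fetch the data into `./data` if it is not already there; the path is only an example.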

__getitem__

LibriLightLimited.__getitem__(n: int) → Tuple[Tensor, int, str, int, int, int]

Load the n-th sample from the dataset.

Parameters:
  • n (int) – The index of the sample to be loaded.

Returns:
  Tuple of the following items:

  • Tensor – Waveform

  • int – Sample rate

  • str – Transcript

  • int – Speaker ID

  • int – Chapter ID

  • int – Utterance ID
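The six-element tuple can be unpacked positionally in the order listed above. The sketch below uses a hypothetical helper, `describe_sample`, and assumes the waveform is shaped (channels, frames), which is the usual torchaudio layout but is not stated on this page.

```python
from typing import Any, Dict, Tuple


def describe_sample(sample: Tuple[Any, int, str, int, int, int]) -> Dict[str, Any]:
    """Unpack one LibriLightLimited item into a labeled dict (hypothetical helper)."""
    waveform, sample_rate, transcript, speaker_id, chapter_id, utterance_id = sample
    # Assumption: the last dimension of the waveform holds the audio frames.
    num_frames = waveform.shape[-1]
    return {
        "duration_sec": num_frames / sample_rate,
        "transcript": transcript,
        "speaker_id": speaker_id,
        "chapter_id": chapter_id,
        "utterance_id": utterance_id,
    }


def show_first_sample(root: str) -> None:
    """Load the first item of the default subset and print its summary."""
    from torchaudio.datasets import LibriLightLimited

    dataset = LibriLightLimited(root, subset="10min", download=True)
    print(describe_sample(dataset[0]))
```

`show_first_sample` is defined but not called here, since running it downloads the data into `root`.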
