TEDLIUM¶
- class torchaudio.datasets.TEDLIUM(root: Union[str, Path], release: str = 'release1', subset: str = 'train', download: bool = False, audio_ext: str = '.sph')[source]¶
Tedlium [Rousseau et al., 2012] dataset (releases 1,2 and 3).
- Parameters:
root (str or Path) – Path to the directory where the dataset is found or downloaded.
release (str, optional) – Release version. Allowed values are
"release1"
,"release2"
or"release3"
. (default:"release1"
).subset (str, optional) – The subset of dataset to use. Valid options are
"train"
,"dev"
, and"test"
. Defaults to"train"
.download (bool, optional) – Whether to download the dataset if it is not found at root path. (default:
False
).audio_ext (str, optional) – extension for audio file (default:
".sph"
)
Properties¶
phoneme_dict¶
Methods¶
__getitem__¶
- TEDLIUM.__getitem__(n: int) Tuple[Tensor, int, str, int, int, int] [source]¶
Load the n-th sample from the dataset.
- Parameters:
n (int) – The index of the sample to be loaded
- Returns:
Tuple of the following items;
- Tensor:
Waveform
- int:
Sample rate
- str:
Transcript
- int:
Talk ID
- int:
Speaker ID
- int:
Identifier