Wav2Vec2FABundle.Tokenizer

class torchaudio.pipelines.Wav2Vec2FABundle.Tokenizer

Interface of the tokenizer

Methods

__call__

abstract Tokenizer.__call__(transcript: List[str]) → List[List[str]]

Tokenize the given transcript (a list of words).

Note

The transcript must be normalized.

Parameters:

transcript (list of str) – Transcript (list of words).

Returns:

List of token sequences, one sequence per word

Return type:

(list of list of int)
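Example

The call interface above can be illustrated with a minimal sketch. It does not use torchaudio itself: `CharTokenizer` and its vocabulary are hypothetical stand-ins, and the sketch assumes a character-level tokenizer that maps each character of a word to an integer ID. In practice, a tokenizer instance would normally be obtained from a bundle (for example through `get_tokenizer()`), and its vocabulary would come from the bundle rather than from the illustrative string used here.

```python
from typing import Dict, List


class CharTokenizer:
    """Hypothetical stand-in that follows the Tokenizer call interface."""

    def __init__(self, dictionary: Dict[str, int]):
        self.dictionary = dictionary

    def __call__(self, transcript: List[str]) -> List[List[int]]:
        # One token sequence per word; each character maps to an integer ID.
        return [[self.dictionary[c] for c in word] for word in transcript]


# Illustrative vocabulary (an assumption, not the one shipped with any bundle).
dictionary = {c: i for i, c in enumerate("-abcdefghijklmnopqrstuvwxyz")}
tokenizer = CharTokenizer(dictionary)

# The transcript is already normalized here: lowercase, no punctuation,
# and split into words.
tokens = tokenizer(["hello", "world"])
print(tokens)  # [[8, 5, 12, 12, 15], [23, 15, 18, 12, 4]]
```

A character that is missing from the vocabulary raises `KeyError` in this sketch, which is one reason the transcript needs to be normalized before it is tokenized.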
