Wav2Vec2FABundle.Aligner

class torchaudio.pipelines.Wav2Vec2FABundle.Aligner: Interface of the aligner

Methods

__call__

abstract Aligner.__call__(emission: Tensor, tokens: List[List[int]]) → List[List[TokenSpan]]

Generate a list of time-stamped token sequences.

Parameters:

emission (Tensor) – Sequence of token probability distributions in log-domain. Shape: (time, tokens).
tokens (list of integer sequences) – Tokenized transcript. Output from torchaudio.pipelines.Wav2Vec2FABundle.Tokenizer.

Returns:

Tokens with time stamps and scores.

Return type:

(list of TokenSpan sequences)
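
The returned spans are indexed along the time axis of emission, so they usually need to be converted to seconds before use. The following is a minimal sketch of that conversion, not part of the torchaudio API. It assumes each span exposes token, start, end, and score fields with an exclusive end index, and that the frame-to-sample ratio is the waveform length divided by the number of emission frames. The Span class and the word_times helper are hypothetical stand-ins so the example runs without a model.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Span:
    """Hypothetical stand-in that mirrors the assumed fields of TokenSpan."""
    token: int    # token index
    start: int    # first emission frame (inclusive)
    end: int      # last emission frame (exclusive, assumed)
    score: float  # alignment score for this token


def word_times(
    spans: List[List[Span]],
    num_frames: int,
    num_samples: int,
    sample_rate: int,
) -> List[Tuple[float, float, float]]:
    """Convert per-word span lists into (start_sec, end_sec, score) tuples."""
    # Number of waveform samples covered by one emission frame.
    ratio = num_samples / num_frames
    results = []
    for word in spans:
        start_sec = ratio * word[0].start / sample_rate
        end_sec = ratio * word[-1].end / sample_rate
        # Average the token scores, weighted by each token's length in frames.
        total_frames = sum(s.end - s.start for s in word)
        score = sum(s.score * (s.end - s.start) for s in word) / total_frames
        results.append((start_sec, end_sec, score))
    return results


# Two words: the first has two tokens, the second has one.
example = [
    [Span(1, 0, 10, 0.9), Span(2, 10, 20, 0.7)],
    [Span(3, 25, 50, 1.0)],
]
times = word_times(example, num_frames=100, num_samples=32000, sample_rate=16000)
for start_sec, end_sec, score in times:
    print(f"{start_sec:.3f}s - {end_sec:.3f}s (score {score:.2f})")
```

In real use, the outer list would be the value returned by the aligner, with one inner list per element of tokens; num_frames would be the size of the time dimension of emission, and num_samples the length of the input waveform.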
