torchaudio.functional.amplitude_to_DB

torchaudio.functional.amplitude_to_DB(x: Tensor, multiplier: float, amin: float, db_multiplier: float, top_db: Optional[float] = None) → Tensor[source]

Turn a spectrogram from the power/amplitude scale to the decibel scale.

The output of each tensor in a batch depends on the maximum value of that tensor, and so may return different values for an audio clip split into snippets vs. a full clip.

Parameters:

x (Tensor) –
Input spectrogram(s) before being converted to decibel scale. The expected shapes are (freq, time), (channel, freq, time) or (..., batch, channel, freq, time).

Note

When top_db is specified, cut-off values are computed for each audio in the batch. Therefore if the input shape is 4D (or larger), different cut-off values are used for audio data in the batch. If the input shape is 2D or 3D, a single cutoff value is used.
multiplier (float) – Use 10. for power and 20. for amplitude
amin (float) – Number to clamp x
db_multiplier (float) – Log10(max(reference value and amin))
top_db (float or None, optional) – Minimum negative cut-off in decibels. A reasonable number is 80. (Default: None)

Returns:

Output tensor in decibel scale

Return type:

Tensor

torchaudio.functional.amplitude_to_DB

Docs

Tutorials

Resources