mindspore.dataset.audio

This module is to support audio augmentations. It includes two parts: transforms and utils. transforms is a high performance processing module with common audio operations. utils provides some general methods for audio processing.

Common imported modules in corresponding API examples are as follows:

import mindspore.dataset as ds
import mindspore.dataset.audio.transforms as audio

mindspore.dataset.audio.transforms

mindspore.dataset.audio.transforms.AllpassBiquad

Design two-pole all-pass filter for audio waveform of dimension of (…, time).

mindspore.dataset.audio.transforms.AmplitudeToDB

Converts the input tensor from amplitude/power scale to decibel scale.

mindspore.dataset.audio.transforms.Angle

Calculate the angle of the complex number sequence of shape (…, 2).

mindspore.dataset.audio.transforms.BandBiquad

Design two-pole band filter for audio waveform of dimension of (…, time).

mindspore.dataset.audio.transforms.BandpassBiquad

Design two-pole band-pass filter.

mindspore.dataset.audio.transforms.BandrejectBiquad

Design two-pole band filter for audio waveform of dimension of (…, time).

mindspore.dataset.audio.transforms.BassBiquad

Design a bass tone-control effect for audio waveform of dimension of (…, time).

mindspore.dataset.audio.transforms.ComplexNorm

Compute the norm of complex tensor input.

mindspore.dataset.audio.transforms.Contrast

Apply contrast effect.

mindspore.dataset.audio.transforms.FrequencyMasking

Apply masking to a spectrogram in the frequency domain.

mindspore.dataset.audio.transforms.LowpassBiquad

Design biquad lowpass filter and perform filtering.

mindspore.dataset.audio.transforms.TimeMasking

Apply masking to a spectrogram in the time domain.

mindspore.dataset.audio.transforms.TimeStretch

Stretch STFT in time at a given rate, without changing the pitch.

mindspore.dataset.audio.utils

mindspore.dataset.audio.utils.ScaleType

Scale Types.