mindspore.dataset.audio.Magphase

查看源文件
class mindspore.dataset.audio.Magphase(power=1.0)[源代码]

将shape为 \((..., 2)\) 的复值光谱图分离,输出幅度和相位。

参数:
  • power (float) - 范数的幂,必须是非负的。默认值: 1.0

异常:
  • RuntimeError - 当输入音频的shape不为(…, 2)。

支持平台:

CPU

样例:

>>> import numpy as np
>>> import mindspore.dataset as ds
>>> import mindspore.dataset.audio as audio
>>>
>>> # Use the transform in dataset pipeline mode
>>> waveform = np.random.random([5, 16, 2])  # 5 samples
>>> numpy_slices_dataset = ds.NumpySlicesDataset(data=waveform, column_names=["audio"])
>>> transforms = [audio.Magphase()]
>>> numpy_slices_dataset = numpy_slices_dataset.map(operations=transforms, input_columns=["audio"],
...                                                 output_columns=["spect", "phase"])
>>> for item in numpy_slices_dataset.create_dict_iterator(num_epochs=1, output_numpy=True):
...     print(item["spect"].shape, item["spect"].dtype)
...     break
(16,) float64
>>>
>>> # Use the transform in eager mode
>>> waveform = np.random.random([16, 2])  # 1 sample
>>> output = audio.Magphase()(waveform)
>>> print(output[0].shape, output[0].dtype)
(16,) float64
教程样例: