mindspore.ops.CTCGreedyDecoder

class mindspore.ops.CTCGreedyDecoder(merge_repeated=True)

Performs greedy decoding on the logits given in inputs.

For more details, see mindspore.ops.ctc_greedy_decoder().

Note

On the Ascend platform, setting merge_repeated=False is currently not supported.

Parameters:

merge_repeated (bool, optional) - Whether to merge repeated classes in the returned result. Default: True.
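
The effect of merge_repeated can be illustrated with a small NumPy sketch. It assumes the conventional CTC greedy collapse rule (drop blanks, and optionally drop a class that repeats the previous step); the source does not show the operator's internal implementation, and collapse_path is a hypothetical helper, not part of MindSpore.

```python
import numpy as np

def collapse_path(path, blank, merge_repeated=True):
    """Collapse a per-step argmax path into a label sequence (illustrative only)."""
    decoded = []
    prev = None
    for cls in path:
        # A class counts as a repeat only if it equals the immediately preceding step.
        is_repeat = merge_repeated and cls == prev
        if cls != blank and not is_repeat:
            decoded.append(int(cls))
        prev = cls
    return decoded

# Hypothetical argmax path over 7 time steps; class 2 plays the role of the blank label.
path = np.array([0, 1, 1, 2, 1, 2, 1])
merged = collapse_path(path, blank=2, merge_repeated=True)
unmerged = collapse_path(path, blank=2, merge_repeated=False)
print(merged)    # [0, 1, 1, 1]
print(unmerged)  # [0, 1, 1, 1, 1]
```

Under this assumed rule, a blank between two identical classes keeps both of them even when merge_repeated is True, since the repeat check looks only at the previous time step.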

Inputs:

inputs (Tensor) - shape: $(max\_time, batch\_size, num\_classes)$, and the data type must be float32 or float64. num_classes equals num_labels + 1, where num_labels is the number of actual labels; the blank label defaults to num_classes - 1.
sequence_length (Tensor) - shape: $(batch\_size,)$, and the data type must be int32. Every value in the Tensor must be less than or equal to max_time.
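
The input constraints above can be checked on the NumPy side before building Tensors. This is a minimal sketch; check_ctc_inputs is a hypothetical helper written for illustration, and the error messages are not those raised by MindSpore.

```python
import numpy as np

def check_ctc_inputs(inputs, sequence_length):
    """Validate arrays against the documented constraints and return the blank index."""
    if inputs.ndim != 3:
        raise ValueError("inputs must have shape (max_time, batch_size, num_classes)")
    if inputs.dtype not in (np.dtype(np.float32), np.dtype(np.float64)):
        raise TypeError("inputs must be float32 or float64")
    max_time, batch_size, num_classes = inputs.shape
    if sequence_length.shape != (batch_size,):
        raise ValueError("sequence_length must have shape (batch_size,)")
    if sequence_length.dtype != np.dtype(np.int32):
        raise TypeError("sequence_length must be int32")
    if np.any(sequence_length > max_time):
        raise ValueError("sequence_length values must be <= max_time")
    # The blank label defaults to the last class index.
    return num_classes - 1

logits = np.zeros((2, 2, 3), dtype=np.float32)   # max_time=2, batch_size=2, num_classes=3
lengths = np.array([2, 2], dtype=np.int32)
blank_index = check_ctc_inputs(logits, lengths)
print(blank_index)  # 2
```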

Outputs:

decoded_indices (Tensor) - shape: $(total\_decoded\_outputs, 2)$, with data type int64.
decoded_values (Tensor) - shape: $(total\_decoded\_outputs,)$, with data type int64.
decoded_shape (Tensor) - the dense shape of the decoded result, holding the values $(batch\_size, max\_decoded\_length)$, with data type int64.
log_probability (Tensor) - shape: $(batch\_size, 1)$, containing the log probability of each sequence. Its data type matches that of inputs.
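
To make the four outputs concrete, the following NumPy sketch reproduces them for a small batch. It rests on two assumptions not stated in the text: the conventional greedy collapse rule, and log_probability being the negative sum of the largest logit at each time step (inferred from the sample output on this page, where -1.2 = -(0.6 + 0.6)). greedy_decode_reference is a hypothetical name, and the actual kernel may differ in details.

```python
import numpy as np

def greedy_decode_reference(inputs, sequence_length, merge_repeated=True):
    """Reference sketch of greedy CTC decoding with sparse outputs (illustrative only)."""
    max_time, batch_size, num_classes = inputs.shape
    blank = num_classes - 1                      # blank label defaults to the last class
    indices, values = [], []
    log_prob = np.zeros((batch_size, 1), dtype=inputs.dtype)
    max_len = 0
    for b in range(batch_size):
        prev, pos = None, 0
        for t in range(int(sequence_length[b])):
            cls = int(np.argmax(inputs[t, b]))   # greedy choice at this time step
            log_prob[b, 0] -= inputs[t, b, cls]  # assumed: negative sum of max logits
            if cls != blank and not (merge_repeated and cls == prev):
                indices.append([b, pos])         # (batch index, position in decoded sequence)
                values.append(cls)
                pos += 1
            prev = cls
        max_len = max(max_len, pos)
    decoded_indices = np.array(indices, dtype=np.int64).reshape(-1, 2)
    decoded_values = np.array(values, dtype=np.int64)
    decoded_shape = np.array([batch_size, max_len], dtype=np.int64)
    return decoded_indices, decoded_values, decoded_shape, log_prob

logits = np.array([[[0.6, 0.4, 0.2], [0.8, 0.6, 0.3]],
                   [[0.0, 0.6, 0.0], [0.5, 0.4, 0.5]]], dtype=np.float32)
lengths = np.array([2, 2], dtype=np.int32)
idx, vals, shape, logp = greedy_decode_reference(logits, lengths)
print(idx.tolist(), vals.tolist(), shape.tolist())
```

Each row of decoded_indices pairs a batch index with a position in that batch element's decoded sequence, and decoded_values holds the class decoded at that position.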

Supported Platforms:

Ascend GPU CPU

Examples:

>>> import mindspore
>>> import numpy as np
>>> from mindspore import Tensor, ops
>>> inputs = Tensor(np.array([[[0.6, 0.4, 0.2], [0.8, 0.6, 0.3]],
...                           [[0.0, 0.6, 0.0], [0.5, 0.4, 0.5]]]), mindspore.float32)
>>> sequence_length = Tensor(np.array([2, 2]), mindspore.int32)
>>> decoded_indices, decoded_values, decoded_shape, log_probability = ops.CTCGreedyDecoder()(inputs,
...                                                                                          sequence_length)
>>> print(decoded_indices)
[[0 0]
 [0 1]
 [1 0]]
>>> print(decoded_values)
[0 1 0]
>>> print(decoded_shape)
[2 2]
>>> print(log_probability)
[[-1.2]
 [-1.3]]
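
The sparse triple (decoded_indices, decoded_values, decoded_shape) can be expanded into a padded dense array when that is easier to work with. The sketch below uses NumPy arrays holding the values printed above; with real outputs, Tensor.asnumpy() would supply them. The padding value -1 is an arbitrary choice made for this illustration.

```python
import numpy as np

# Values copied from the printed outputs of the example.
decoded_indices = np.array([[0, 0], [0, 1], [1, 0]], dtype=np.int64)
decoded_values = np.array([0, 1, 0], dtype=np.int64)
decoded_shape = np.array([2, 2], dtype=np.int64)

pad_value = -1  # hypothetical padding marker for positions with no decoded label
dense = np.full(tuple(decoded_shape), pad_value, dtype=np.int64)
# Scatter each decoded value to its (batch, position) slot.
dense[decoded_indices[:, 0], decoded_indices[:, 1]] = decoded_values
print(dense)
# [[ 0  1]
#  [ 0 -1]]
```

Row 0 holds the two labels decoded for the first batch element, while row 1 holds a single label followed by padding.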