mindspore.ops.NLLLoss

class mindspore.ops.NLLLoss(reduction='mean', ignore_index=- 100)[source]

Gets the negative log likelihood loss between logits and labels.

The nll loss with $r e d u c t i o n = n o n e$ can be described as:

ℓ (x, t) = L = {l_{1}, \dots, l_{N}}^{⊤}, l_{n} = - w_{t_{n}} x_{n, t_{n}}, w_{c} = weight [c] \cdot 1

where $x$ is the logits, $t$ is the labels, $w$ is the weight, N is the batch size, $c$ belonging to [0, C-1] is class index, where $C$ is the number of classes.

If $r e d u c t i o n \neq n o n e$ (default 'mean' ), then

\begin{array}{r} ℓ (x, t) = {\begin{cases} \sum_{n = 1}^{N} \frac{1}{\sum_{n = 1}^{N} w_{t n}} l_{n}, & if reduction ='mean'; \\ \sum_{n = 1}^{N} l_{n}, & if reduction ='sum' \end{cases} \end{array}

Parameters

reduction (str, optional) –
Apply specific reduction method to the output: 'none' , 'mean' , 'sum' . Default: 'mean' .
- 'none': no reduction will be applied.
- 'mean': compute and return the weighted mean of elements in the output.
- 'sum': the output elements will be summed.
ignore_index (int) – Specifies a target value that is ignored and does not contribute to the input gradient. Default: -100 .

Inputs:

logits (Tensor) - Input logits, with shape $(N, C)$ . Data type only supports float32 or float16.
labels (Tensor) - Ground truth labels, with shape $(N,)$ , where each value belong to $[0, C - 1]$ . Data type only supports int32 or int64.
weight (Tensor) - The rescaling weight to each class, with shape $(C,)$ and data type only supports float32 or float16.

Outputs:

Tuple of 2 tensors composed with loss and total_weight.

loss (Tensor) - When reduction is 'none' and logits is a 2D tensor, the loss shape is $(N,)$ . Otherwise, the loss is a scalar. The data type is the same with input's.
total_weight (Tensor) - The total_weight is a scalar. The data type is the same with weight's.

Raises

TypeError – If dtype of logits or weight is neither float16 nor float32.
TypeError – If dtype of labels is neither int32 nor int64.
ValueError – If logits is not a one or two dimension tensor, labels and weight are not one dimension tensors. When logits is a two dimension tensor, the first dimension of logits is not equal to labels, and second dimension of logits is not equal to weight. When logits is a one dimension tensor, the dimensions of logits, labels and weight should be equal to each other.
ValueError – If the value of labels exceed $[0, C - 1]$ , where $C$ is the number of classes.

Supported Platforms:: Ascend GPU CPU

Examples

>>> import numpy as np
>>> from mindspore import Tensor, ops
>>> logits = Tensor(np.array([[0.5488135, 0.71518934],
...                           [0.60276335, 0.5448832],
...                           [0.4236548, 0.6458941]]).astype(np.float32))
>>> labels = Tensor(np.array([0, 0, 0]).astype(np.int32))
>>> weight = Tensor(np.array([0.3834415, 0.79172504]).astype(np.float32))
>>> nll_loss = ops.NLLLoss(reduction="mean")
>>> loss, weight = nll_loss(logits, labels, weight)
>>> print(loss)
-0.52507716
>>> print(weight)
1.1503246