mindspore.ops.swiglu

mindspore.ops.swiglu(input, dim=-1)[source]

Computes SwiGLU (Swish-Gated Linear Unit activation function) of the input tensor. SwiGLU is a variant of the mindspore.ops.GLU activation function and is defined as:

Warning

This is an experimental API that is subject to change or deletion.

SwiGLU(a, b) = Swish(a) ⊗ b

where a is the first half of the input matrices and b is the second half, Swish(a) = a ⋅ σ(a), σ is the mindspore.ops.sigmoid() activation function, and ⊗ is the Hadamard product.
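The following is a minimal NumPy sketch of this definition, not the MindSpore kernel itself; the helper name swiglu_reference is hypothetical, and it assumes a is the first half and b the second half of the input along dim (consistent with the example further below).

import numpy as np

def swiglu_reference(x, dim=-1):
    # Split x into the two halves a and b along `dim`; np.split requires the size to be even.
    a, b = np.split(x, 2, axis=dim)
    # Swish(a) = a * sigmoid(a), then the Hadamard (elementwise) product with b.
    return a * (1.0 / (1.0 + np.exp(-a))) * b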

Parameters
  • input (Tensor) – Tensor to be split. It has shape (*₁, N, *₂), where * means any number of additional dimensions. N must be divisible by 2.

  • dim (int, optional) – the axis along which to split the input. It must be an int. Default: -1, the last axis of input.

Returns

Tensor, the same dtype as the input, with the shape (*₁, M, *₂), where M = N/2.

Raises
  • TypeError – If the dtype of input is not float16, float32, or bfloat16.

  • TypeError – If input is not a Tensor.

  • RuntimeError – If the size of the dimension specified by dim is not divisible by 2.

Supported Platforms:

Ascend

Examples

>>> import mindspore
>>> from mindspore import Tensor, ops
>>> input = Tensor([[-0.12, 0.123, 31.122], [2.1223, 4.1212121217, 0.3123]], dtype=mindspore.float32)
>>> output = ops.swiglu(input, 0)
>>> print(output)
[[-0.11970687 0.2690224 9.7194 ]]
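
With the default dim=-1, the last axis is split in two. The following is a minimal follow-on usage sketch with hypothetical input values; the result is stated approximately rather than as exact printed output.

>>> x = Tensor([[1.0, 2.0, 3.0, 4.0]], dtype=mindspore.float32)
>>> y = ops.swiglu(x)  # default dim=-1: a = [1., 2.], b = [3., 4.]
>>> # y is approximately [[2.193 7.046]], i.e. Swish(a) ⊗ b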