mindspore.dataset.vision
This module is to support vision augmentations. It includes two parts: c_transforms and py_transforms. C_transforms is a high performance image augmentation module which is developed with c++ opencv. Py_transforms provide more kinds of image augmentations which are developed with Python PIL.
Common imported modules in corresponding API examples are as follows:
import mindspore.dataset.vision.c_transforms as c_vision
import mindspore.dataset.vision.py_transforms as py_vision
from mindspore.dataset.transforms import c_transforms
mindspore.dataset.vision.c_transforms
Apply automatic contrast on input image. |
|
Apply a given image transform on a random selection of bounding box regions of a given image. |
|
Crop the input image at the center to the given size. |
|
Change the color space of the image. |
|
Crop the input image at a specific location. |
|
Apply CutMix transformation on input batch of images and labels. |
|
Randomly cut (mask) out a given number of square patches from the input image array. |
|
Decode the input image in RGB mode(default) or BGR mode(deprecated). |
|
Apply histogram equalization on input image. |
|
Blur input image with the specified Gaussian kernel. |
|
Flip the input image horizontally. |
|
Transpose the input image from shape (H, W, C) to shape (C, H, W). |
|
Apply invert on input image in RGB mode. |
|
Apply MixUp transformation on input batch of images and labels. |
|
Normalize the input image with respect to mean and standard deviation. |
|
Normalize the input image with respect to mean and standard deviation then pad an extra channel with value zero. |
|
Pad the image according to padding parameters. |
|
Apply Random affine transformation to the input image. |
|
Adjust the color of the input image by a fixed or random degree. |
|
Randomly adjust the brightness, contrast, saturation, and hue of the input image. |
|
Crop the input image at a random location. |
|
|
A combination of Crop, Decode and Resize. |
Crop the input image at a random location and adjust bounding boxes accordingly. |
|
Randomly flip the input image horizontally with a given probability. |
|
|
Flip the input image horizontally randomly with a given probability and adjust bounding boxes accordingly. |
Reduce the number of bits for each color channel to posterize the input image randomly with a given probability. |
|
Resize the input image using a randomly selected interpolation mode. |
|
Crop the input image to a random size and aspect ratio. |
|
|
Crop the input image to a random size and aspect ratio and adjust bounding boxes accordingly. |
Tensor operation to resize the input image using a randomly selected interpolation mode and adjust bounding boxes accordingly. |
|
Rotate the input image randomly within a specified range of degrees. |
|
Choose a random sub-policy from a policy list to be applied on the input image. |
|
Adjust the sharpness of the input image by a fixed or random degree. |
|
Randomly selects a subrange within the specified threshold range and sets the pixel value within the subrange to (255 - pixel). |
|
Randomly flip the input image vertically with a given probability. |
|
|
Flip the input image vertically, randomly with a given probability and adjust bounding boxes accordingly. |
Rescale the input image with the given rescale and shift. |
|
Resize the input image to the given size with a given interpolation mode. |
|
Resize the input image to the given size and adjust bounding boxes accordingly. |
|
Rotate the input image by specified degrees. |
|
Slice Tensor to multiple patches in horizontal and vertical directions. |
|
|
A combination of Crop, Decode and Resize using the simulation algorithm of Ascend series chip DVPP module. |
|
Decode and resize JPEG image using the simulation algorithm of Ascend series chip DVPP module. |
Perform randomly selected augmentation on input image. |
|
Flip the input image vertically. |
mindspore.dataset.vision.py_transforms
Automatically maximize the contrast of the input PIL Image. |
|
Crop the central region of the input PIL Image with the given size. |
|
Randomly apply a given number of square patches of zeros to a location within the input numpy.ndarray image of shape (C, H, W). |
|
Decode the input raw image to PIL Image format in RGB mode. |
|
Apply histogram equalization on the input PIL Image. |
|
Crop the given image into one central crop and four corners. |
|
Convert the input PIL Image to grayscale. |
|
Convert one or more numpy.ndarray images from HSV to RGB. |
|
Transpose the input numpy.ndarray image of shape (H, W, C) to (C, H, W). |
|
Invert the colors of the input PIL Image. |
|
Transform the input numpy.ndarray image with a given square transformation matrix and a mean vector. |
|
Randomly mix up a batch of images together with its labels. |
|
Normalize the input numpy.ndarray image of shape (C, H, W) with the specified mean and standard deviation. |
|
Normalize the input numpy.ndarray image of shape (C, H, W) with the specified mean and standard deviation, then pad an extra channel filled with zeros. |
|
Pad the input image on all sides with the given padding parameters. |
|
Apply random affine transformation to the input PIL Image. |
|
Adjust the color balance of the input PIL Image by a random degree. |
|
Randomly adjust the brightness, contrast, saturation, and hue of the input PIL Image. |
|
Crop the input PIL Image at a random location with the specified size. |
|
Randomly erase the pixels within a random selected rectangle region with a given probability. |
|
Randomly convert the input image into grayscale with a given probability. |
|
Randomly flip the input image horizontally with a given probability. |
|
Randomly apply perspective transformation to the input PIL Image with a given probability. |
|
Randomly crop the image and resize it to a given size. |
|
Rotate the input PIL Image by a random angle. |
|
Adjust the sharpness of the input PIL Image by a random degree. |
|
Randomly flip the input image vertically with a given probability. |
|
Resize the input PIL Image to the given size. |
|
Convert one or more numpy.ndarray images from RGB to HSV. |
|
Crop the given image into one central crop and four corners plus the flipped version of these. |
|
Convert the input decoded numpy.ndarray image to PIL Image. |
|
Convert the input PIL Image or numpy.ndarray of shape (H, W, C) in the range [0, 255] to numpy.ndarray of shape (C, H, W) in the range [0.0, 1.0] with the desired dtype. |
|
Convert the input numpy.ndarray image to the desired dtype. |
|
Uniformly select a number of transformations from a sequence and apply them sequentially and randomly, which means that there is a chance that a chosen transformation will not be applied. |
mindspore.dataset.vision.utils
Padding Mode, Border Type. |
|
Data Format of images after batch operation. |
|
Interpolation Modes. |
|
Mode to Slice Tensor into multiple parts. |