Inference on a GPU

Linux GPU Inference Application Beginner Intermediate Expert

Inference Using a Checkpoint File

The inference is the same as that on the Ascend 910 AI processor.

Inference Using an ONNX File

Generate a model in ONNX format on the training platform. For details, see Export ONNX Model.
Perform inference on a GPU by referring to the runtime or SDK document. For example, use TensorRT to perform inference on the NVIDIA GPU. For details, see TensorRT backend for ONNX.