MindSpore Serving Documents

MindSpore Serving is a lightweight, high-performance module that helps developers deploy inference services efficiently in production. Train your model on MindSpore, export it, and then use MindSpore Serving to create an inference service for it.

MindSpore Serving supports:

Customized preprocessing and postprocessing to simplify model release and deployment
Batching that splits and combines multi-instance requests to fit the model's batch size requirement
Distributed model inference
gRPC APIs and easy-to-use Python wrapper APIs on the client
RESTful APIs on the client
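
The batching behavior listed above can be pictured with a small, framework-independent sketch. It is not MindSpore Serving's implementation: the function names `split_into_batches` and `combine_results` are hypothetical, and padding a short final batch by repeating its last instance is an assumed policy chosen only for illustration.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def split_into_batches(instances: Sequence[T], batch_size: int) -> List[List[T]]:
    """Group instances into batches of exactly batch_size, padding the last one."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    batches = []
    for start in range(0, len(instances), batch_size):
        batch = list(instances[start:start + batch_size])
        # Assumed policy: pad a short final batch by repeating its last instance.
        batch += [batch[-1]] * (batch_size - len(batch))
        batches.append(batch)
    return batches


def combine_results(batched_outputs: Sequence[Sequence[T]], num_instances: int) -> List[T]:
    """Flatten per-batch outputs and drop the entries produced by padding."""
    flat = [out for batch in batched_outputs for out in batch]
    return flat[:num_instances]


# Five instances sent to a model that only accepts batches of two.
instances = [1, 2, 3, 4, 5]
batches = split_into_batches(instances, batch_size=2)
outputs = [[x * 10 for x in batch] for batch in batches]  # stand-in for model inference
results = combine_results(outputs, len(instances))
```

Each caller still receives one result per instance it sent, regardless of how the instances were grouped for the model.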

Typical Application Scenarios

Quick Start

Use the Add network as an example to demonstrate how to deploy an inference service with MindSpore Serving.

Access Services with gRPC APIs

Easily access services with high performance.

Using the RESTful APIs to Access Services

Access services based on HTTP.
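
As a rough sketch of what an HTTP request to a served model might look like, the helper below builds a URL and JSON body using only the standard library. It assumes a URL layout of `/model/<servable>[/version/<n>]:<method>` and a JSON body with an `instances` field; both are assumptions here and should be checked against the RESTful guide. The helper name `build_predict_request`, the host, the port `1500`, and the names `add` and `add_common` are hypothetical.

```python
import json
from typing import Any, List, Optional, Tuple


def build_predict_request(
    host: str,
    port: int,
    servable: str,
    method: str,
    instances: List[Any],
    version: Optional[int] = None,
) -> Tuple[str, str]:
    """Return (url, json_body) for a POST request under the assumed URL layout."""
    path = f"/model/{servable}"
    if version is not None:
        # The version segment is optional in the assumed layout.
        path += f"/version/{version}"
    url = f"http://{host}:{port}{path}:{method}"
    body = json.dumps({"instances": instances})
    return url, body


# Hypothetical servable "add" with method "add_common" taking inputs x1 and x2.
url, body = build_predict_request(
    "127.0.0.1", 1500, "add", "add_common",
    instances=[{"x1": [[1.0, 2.0]], "x2": [[3.0, 4.0]]}],
    version=1,
)
```

The returned pair can be passed to any HTTP client as a POST request; no network call is made by the helper itself.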

Installation

MindSpore Serving Installation

Guide

API References

References

FAQ

RELEASE NOTES

Release Notes