Overall Structure

The overall architecture formed by MindSpore Transformers and the end-to-end AI hardware and software ecosystem of MindSpore and Ascend is as follows:

At the hardware level, MindSpore Transformers supports users running large models on Ascend servers;
At the software level, MindSpore Transformers implements the big model-related code through the Python interface provided by MindSpore and performs data computation by the operator libraries provided by the supporting software package of the Ascend AI processor;
The basic functionality features currently supported by MindSpore Transformers are listed below:
1. Supports tasks such as running training and inference for large models distributed parallelism, with parallel capabilities including data parallelism, model parallelism, ultra-long sequence parallelism;
2. Supports model weight conversion, distributed weight splitting and combination, and different format of dataset loading and resumable training after breakpoint;
3. Support 25+ large models pretraining, fine-tuning, inference and [evaluation] (https://www.mindspore.cn/mindformers/docs/en/dev/usage/evaluation.html). Meanwhile, it also supports quantization, and the list of supported models can be found in Model Library;
MindSpore Transformers supports users to carry out model service deployment function through MindIE, and also supports the use of MindX to realize large-scale cluster scheduling; more third-party platforms will be supported in the future, please look forward to it.