Model Inference
=========================

.. toctree::
  :glob:
  :maxdepth: 1
  :hidden:
  :caption: MindSpore Inference

  ms_infer/llm_inference_overview
  ms_infer/weight_prepare
  ms_infer/model_dev
  ms_infer/parallel
  ms_infer/weight_split
  ms_infer/model_export
  ms_infer/quantization
  ms_infer/profiling
  ms_infer/custom_operator

.. toctree::
  :glob:
  :maxdepth: 1
  :hidden:
  :caption: MindSpore Lite Inference

  lite_infer/overview

.. raw:: html

   <div class="container">
      <div class="row">
         <div class="col-md-6">
            <div class="doc-article-list">
               <div class="doc-article-item">
                  <a href="./ms_infer/llm_inference_overview.html" class="article-link">
                     <div>
                        <div class="doc-article-head">
                           <span class="doc-head-content">MindSpore Inference</span>
                        </div>
                        <div class="doc-article-desc">
                           Provides out-of-the-box deployment and inference capabilities for large language models, achieving the best cost-effectiveness based on model characteristics.
                        </div>
                     </div>
                  </a>
               </div>
            </div>
         </div>
         <div class="col-md-6">
            <div class="doc-article-list">
               <div class="doc-article-item">
                  <a href="./lite_infer/overview.html" class="article-link">
                     <div>
                        <div class="doc-article-head">
                           <span class="doc-head-content">MindSpore Lite Inference</span>
                        </div>
                        <div class="doc-article-desc">
                           A lightweight inference engine focused on efficient inference deployment of offline models and high-performance inference on end devices.
                        </div>
                     </div>
                  </a>
               </div>
            </div>
         </div>
      </div>
   </div>