模型推理
=========================

.. toctree::
   :glob:
   :maxdepth: 1
   :hidden:
   :caption: MindSpore推理

   ms_infer/llm_inference_overview
   ms_infer/weight_prepare
   ms_infer/model_dev
   ms_infer/parallel
   ms_infer/weight_split
   ms_infer/model_export
   ms_infer/quantization
   ms_infer/profiling
   ms_infer/custom_operator

.. toctree::
   :glob:
   :maxdepth: 1
   :hidden:
   :caption: MindSpore Lite推理

   lite_infer/overview

.. raw:: html

   <div class="container">
         <div class="row">
            <div class="col-md-6">
               <div class="doc-article-list">
                  <div class="doc-article-item">
                     <a href="./ms_infer/llm_inference_overview.html" class="article-link">
                        <div>
                           <div class="doc-article-head">
                              <span class="doc-head-content">MindSpore推理</span>
                           </div>
                           <div class="doc-article-desc">
                              提供“开箱即用”的大语言模型部署和推理能力,根据模型特点实现最优性价比。
                           </div>
                        </div>
                     </a>
                  </div>
               </div>
            </div>
            <div class="col-md-6">
               <div class="doc-article-list">
                  <div class="doc-article-item">
                     <a href="./lite_infer/overview.html" class="article-link">
                        <div>
                           <div class="doc-article-head">
                              <span class="doc-head-content">MindSpore Lite推理</span>
                           </div>
                           <div class="doc-article-desc">
                              专注于离线模型的高效推理部署方案和端上设备的高性能推理的轻量化推理引擎。
                           </div>
                        </div>
                     </a>
                  </div>
               </div>
            </div>
         </div>
   </div>