mindspore_gs.ptq

Post training quantization algorithms.

import mindspore_gs.ptq as ptq

PTQ Config

mindspore_gs.ptq.PTQConfig

Config for post trainning quantization.

PTQMode Enum

mindspore_gs.ptq.PTQMode

Mode for ptq quantizer.

NetworkHelper

mindspore_gs.ptq.NetworkHelper

NetworkHelper for decoupling algorithm with network framework.

mindspore_gs.ptq.MFLlama2Helper

Derived from 'NetworkHelper', a utility class for the MindFormers framework Llama2 network.

RoundToNearest Algorithm

mindspore_gs.ptq.RoundToNearest

Native implementation for post training quantization based on min/max statistic values.