Release Notes
MindSpore Golden Stick 1.0.0 Release Notes
Major Features and Improvements
The Post-Training Quantization algorithm
PTQ
supports theGPTQ
quantization algorithm, which can quantize weight to 8 bit or 4 bit.GPTQ
has been added to the precision recovery algorithm set and can be selected throughprecision_recovery
inPTQConfig
. Currently, only theGPTQ
algorithm is available for precision recovery.The Post-Training Quantization algorithm
PTQ
supports theAWQ
quantization algorithm. It enablesAWQ
by adding a new outlier suppression method to quantize the weights to 4 bits. The outlier suppression method can be selected throughoutliers_suppression
inPTQConfig
, and currently, two methods are available:smooth
andawq
.
API Change
The quantization methods
RoundToNearest
andSmoothQuant
are deprecated. Please replace them withPTQ
.
Contributors
Thanks goes to these wonderful people:
huangzhuo, zhangminli, ccsszz, yyyyrf, hangangqiang
Contributions of any kind are welcome!