Quantization granularity for the PTQ quantizer.
PER_TENSOR: per-tensor granularity; one set of quantization parameters is shared by the whole tensor.
PER_CHANNEL: per-channel granularity; each channel has its own quantization parameters.
PER_TOKEN: per-token granularity; each token has its own quantization parameters.
PER_GROUP: per-group granularity; each group of elements has its own quantization parameters.
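To make the difference between the granularities concrete, the sketch below computes symmetric absmax scales for a small 2-D matrix at each level. It is illustrative only: the function names, the int8 range (qmax=127), the absmax rule, and the row-major layout (rows are output channels for a weight, or tokens for an activation) are assumptions, not this library's implementation.

```python
# Illustrative sketch only; these helpers are hypothetical and not part of the library.
# Assumes symmetric absmax quantization to int8, so scale = max(|x|) / 127.

QMAX = 127  # assumed int8 symmetric range


def per_tensor_scale(matrix):
    """One scale shared by every element of the tensor."""
    return max(abs(x) for row in matrix for x in row) / QMAX


def per_row_scales(matrix):
    """One scale per row.

    For a weight laid out as [out_channels, in_features] this corresponds to
    per-channel granularity; for an activation laid out as [tokens, hidden]
    it corresponds to per-token granularity.
    """
    return [max(abs(x) for x in row) / QMAX for row in matrix]


def per_group_scales(matrix, group_size):
    """One scale per contiguous group of `group_size` elements within each row."""
    scales = []
    for row in matrix:
        groups = [row[i:i + group_size] for i in range(0, len(row), group_size)]
        scales.append([max(abs(x) for x in g) / QMAX for g in groups])
    return scales


matrix = [
    [1.0, -2.0, 4.0, 0.5],
    [0.25, 0.5, -1.0, 8.0],
]

print(per_tensor_scale(matrix))      # a single scale
print(per_row_scales(matrix))        # 2 scales, one per row
print(per_group_scales(matrix, 2))   # 2 x 2 scales, one per group of 2 elements
```

Finer granularity stores more scales but lets each scale track a smaller range of values, which generally reduces quantization error in the presence of outliers.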
Convert a string name to the corresponding quant granularity type.
name (str) – the string name of the quant granularity.
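A name-to-enum conversion of this kind might look like the sketch below. The class name QuantGranularity, the function name granularity_from_name, the lowercase member values, and the case-insensitive matching are all assumptions made for illustration; the actual class and method names in the library may differ.

```python
from enum import Enum


class QuantGranularity(Enum):
    """Hypothetical stand-in for the library's granularity enum."""
    PER_TENSOR = "per_tensor"
    PER_CHANNEL = "per_channel"
    PER_TOKEN = "per_token"
    PER_GROUP = "per_group"


def granularity_from_name(name: str) -> QuantGranularity:
    """Map a string such as "per_channel" to the matching enum member.

    Matching is case-insensitive here; that is an assumption of this sketch.
    """
    key = name.strip().upper()
    try:
        return QuantGranularity[key]
    except KeyError:
        valid = ", ".join(member.name.lower() for member in QuantGranularity)
        raise ValueError(
            f"Unknown quant granularity {name!r}; expected one of: {valid}"
        ) from None


print(granularity_from_name("per_channel"))
```

Raising a ValueError that lists the valid names keeps configuration mistakes easy to diagnose.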