Installation and Deployment
Quantization Aware Training Algorithms
Post Training Quantization Algorithms
Pruning Algorithms
Model Deployment
API References
RELEASE NOTES
Quant granularity for ptq quantizer.
PER_TENSOR: apply quant granularity to per_tensor.
PER_TENSOR
PER_CHANNEL: apply quant granularity to per_channel.
PER_CHANNEL
PER_TOKEN: apply quant granularity to per_token.
PER_TOKEN
PER_GROUP: apply quant granularity to per_group.
PER_GROUP
Convert name to quant granularity type.
name (str) – the string name of the quant granularity.