Custom Quantization #1785

anthony-lemurian · 2024-05-10T22:18:18Z

anthony-lemurian
May 10, 2024

Hello, Im just wondering if its possible to define a custom data type to do WOQ in this repo? Im following the MX branch to see how they add that data type, however i wonder if there is a more straightforward approach since im only after WOQ

yiliu30 · 2024-05-13T14:37:53Z

yiliu30
May 13, 2024
Collaborator

Hey @anthony-lemurian, thanks for showing interest in our project!

For WOQ, the basic idea is quantize and dequantize the tensor (weight) to mimic the quantization error. The main function for this is quant_tensor, which takes a tensor (weight) and certain configurations to select the qdq_weight_actor.

Taking 4-bits as an example, qdq_weight_asym applies asymmetrical quantization and dequantization to the provided weight.

Hope this can give you some insights to define new data type :)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Custom Quantization #1785

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Custom Quantization #1785

anthony-lemurian May 10, 2024

Replies: 1 comment

yiliu30 May 13, 2024 Collaborator

anthony-lemurian
May 10, 2024

yiliu30
May 13, 2024
Collaborator