You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, it is a nice code for a quantization.
However, I have some questions about sensitivity measurement and genetic algorithm.
In the paper, you expressed that layer sensitivity can be measured with diagonal loss while off-diagonal loss expresses cross-layer sensitivity. However, I could not find such a loss term in your code in this git..... if you do not mind, please let me know where you wrote those terms?
Also, could you please tell me where are your implementation details of scale data for weights and activation? Where do you calculation them and improve them?
It was quite interesting that using genetic algorithm to find the optimal bitwidth configuration for each block. But as well as sensitivity problem, I could not find such a code for this algorithm in the git. Again, if you do not mind it either, please let me know where it is.
By the way, I am really impressed with your paper and code for new quantization method. Hope you to have a nice day. Thank you and sorry for the questions!
The text was updated successfully, but these errors were encountered:
Hi, it is a nice code for a quantization.
However, I have some questions about sensitivity measurement and genetic algorithm.
In the paper, you expressed that layer sensitivity can be measured with diagonal loss while off-diagonal loss expresses cross-layer sensitivity. However, I could not find such a loss term in your code in this git..... if you do not mind, please let me know where you wrote those terms?
Also, could you please tell me where are your implementation details of scale data for weights and activation? Where do you calculation them and improve them?
It was quite interesting that using genetic algorithm to find the optimal bitwidth configuration for each block. But as well as sensitivity problem, I could not find such a code for this algorithm in the git. Again, if you do not mind it either, please let me know where it is.
By the way, I am really impressed with your paper and code for new quantization method. Hope you to have a nice day. Thank you and sorry for the questions!
The text was updated successfully, but these errors were encountered: