Add INT8 and INT4 support to P2P benchmark. #918


Closed
caogao wants to merge 1 commit

Conversation

caogao (Contributor) commented Feb 8, 2022

Summary:
Add two options:

  1. default, which transfers the quantized tensor directly;
  2. --include_quantization, which starts with an FP16 tensor, quantizes, transfers, and finally dequantizes back to an FP16 tensor (sketched below).

Also, add an option to sweep through data types and shapes.

Caveat: INT4 dequantization is not numerically correct, but it is added as a proxy for performance measurement.
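
For illustration, a minimal sketch of what the two measurement paths could look like; this is not the actual benchmark code. It assumes two visible CUDA devices, uses a plain symmetric INT8 quantization as a stand-in for the benchmark's real INT8/INT4 kernels, and the helper names (`p2p_quantized_only`, `p2p_include_quantization`) are hypothetical:

```python
# Hypothetical sketch of the two P2P measurement paths; not the FBGEMM benchmark code.
# Assumes two visible CUDA devices.
import time
import torch


def p2p_quantized_only(num_bytes: int, src: int = 0, dst: int = 1) -> float:
    """Default path: copy an already-quantized INT8 tensor from src to dst."""
    q = torch.randint(-128, 128, (num_bytes,), dtype=torch.int8, device=f"cuda:{src}")
    torch.cuda.synchronize(src)
    torch.cuda.synchronize(dst)
    t0 = time.perf_counter()
    q.to(f"cuda:{dst}", non_blocking=True)
    torch.cuda.synchronize(src)
    torch.cuda.synchronize(dst)
    return time.perf_counter() - t0


def p2p_include_quantization(num_elems: int, src: int = 0, dst: int = 1) -> float:
    """--include_quantization path: FP16 -> quantize -> transfer -> dequantize -> FP16."""
    x = torch.randn(num_elems, dtype=torch.float16, device=f"cuda:{src}")
    torch.cuda.synchronize(src)
    torch.cuda.synchronize(dst)
    t0 = time.perf_counter()
    # Simple symmetric INT8 quantization stands in for the benchmark's real kernels.
    scale = x.abs().max().to(torch.float32).clamp(min=1e-8) / 127.0
    q = torch.clamp(torch.round(x.to(torch.float32) / scale), -128, 127).to(torch.int8)
    q_dst = q.to(f"cuda:{dst}", non_blocking=True)
    _ = q_dst.to(torch.float16) * scale.to(f"cuda:{dst}").to(torch.float16)
    torch.cuda.synchronize(src)
    torch.cuda.synchronize(dst)
    return time.perf_counter() - t0
```

The sweep option presumably just loops such a routine over the supported data types (FP16, INT8, INT4) and a list of tensor shapes.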

Reviewed By: brad-mengchi

Differential Revision: D31098854

fbshipit-source-id: f7dd6ec6d57967aec9593108b8a9ecf52c947d4c
facebook-github-bot (Contributor) commented:
This pull request was exported from Phabricator. Differential Revision: D31098854

q10 pushed a commit to q10/FBGEMM that referenced this pull request Apr 10, 2025
Summary:
X-link: pytorch#3841

Pull Request resolved: facebookresearch/FBGEMM#918

QK norm applies L2 norm, not RMS norm, so we just use k_norm instead of k_rms_norm.

Reviewed By: jasonjk-park, Aya-ZIbra

Differential Revision: D71268903

fbshipit-source-id: aa5ad2ea795a718843d6c15a9dee03e9b332b860
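
For context on the distinction drawn in that commit message, here is a minimal sketch (hypothetical, not taken from the FBGEMM sources) of L2 normalization versus RMS normalization of a key tensor along its last dimension:

```python
# Hypothetical illustration of L2 norm vs. RMS norm; not FBGEMM code.
import torch


def l2_norm(k: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # L2 norm: divide by the Euclidean length, so each vector has unit L2 norm.
    return k / torch.linalg.vector_norm(k, ord=2, dim=-1, keepdim=True).clamp(min=eps)


def rms_norm(k: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # RMS norm: divide by the root-mean-square of the elements (L2 norm / sqrt(d)),
    # usually followed by a learned per-channel scale, omitted here.
    return k / torch.sqrt(k.pow(2).mean(dim=-1, keepdim=True) + eps)
```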