Make some fbgemm fp8 triton ops pt2 friendly #3188
Conversation
This pull request was exported from Phabricator. Differential Revision: D63560103
Summary:
X-link: facebookresearch/FBGEMM#283
Pull Request resolved: pytorch#3188

Make some fbgemm fp8 triton ops pt2 friendly.

# What this diff tries to do
* Stop using TensorWrapper and tl.reinterpret.
* Remove the use of triton_heuristics for _kernel_matmul_fp8_row.

# What this diff won't help with
* triton_heuristics use cases of EVEN_K. One option is to just merge that into the autotuning configs.

# Need to do in the future
* Update other ops, like quantize_fp8_row.
* Update the documentation. It feels pretty outdated, and some of it still references TensorWrapper.

Reviewed By: jwfromm, henrylhtsang

Differential Revision: D63560103
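For context on the first point in the summary (dropping TensorWrapper and tl.reinterpret), here is a minimal sketch of the general idea, assuming fp8 data stored in a uint8 tensor. The helper name `as_fp8` and the shapes are illustrative, not taken from the fbgemm code; the point is that viewing the storage as a native torch float8 dtype yields a plain tensor that torch.compile can trace, whereas a wrapper object reinterpreted at kernel-launch time cannot be traced through.

```python
import torch

def as_fp8(x_uint8: torch.Tensor) -> torch.Tensor:
    # Reinterpret the raw bytes as float8_e4m3fn without copying,
    # instead of wrapping the tensor and calling tl.reinterpret at launch.
    return x_uint8.view(torch.float8_e4m3fn)

x = torch.randint(0, 256, (16, 16), dtype=torch.uint8)
x_fp8 = as_fp8(x)  # a plain torch.Tensor, usable inside a compiled region
```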
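On the remaining EVEN_K point, a sketch of the pattern being discussed, assuming a generic Triton matmul: the kernel name, configs, and block sizes below are made up and the body is elided. EVEN_K is currently derived by @triton.heuristics; the summary suggests it could instead be folded into the @triton.autotune configs.

```python
import triton
import triton.language as tl

@triton.autotune(
    configs=[
        triton.Config({"BLOCK_M": 64, "BLOCK_N": 64, "BLOCK_K": 32}, num_warps=4),
        triton.Config({"BLOCK_M": 128, "BLOCK_N": 64, "BLOCK_K": 64}, num_warps=8),
    ],
    key=["M", "N", "K"],
)
# EVEN_K is still supplied by a heuristic; merging it into the configs above
# is the option floated in the summary.
@triton.heuristics({"EVEN_K": lambda args: args["K"] % args["BLOCK_K"] == 0})
@triton.jit
def _matmul_sketch(
    a_ptr, b_ptr, c_ptr, M, N, K,
    BLOCK_M: tl.constexpr, BLOCK_N: tl.constexpr, BLOCK_K: tl.constexpr,
    EVEN_K: tl.constexpr,
):
    # When EVEN_K is True the inner K loop can skip boundary masking.
    pass
```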
This pull request has been merged in d27acbd.