Refactor FP8 grouped GEMM with dynamic and static versions #3561
✅ Deploy Preview for pytorch-fbgemm-docs ready!
This pull request was exported from Phabricator. Differential Revision: D68004072
Force-pushed from 4208eac to 27d65f1.
Force-pushed from 27d65f1 to 379880d.
Summary:
Pull Request resolved: pytorch#3561
X-link: facebookresearch/FBGEMM#647
Refactor FP8 grouped GEMM with dynamic and static versions to unify CUTLASS and CK FP8 grouped GEMM in fbgemm
Reviewed By: jwfromm
Differential Revision: D68004072
Force-pushed from 379880d to fdf78d4.
Force-pushed from fdf78d4 to 05a1bdc.
Force-pushed from 05a1bdc to 66def49.
Force-pushed from 66def49 to fa9e2e5.
Force-pushed from fa9e2e5 to 154c9ad.
Summary:
Pull Request resolved: pytorch#3560
X-link: facebookresearch/FBGEMM#646
This diff adds FP8 grouped GEMM with rowwise scaling for MoE and replaces the existing tensorwise scaling with rowwise scaling, which gives better accuracy at similar performance.
Differential Revision: D67806685
Reviewed By: jwfromm
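To illustrate why rowwise scaling can be more accurate than tensorwise scaling in a grouped GEMM, here is a minimal pure-Python sketch. It is not the FBGEMM, CUTLASS, or CK implementation or API: every function name is hypothetical, and the quantizer is an assumption that rounds to an integer grid clamped at 448 (the largest finite FP8 E4M3 value) as a stand-in for a real FP8 cast. Each group runs an independent GEMM whose integer accumulator is rescaled by the product of the activation scale and the weight scale.

```python
QMAX = 448.0  # largest finite FP8 E4M3 value; used here only as the grid range


def quantize(values, scale):
    """Stand-in quantizer (assumption): divide by scale, round, clamp to +/-QMAX."""
    return [max(-QMAX, min(QMAX, round(v / scale))) for v in values]


def rowwise_quant(matrix):
    """One scale per row: scale_i = max|row_i| / QMAX."""
    scales = [(max(abs(v) for v in row) / QMAX) or 1.0 for row in matrix]
    return [quantize(row, s) for row, s in zip(matrix, scales)], scales


def tensorwise_quant(matrix):
    """One scale for the whole matrix, repeated for every row."""
    s = (max(abs(v) for row in matrix for v in row) / QMAX) or 1.0
    return [quantize(row, s) for row in matrix], [s] * len(matrix)


def scaled_gemm(xq, x_scales, wq, w_scales):
    """out[i][j] = (sum_k xq[i][k] * wq[j][k]) * x_scales[i] * w_scales[j]."""
    return [
        [sum(a * b for a, b in zip(x_row, w_row)) * xs * ws
         for w_row, ws in zip(wq, w_scales)]
        for x_row, xs in zip(xq, x_scales)
    ]


def grouped_gemm(xs, ws, quant):
    """Quantize each (x, w) group with `quant` and run one scaled GEMM per group."""
    outs = []
    for x, w in zip(xs, ws):
        xq, x_scales = quant(x)
        wq, w_scales = quant(w)
        outs.append(scaled_gemm(xq, x_scales, wq, w_scales))
    return outs


def mean_row_rel_error(outs, refs):
    """Average over all output rows of sum|out - ref| / sum|ref|."""
    errs = []
    for out, ref in zip(outs, refs):
        for o_row, r_row in zip(out, ref):
            num = sum(abs(o - r) for o, r in zip(o_row, r_row))
            errs.append(num / sum(abs(r) for r in r_row))
    return sum(errs) / len(errs)


# Two groups with different row counts; rows differ in magnitude by orders of
# magnitude, as token activations routed to different experts may.
base = [0.3, 0.5, 0.7, 1.0]
xs = [
    [[m * b for b in base] for m in (100.0, 1.0, 0.01)],
    [[m * b for b in base] for m in (50.0, 0.005)],
]
ws = [
    [[1.0, 0.8, 0.6, 0.4], [0.2, 0.4, 0.6, 0.8]],
    [[0.9, 0.7, 0.5, 0.3], [0.1, 0.3, 0.5, 0.7], [1.0, 1.0, 1.0, 1.0]],
]

# Unquantized reference: unit scales and no rounding.
refs = [scaled_gemm(x, [1.0] * len(x), w, [1.0] * len(w)) for x, w in zip(xs, ws)]

outs = grouped_gemm(xs, ws, rowwise_quant)
rowwise_err = mean_row_rel_error(outs, refs)
tensorwise_err = mean_row_rel_error(grouped_gemm(xs, ws, tensorwise_quant), refs)
print(f"rowwise error:    {rowwise_err:.4f}")
print(f"tensorwise error: {tensorwise_err:.4f}")
```

In this toy data the small-magnitude rows round to zero under a single per-tensor scale, while a per-row scale preserves them. A real FP8 format has a floating-point grid, so the gap on real kernels will differ from what this integer stand-in shows.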
Force-pushed from 154c9ad to 63811f5.
Force-pushed from 63811f5 to 28c52c1.
This pull request has been merged in 4957ca1.
Summary:
Pull Request resolved: pytorch#3561
X-link: https://github.com/facebookresearch/FBGEMM/pull/647
Refactor FP8 grouped GEMM with dynamic and static versions to unify CUTLASS and CK FP8 grouped GEMM in fbgemm
Reviewed By: jwfromm
Differential Revision: D68004072
fbshipit-source-id: eef892f77f8614b5d7235af8accc5b134c6bdad5
Summary:
X-link: pytorch#3561
Pull Request resolved: facebookresearch/FBGEMM#647
Refactor FP8 grouped GEMM with dynamic and static versions to unify CUTLASS and CK FP8 grouped GEMM in fbgemm
Reviewed By: jwfromm
Differential Revision: D68004072
fbshipit-source-id: eef892f77f8614b5d7235af8accc5b134c6bdad5
Summary: Refactor FP8 grouped GEMM with dynamic and static versions to unify CUTLASS and CK FP8 grouped GEMM in fbgemm
Differential Revision: D68004072