
Support dim != 1 for softmax w/o using permute #845

Closed · wants to merge 1 commit

Conversation

@int3 (Contributor) commented on Jul 25, 2023

Summary:
This is a port of PyTorch's softmax implementation.

Notable differences:

  • We use fast_exp and fast_max instead of std::exp and std::max
  • We don't use higher-precision types for accumulator values (the existing dim=-1 softmax code doesn't appear to do this either)

This is probably why we are (very marginally) faster than PyTorch.

I have named this new softmax implementation "softmaxGeneral" since it can handle arbitrary reduction dimensions, even though we are currently only using it for the `dim > 1` case. (A plain-Python sketch of the indexing follows below.)

Differential Revision: D47732875
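
To make the indexing concrete, here is a minimal plain-Python sketch of the reduction a general-dim softmax performs over a row-major flat buffer. This is illustrative only: the actual implementation is a generated CUDA kernel, and `max`/`math.exp` stand in for `fast_max`/`fast_exp`.

```python
# Minimal plain-Python sketch of a general-dim softmax (illustrative only;
# the actual implementation is a CUDA kernel). Elements along `dim` sit
# inner_size apart in a row-major flat buffer, so no permute is needed.
import math

def softmax_general(x, shape, dim):
    outer_size = math.prod(shape[:dim])      # product of dims before `dim`
    dim_size = shape[dim]                    # size of the reduction dim
    inner_size = math.prod(shape[dim + 1:])  # product of dims after `dim`
    out = [0.0] * len(x)
    for outer in range(outer_size):
        for inner in range(inner_size):
            base = outer * dim_size * inner_size + inner
            idx = [base + d * inner_size for d in range(dim_size)]
            m = max(x[i] for i in idx)                # stand-in for fast_max
            exps = [math.exp(x[i] - m) for i in idx]  # stand-in for fast_exp
            s = sum(exps)
            for i, e in zip(idx, exps):
                out[i] = e / s
    return out
```

For example, with shape (2, 3, 4) and dim=1, each reduction reads three elements spaced four apart; a real kernel would assign the (outer, inner) pairs to threads rather than loop over them serially.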

facebook-github-bot added the CLA Signed and fb-exported labels on Jul 25, 2023
@facebook-github-bot commented

This pull request was exported from Phabricator. Differential Revision: D47732875


int3 added a commit to int3/AITemplate that referenced this pull request Jul 28, 2023
Summary:
Pull Request resolved: facebookincubator#845

This is a port of PyTorch's softmax implementation.

Notable differences:
* We use fast_exp and fast_max instead of std::exp and std::max
* We don't use higher-precision types for accumulator values (the existing dim=-1 softmax code doesn't appear to do this either)
* We propagate the reduction dim size and inner size as compile-time constants (see the sketch after this commit message)

We seem to be very marginally slower than PyTorch for small batch sizes and very marginally faster for large ones.

I have named this new softmax implementation "softmaxGeneral" since it can handle arbitrary reduction dimensions, even though we are currently only using it for the `dim > 1` case.

Differential Revision: D47732875

fbshipit-source-id: 5118fed5cf6457bd9d27f553245c3b9985403f78
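
On the last bullet, here is a hedged Python approximation of what propagating the sizes as constants buys. The real mechanism is AITemplate's codegen baking dim_size and inner_size into the generated CUDA source; the closure below only mimics that.

```python
# Approximation of propagating sizes as constants (the real mechanism is
# codegen baking them into the CUDA source): fixing dim_size and
# inner_size up front gives the compiler known trip counts, so the
# reduction loops can be fully unrolled in the generated kernel.
def make_strided_indexer(dim_size: int, inner_size: int):
    def indices(base: int):
        # Elements along the reduction dim are inner_size apart.
        return [base + d * inner_size for d in range(dim_size)]
    return indices

# Shape (N, 3, 4) reduced over dim=1 -> dim_size=3, inner_size=4:
# the (outer=0, inner=0) reduction touches flat indices [0, 4, 8].
idx = make_strided_indexer(3, 4)
assert idx(0) == [0, 4, 8]
```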



@facebook-github-bot commented

This pull request has been merged in 318111f.

facebook-github-bot pushed a commit that referenced this pull request Aug 11, 2023
Summary:
Now that #845 has landed, the backend supports softmax with `dim != -1` directly, so the fx converter no longer needs the permute-based workaround from #395 (sketched below).

Reviewed By: chenyang78

Differential Revision: D48248330

fbshipit-source-id: 78b26f81b85c69bd01a59c6db0a04e6c755127e5
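
For context, here is a hypothetical NumPy sketch of the permute-based workaround the converter can now drop. The names are illustrative; the real converter emits AITemplate ops, not NumPy calls.

```python
# Hypothetical NumPy sketch of the old fx-converter workaround (#395) and
# the equivalence it relied on; the real converter emits AITemplate ops.
import numpy as np

def softmax_last_dim(x):
    # Stand-in for the backend's existing dim=-1 softmax.
    m = x.max(axis=-1, keepdims=True)
    e = np.exp(x - m)
    return e / e.sum(axis=-1, keepdims=True)

def softmax_via_permute(x, dim):
    # Old workaround: move `dim` last, reduce, then move it back.
    perm = [d for d in range(x.ndim) if d != dim] + [dim]
    inv = np.argsort(perm)
    return softmax_last_dim(x.transpose(perm)).transpose(inv)

def softmax_direct(x, dim):
    # What the backend now does: reduce over `dim` in place, no permute.
    m = x.max(axis=dim, keepdims=True)
    e = np.exp(x - m)
    return e / e.sum(axis=dim, keepdims=True)

x = np.random.rand(2, 3, 4).astype(np.float32)
assert np.allclose(softmax_via_permute(x, 1), softmax_direct(x, 1))
```

The two permutes are exactly the overhead that handling `dim` directly avoids.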