
Releases: nshepperd/flash_attn_jax

v0.2.2 (24 May 10:57)
Build cuda12 version with Hopper support.

v0.2.1 (21 May 12:29)
Bump minor version to 0.2.1.

v0.2.0 (01 May 23:46)
Expanded vmap support for flash_mha. Vmapping q but not k,v reduces to a grouped-query attention, which we now support.
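
A minimal sketch (assumptions, not part of the release notes) of the pattern this note describes: vmapping flash_mha over an extra query axis while broadcasting k and v, which reduces to grouped-query attention. The [batch, seqlen, heads, head_dim] layout and the positional flash_mha call follow the library's README; the sizes are hypothetical.

```python
import jax
import jax.numpy as jnp
from flash_attn_jax import flash_mha

# Hypothetical sizes: each of the 4 kv heads serves 2 query heads (a group of 2).
group, batch, seqlen, kv_heads, head_dim = 2, 4, 128, 4, 64

# q carries an extra leading "group" axis; k and v do not.
q = jnp.zeros([group, batch, seqlen, kv_heads, head_dim], dtype=jnp.float16)
k = jnp.zeros([batch, seqlen, kv_heads, head_dim], dtype=jnp.float16)
v = jnp.zeros([batch, seqlen, kv_heads, head_dim], dtype=jnp.float16)

# Map over the group axis of q only (in_axes=None broadcasts k and v).
# As of v0.2.0 this vmap is supported and corresponds to grouped-query attention.
gqa = jax.vmap(flash_mha, in_axes=(0, None, None))
out = gqa(q, k, v)  # shape [group, batch, seqlen, kv_heads, head_dim]
```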

v0.1.0a3 (18 Mar 14:24)
Try cibuildwheel.

v0.1.0a2 (18 Mar 14:22)
Try cibuildwheel.

v0.1.0a1 (17 Mar 14:13)
Set CUDA_HOME for the sdist.

v0.1.0 (17 Mar 12:16)
Try release with GitHub Actions and rebased repo.

v2.5.5a2 (28 Feb 16:52)
Implement ring attention backward pass. More tests.

v2.5.5a1 (27 Feb 07:09)
Merge up to v2.5.5.

v2.5.0a4 (21 Feb 00:12)
Implement custom sharding for flash_mha, to allow efficient multi-GPU computation when sharded across batch or head dimensions.
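
A minimal sketch (assumptions, not taken from the release) of what this enables: calling flash_mha under jit with q, k, v sharded across the batch dimension on several GPUs, so each device attends over its own batch shard instead of gathering everything onto one device. The tensor layout follows the library's README; the mesh axis name and sizes are hypothetical.

```python
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P
from flash_attn_jax import flash_mha

mesh = Mesh(jax.devices(), axis_names=("data",))   # one mesh axis over all local GPUs
batch_sharding = NamedSharding(mesh, P("data"))    # shard the leading (batch) dimension

# Hypothetical sizes; batch should divide evenly across the devices.
batch, seqlen, heads, head_dim = 8, 1024, 16, 64
q = jax.device_put(jnp.zeros([batch, seqlen, heads, head_dim], jnp.float16), batch_sharding)
k = jax.device_put(jnp.zeros([batch, seqlen, heads, head_dim], jnp.float16), batch_sharding)
v = jax.device_put(jnp.zeros([batch, seqlen, heads, head_dim], jnp.float16), batch_sharding)

# With the custom sharding rule, each GPU runs flash attention on its own shard;
# batch- or head-sharded inputs need no cross-device gather.
out = jax.jit(flash_mha)(q, k, v)
```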