Releases: Dao-AILab/flash-attention
v2.3.2
Bump to v2.3.2
v2.3.1.post1
[CI] Use official PyTorch 2.1, add CUDA 11.8 for PyTorch 2.1
v2.3.1
Bump to v2.3.1
v2.3.0
Bump to v2.3.0
v2.2.5
Bump to v2.2.5
v2.2.4.post1
Re-enable compilation for Hopper
v2.2.4
Bump to v2.2.4
v2.2.3.post2
Don't compile for PyTorch 2.1 on CUDA 12.1 due to nvcc segfaults
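For context, a hypothetical sketch of the kind of version gate this entry describes. The `torch.__version__` and `torch.version.cuda` attributes are real, but the function and thresholds below are illustrative and not the project's actual build script:

```python
# Hypothetical sketch (not flash-attention's actual setup.py) of gating
# compilation on the PyTorch/CUDA combination mentioned in this release.
import torch
from packaging.version import parse


def should_skip_prebuild() -> bool:
    """Return True for the PyTorch 2.1 + CUDA 12.1 combo that segfaulted nvcc."""
    torch_version = parse(torch.__version__.split("+")[0])
    cuda = torch.version.cuda  # e.g. "12.1" for a CUDA 12.1 build, None for CPU-only
    if cuda is None:
        return False
    cuda_version = parse(cuda)
    return (
        parse("2.1.0") <= torch_version < parse("2.2.0")
        and parse("12.1") <= cuda_version < parse("12.2")
    )


if should_skip_prebuild():
    print("Skipping CUDA extension build for PyTorch 2.1 + CUDA 12.1")
```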
v2.2.3.post1
Set block size to 64 x 64 for kvcache to avoid nvcc segfaults
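The 64 x 64 block size is internal to the kvcache CUDA kernel; the Python entry point it backs is unchanged. A minimal usage sketch, assuming the `flash_attn_with_kvcache` signature described in the project README (shapes, dtypes, and cache lengths are illustrative):

```python
# Minimal sketch of the kvcache path this fix concerns, assuming the
# flash_attn_with_kvcache API from the project README.
import torch
from flash_attn import flash_attn_with_kvcache

batch, nheads, headdim = 2, 16, 64
cache_len, new_len = 512, 1  # decoding one new token per step

q = torch.randn(batch, new_len, nheads, headdim, dtype=torch.float16, device="cuda")
k_cache = torch.zeros(batch, cache_len, nheads, headdim, dtype=torch.float16, device="cuda")
v_cache = torch.zeros(batch, cache_len, nheads, headdim, dtype=torch.float16, device="cuda")
k_new = torch.randn(batch, new_len, nheads, headdim, dtype=torch.float16, device="cuda")
v_new = torch.randn(batch, new_len, nheads, headdim, dtype=torch.float16, device="cuda")

# Number of valid tokens already in the cache for each batch element.
cache_seqlens = torch.full((batch,), 128, dtype=torch.int32, device="cuda")

# k_new/v_new are appended to the cache in place at position cache_seqlens.
out = flash_attn_with_kvcache(q, k_cache, v_cache, k=k_new, v=v_new,
                              cache_seqlens=cache_seqlens, causal=True)
```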
v2.2.3
Bump to v2.2.3