
Pull requests: Dao-AILab/flash-attention


Add support Orin, Thor, Spark and GB300
#1829 opened Aug 20, 2025 by johnnynunez
Add sorting and head swizzle to varlen scheduler
#1823 opened Aug 19, 2025 by jayhshah
feat: Implement Sink Attention
#1819 opened Aug 18, 2025 by aoxy
[skip_ci] ABI stable fa3
#1791 opened Jul 31, 2025 by mikaylagawarecki (Draft, 2 tasks done)
feat: blocksparse support
#1784 opened Jul 30, 2025 by guangyunh-nv (Draft)
[CI] build upon manylinux, improve compatibility
#1780 opened Jul 29, 2025 by zipzou
Fixes incorrect variable reference in comment
#1775 opened Jul 25, 2025 by LoserCheems
Change the update method of the sub-module
#1774 opened Jul 25, 2025 by RealTapeL
add var_len case for benchmark_mla_decode
#1770 opened Jul 22, 2025 by XiaobingSuper
[AMD] Torch Compile Issues
#1756 opened Jul 15, 2025 by micmelesse
Suppress warnings in windows compilation
#1748 opened Jul 10, 2025 by XXXXRT666
Theoretically make compiling from pip quicker
#1703 opened Jun 8, 2025 by whrit
fix: fa3 backward check qkv with qkv_scale and dqkv
#1686 opened May 29, 2025 by yuyu5333
Fix/deterministic dk dv
#1678 opened May 26, 2025 by yuWeiCute
Fix a bug in flash_attn_triton.py
#1668 opened May 15, 2025 by AminDarabi
Fix typos in multiple files
#1655 opened May 8, 2025 by co63oc