Skip to content

FA4 local (_flash_4 mode)#13373

Open
christopher5106 wants to merge 1 commit intohuggingface:mainfrom
scenario-labs:fa4
Open

FA4 local (_flash_4 mode)#13373
christopher5106 wants to merge 1 commit intohuggingface:mainfrom
scenario-labs:fa4

Conversation

@christopher5106
Copy link
Copy Markdown
Contributor

This PR adds local flash attention 4. Package required: flash-attn-4 (flash_attn.cute)

Note: flash_4_hub already in the code but points to kernels-staging/flash-attn4 not publicly available. When it will be available, flash_4_hub will work too

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant