-
Notifications
You must be signed in to change notification settings - Fork 69
Flash attention for GPUs like in maxtext #149
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 34 commits
Commits
Show all changes
35 commits
Select commit
Hold shift + click to select a range
bc6cd42
add support for flux vae. ~ wip
jfacevedo-google 5f56257
test for flux vae both encoding and decoding.
jfacevedo-google c7829d1
add clip text encoder test
jfacevedo-google 572f20d
remove transformers inside maxdiffusion, add transformers dependency.…
jfacevedo-google ff04543
add double block to flux
jfacevedo-google 8a0ede4
forward pass for single double block.
jfacevedo-google 9fe42ba
trying to use scan.
jfacevedo-google 7e79e05
add single stream block
jfacevedo-google 6641dda
finish transformer
jfacevedo-google d37a278
convert pt weights to flax and load transformer state.
jfacevedo-google bb91e8e
apply fsdp sharding, do one forward pass in the transformer.
jfacevedo-google dfe1089
wip - generate fn
jfacevedo-google cbc7723
working loop, bad generation
jfacevedo-google ac14a4b
e2e, encoder offloading.
jfacevedo-google 1c8ed7b
support both dev and schnell loading. Images still incorrect.
jfacevedo-google c8196ed
flux schnell working
jfacevedo-google 1f1475d
removed unused code.
jfacevedo-google b49695a
support dev
jfacevedo-google 04377df
add sentencepiece requirement
jfacevedo-google f6c25e4
fix repeated double and single blocks.
jfacevedo-google ff24ee1
optimized flash block sizes for trillium.
jfacevedo-google 18250c5
clean up code and lint
jfacevedo-google 303e82a
fix sdxl generate smoke tests.
jfacevedo-google 5df1f3c
fix rest of unit tests.
jfacevedo-google 1ec459d
update readme and some dependencies.
entrpn 1f28cb5
remove unused dependencies.
entrpn ff16ba6
initial lora implementation for flux
jfacevedo-google 719e6db
adding another format lora support.
jfacevedo-google 91d7f5c
Support other format loras. update readme. Run code_style.
jfacevedo-google 1e01c67
ruff
jfacevedo-google 19e1b8a
fix typo in readme.
entrpn f05a7be
Added FA support for GPUs
ksikiric 3141d69
ruff and code_style
ksikiric 2072f7e
fixed final comments
ksikiric 56771dd
Correcting small misstake due to missunderstanding
ksikiric File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.