Skip to content

Use independent rope method from attention_flax in flux transformer#176

Merged
entrpn merged 1 commit intomainfrom
kunjanp-dev-2
May 19, 2025
Merged

Use independent rope method from attention_flax in flux transformer#176
entrpn merged 1 commit intomainfrom
kunjanp-dev-2

Conversation

@coolkp
Copy link
Copy Markdown
Collaborator

@coolkp coolkp commented May 19, 2025

  1. Use independent rope method from attention_flax in flux transformer
  2. train_utils.py
    remove with jax.spmd_mode("allow_all"):
    its a noop now since enhanced barrier is switched on in JAX
    maxtext reference change
    jax

Signed-off-by: Kunjan patel <kunjanp@google.com>
@coolkp coolkp requested a review from entrpn May 19, 2025 15:58
@entrpn entrpn merged commit 3d43b89 into main May 19, 2025
2 of 3 checks passed
@coolkp coolkp deleted the kunjanp-dev-2 branch May 19, 2025 16:01
hx89 pushed a commit to hx89/maxdiffusion that referenced this pull request May 28, 2025
Signed-off-by: Kunjan patel <kunjanp@google.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants