
Commit ff537a5

updated with ring stats
1 parent 18cc287 commit ff537a5

1 file changed

Lines changed: 18 additions & 4 deletions

README.md

@@ -572,8 +572,6 @@ To generate images, run the following command:
572572
* For Wan2.2 T2V, use `base_wan_27b.yml`.
573573
* For Wan2.2 I2V, use `base_wan_i2v_27b.yml`.
574574

575-
<<<<<<< HEAD
576-
=======
577575
### Caching Mechanisms
578576

579577
Wan 2.x pipelines support several caching strategies to accelerate inference by skipping redundant transformer forward passes. These are **mutually exclusive** — enable only one at a time.
@@ -599,7 +597,24 @@ To generate images, run the following command:
599597
...
600598
```
601599

602-
>>>>>>> origin/main
600+
### Ring Attention
601+
We added ring attention support for Wan models. Below are the stats for one `720p` (81 frames) video generation (with CFG DP):
602+
| Accelerator | Model | Attention Type | Inference Steps | Sharding | e2e Generation Time |
603+
| -- | -- | -- | -- | -- | -- |
604+
| v7x-8 | WAN 2.1 | Tokamax Flash | 50 | dp2-fsdp1-context4-tp1 | 264.2 |
605+
| v7x-8 | WAN 2.1 | Tokamax Ring | 50 | dp2-fsdp1-context4-tp1 | **252.4** |
606+
| v7x-8 | WAN 2.2 | Tokamax Flash | 40 | dp2-fsdp1-context4-tp1 | 212.7 |
607+
| v7x-8 | WAN 2.2 | Tokamax Ring | 40 | dp2-fsdp1-context4-tp1 | **201.7** |
608+
609+
| Accelerator | Model | Attention Type | Inference Steps | Sharding | e2e Generation Time |
610+
| -- | -- | -- | -- | -- | -- |
611+
| v7x-16 | WAN 2.1 | Tokamax Flash | 50 | dp2-fsdp1-context8-tp1 | 146.6 |
612+
| v7x-16 | WAN 2.1 | Tokamax Ring | 50 | dp2-fsdp1-context8-tp1 | **137.2** |
613+
| v7x-16 | WAN 2.2 | Tokamax Flash | 40 | dp2-fsdp1-context8-tp1 | **117.8** |
614+
| v7x-16 | WAN 2.2 | Tokamax Ring | 40 | dp2-fsdp1-context8-tp1 | 137.5 |
615+
616+
(* There are known stability issues with ring attention on 16 TPUs; please use `tokamax_flash` attention there instead.)
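
To make the Flash vs. Ring comparison easier to read, the following is a minimal sketch that computes the relative change in e2e generation time from the figures in the two tables above. The names `TIMES`, `ring_change_pct`, and `summarize` are hypothetical helpers for illustration and are not part of this repository; the tables do not state a time unit, so only unitless percentages are computed.

```python
# Illustrative only: values copied from the ring attention tables above.
# (accelerator, model) -> (Tokamax Flash time, Tokamax Ring time)
TIMES = {
    ("v7x-8", "WAN 2.1"): (264.2, 252.4),
    ("v7x-8", "WAN 2.2"): (212.7, 201.7),
    ("v7x-16", "WAN 2.1"): (146.6, 137.2),
    ("v7x-16", "WAN 2.2"): (117.8, 137.5),
}


def ring_change_pct(flash: float, ring: float) -> float:
    """Percent reduction in e2e time when switching from Flash to Ring.

    Positive means Ring is faster; negative means Ring is slower.
    """
    return 100.0 * (flash - ring) / flash


def summarize(times=TIMES):
    """Return {(accelerator, model): percent change rounded to one decimal}."""
    return {key: round(ring_change_pct(f, r), 1) for key, (f, r) in times.items()}


if __name__ == "__main__":
    for (accelerator, model), pct in summarize().items():
        direction = "faster" if pct > 0 else "slower"
        print(f"{accelerator} {model}: Ring is {abs(pct)}% {direction} than Flash")
```

With these numbers, Ring comes out ahead in three of the four configurations, and the v7x-16 / WAN 2.2 row is the one case where it is slower, which matches the note about 16-TPU runs.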
617+
603618
## Flux
604619

605620
First make sure you have permissions to access the Flux repos in Huggingface.
@@ -775,4 +790,3 @@ This script will automatically format your code with `pyink` and help you identi
775790
776791
777792
The full suite of end-to-end tests is in `tests` and `src/maxdiffusion/tests`. We run them on a nightly cadence.
778-
