We recommend starting with a single host first and then moving to multihost.
Local development is a convenient way to run MaxDiffusion on a single host. It doesn't scale to multiple hosts.
- You can find here here the list of zones that support the v6(Trillium) TPUs
- We recommend using the base VM image "v2-alpha-tpuv6e", which meets the version requirements: Ubuntu Version 22.04, Python 3.10 and Tensorflow >= 2.12.0
- Clone MaxDiffusion in your TPU VM.
git clone https://github.com/AI-Hypercomputer/maxdiffusion.git
cd maxdiffusion- Within the root directory of the MaxDiffusion
gitrepo, install dependencies by running:
bash setup.sh MODE=stable DEVICE=tpu[GKE, recommended] Running MaxDiffusion with xpk - Quick Experimentation and Production support