diff --git a/README.md b/README.md index 943573a..6917963 100644 --- a/README.md +++ b/README.md @@ -12,7 +12,7 @@ This model is SOTA for text-to-image for now. Please join Join us on Discord if you are interested in helping out with the replication -There was enough interest for a Jax version. It will be completed after the Pytorch version shows signs of life on my toy tasks. Placeholder repository. I will also eventually extend this to text to video, once the repository is in a good place. +There was enough interest for a Jax version. I will also eventually extend this to text to video, once the repository is in a good place. ## Install @@ -533,6 +533,7 @@ Once built, images will be saved to the same directory the command is invoked - [ ] spend one day cleaning up tech debt in decoder - [ ] become an expert with unets, cleanup unet code, make it fully configurable, port all learnings over to https://github.com/lucidrains/x-unet - [ ] copy the cascading ddpm code to a separate repo (perhaps https://github.com/lucidrains/denoising-diffusion-pytorch) as the main contribution of dalle2 really is just the prior network +- [ ] transcribe code to Jax, which lowers the activation energy for distributed training, given access to TPUs - [ ] train on a toy task, offer in colab - [ ] extend diffusion head to use diffusion-gan (potentially using lightweight-gan) to speed up inference - [ ] bring in tools to train vqgan-vae