added diffusion-gan thoughts

https://github.com/NVlabs/denoising-diffusion-gan
This commit is contained in:
Kashif Rasul
2022-04-20 21:01:11 +02:00
committed by GitHub
parent faebf4c8b8
commit 1d8f37befe

View File

@@ -413,6 +413,7 @@ Offer training wrappers
- [x] be able to finely customize what to condition on (text, image embed) for specific unet in the cascade (super resolution ddpms near the end may not need too much conditioning)
- [x] offload unets not being trained on to CPU for memory efficiency (for training each resolution unets separately)
- [ ] build out latent diffusion architecture, make it completely optional (additional autoencoder + some regularizations [kl and vq regs]) (figure out if latent diffusion + cascading ddpm can be used in conjunction)
- [ ] Extend diffusion head to use diffusion-gan (potentially using lightweight-gan) to speed up inference
- [ ] become an expert with unets, cleanup unet code, make it fully configurable, port all learnings over to https://github.com/lucidrains/x-unet
- [ ] train on a toy task, offer in colab