diff --git a/README.md b/README.md index 78eb68f..f497fbb 100644 --- a/README.md +++ b/README.md @@ -864,7 +864,7 @@ Once built, images will be saved to the same directory the command is invoked - [x] add convnext backbone for vqgan-vae (in addition to vit [vit-vqgan] + resnet) - [x] make sure DDPMs can be run with traditional resnet blocks (but leave convnext as an option for experimentation) - [x] make sure for the latter unets in the cascade, one can train on crops for learning super resolution (constrain the unet to be only convolutions in that case, or allow conv-like attention with rel pos bias) -- [ ] become an expert with unets, cleanup unet code, make it fully configurable, port all learnings over to https://github.com/lucidrains/x-unet (test out unet² in ddpm repo) +- [ ] become an expert with unets, cleanup unet code, make it fully configurable, port all learnings over to https://github.com/lucidrains/x-unet (test out unet² in ddpm repo) - consider https://github.com/lucidrains/uformer-pytorch attention-based unet - [ ] copy the cascading ddpm code to a separate repo (perhaps https://github.com/lucidrains/denoising-diffusion-pytorch) as the main contribution of dalle2 really is just the prior network - [ ] transcribe code to Jax, which lowers the activation energy for distributed training, given access to TPUs - [ ] pull logic for training diffusion prior into a class DiffusionPriorTrainer, for eventual script based + CLI based training @@ -877,7 +877,7 @@ Once built, images will be saved to the same directory the command is invoked - [ ] use an experimental tracker agnostic setup, as done here - [ ] interface out the vqgan-vae so a pretrained one can be pulled off the shelf to validate latent diffusion + DALL-E2 - [ ] make sure FILIP works with DALL-E2 from x-clip https://arxiv.org/abs/2111.07783 -- [ ] make sure resnet | convnext block hyperparameters can be configurable across unet depth (groups and expansion factor) +- [ ] make sure resnet hyperparameters can be configurable across unet depth (groups and expansion factor) ## Citations