From 40140b54d62c0db831f149130736ed4264fc458e Mon Sep 17 00:00:00 2001 From: Phil Wang Date: Tue, 12 Apr 2022 17:51:23 -0700 Subject: [PATCH] put on project manager hat --- README.md | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/README.md b/README.md index e9bed10..37247ea 100644 --- a/README.md +++ b/README.md @@ -36,6 +36,15 @@ Once built, images will be saved to the same directory the command is invoked Todo +## Todo + +- [ ] finish off gaussian diffusion class for latent embedding - allow for both prediction of epsilon as well as directly predicting embedding +- [ ] make sure it works end to end +- [ ] augment unet so that it can also be conditioned on text encodings (although in paper they hinted this didn't make much a difference) +- [ ] look into Jonathan Ho's cascading DDPM for the decoder, as that seems to be what they are using. get caught up on DDPM literature +- [ ] figure out all the current bag of tricks needed to make DDPMs great (starting with the blur trick mentioned in paper) +- [ ] train on a toy task, offer in colab + ## Citations ```bibtex