todo

2026-02-06 16:54:20 +01:00 · 2022-04-23 08:23:08 -07:00
parent f82917e1fd
commit d5318aef4f
1 changed file with 1 addition and 11 deletions
--- a/README.md
+++ b/README.md
@@ -536,6 +536,7 @@ Once built, images will be saved to the same directory the command is invoked
 - [ ] extend diffusion head to use diffusion-gan (potentially using lightweight-gan) to speed up inference
 - [ ] bring in tools to train vqgan-vae
 - [ ] for decoder, allow ability to customize objective (predict epsilon vs x0), in case latent diffusion does better with prediction of x0
+- [ ] bring in vit-vqgan https://arxiv.org/abs/2110.04627 for the latent diffusion

 ## Citations

@@ -573,17 +574,6 @@ Once built, images will be saved to the same directory the command is invoked
 }
 ```

-```bibtex
-@misc{zhang2019root,
-    title   = {Root Mean Square Layer Normalization},
-    author  = {Biao Zhang and Rico Sennrich},
-    year    = {2019},
-    eprint  = {1910.07467},
-    archivePrefix = {arXiv},
-    primaryClass = {cs.LG}
-}
-```
-
 ```bibtex
 @inproceedings{Tu2022MaxViTMV,
     title   = {MaxViT: Multi-Axis Vision Transformer},