mirror of
https://github.com/lucidrains/DALLE2-pytorch.git
synced 2026-02-14 07:34:22 +01:00
Compare commits
3 Commits
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
4a4c7ac9e6 | ||
|
|
fad7481479 | ||
|
|
123658d082 |
10
README.md
10
README.md
@@ -1065,6 +1065,7 @@ Once built, images will be saved to the same directory the command is invoked
|
|||||||
- [ ] allow for unet to be able to condition non-cross attention style as well
|
- [ ] allow for unet to be able to condition non-cross attention style as well
|
||||||
- [ ] for all model classes with hyperparameters that changes the network architecture, make it requirement that they must expose a config property, and write a simple function that asserts that it restores the object correctly
|
- [ ] for all model classes with hyperparameters that changes the network architecture, make it requirement that they must expose a config property, and write a simple function that asserts that it restores the object correctly
|
||||||
- [ ] for both diffusion prior and decoder, all exponential moving averaged models needs to be saved and restored as well (as well as the step number)
|
- [ ] for both diffusion prior and decoder, all exponential moving averaged models needs to be saved and restored as well (as well as the step number)
|
||||||
|
- [ ] read the paper, figure it out, and build it https://github.com/lucidrains/DALLE2-pytorch/issues/89
|
||||||
|
|
||||||
## Citations
|
## Citations
|
||||||
|
|
||||||
@@ -1153,4 +1154,13 @@ Once built, images will be saved to the same directory the command is invoked
|
|||||||
}
|
}
|
||||||
```
|
```
|
||||||
|
|
||||||
|
```bibtex
|
||||||
|
@article{ho2021cascaded,
|
||||||
|
title = {Cascaded Diffusion Models for High Fidelity Image Generation},
|
||||||
|
author = {Ho, Jonathan and Saharia, Chitwan and Chan, William and Fleet, David J and Norouzi, Mohammad and Salimans, Tim},
|
||||||
|
journal = {arXiv preprint arXiv:2106.15282},
|
||||||
|
year = {2021}
|
||||||
|
}
|
||||||
|
```
|
||||||
|
|
||||||
*Creating noise from data is easy; creating data from noise is generative modeling.* - <a href="https://arxiv.org/abs/2011.13456">Yang Song's paper</a>
|
*Creating noise from data is easy; creating data from noise is generative modeling.* - <a href="https://arxiv.org/abs/2011.13456">Yang Song's paper</a>
|
||||||
|
|||||||
@@ -794,7 +794,7 @@ class DiffusionPriorNetwork(nn.Module):
|
|||||||
text_embed,
|
text_embed,
|
||||||
text_encodings = None,
|
text_encodings = None,
|
||||||
mask = None,
|
mask = None,
|
||||||
cond_drop_prob = 0.2
|
cond_drop_prob = 0.
|
||||||
):
|
):
|
||||||
batch, dim, device, dtype = *image_embed.shape, image_embed.device, image_embed.dtype
|
batch, dim, device, dtype = *image_embed.shape, image_embed.device, image_embed.dtype
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user