mirror of
https://github.com/lucidrains/DALLE2-pytorch.git
synced 2026-01-04 01:04:20 +01:00
<img src="./dalle2.png" width="450px"></img>

## DALL-E 2 - Pytorch (wip)

Implementation of <a href="https://openai.com/dall-e-2/">DALL-E 2</a>, OpenAI's updated text-to-image synthesis neural network, in PyTorch.

At the time of writing, this is the state of the art for text-to-image synthesis, though probably not for long.

This repository may also explore an extension that uses latent diffusion in the decoder.

## Citations

```bibtex
@misc{ramesh2022,
    title   = {Hierarchical Text-Conditional Image Generation with CLIP Latents},
    author  = {Aditya Ramesh et al},
    year    = {2022}
}
```

```bibtex
@misc{rombach2021highresolution,
    title   = {High-Resolution Image Synthesis with Latent Diffusion Models},
    author  = {Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer},
    year    = {2021},
    eprint  = {2112.10752},
    archivePrefix = {arXiv},
    primaryClass = {cs.CV}
}
```