mirror of
https://github.com/lucidrains/DALLE2-pytorch.git
synced 2025-12-24 03:54:19 +01:00
885 B
885 B
DALL-E 2 - Pytorch (wip)
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
This is SOTA for text-to-image now, but probably not for long.
It may also explore an extension of using latent diffusion in the decoder
Citations
@misc{ramesh2022,
title = {Hierarchical Text-Conditional Image Generation with CLIP Latents},
author = {Aditya Ramesh et al},
year = {2022}
}
@misc{rombach2021highresolution,
title = {High-Resolution Image Synthesis with Latent Diffusion Models},
author = {Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer},
year = {2021},
eprint = {2112.10752},
archivePrefix = {arXiv},
primaryClass = {cs.CV}
}
