From 1e939153fb3de88f1276bf26cf21050a7b67760c Mon Sep 17 00:00:00 2001 From: Phil Wang Date: Fri, 15 Apr 2022 12:58:57 -0700 Subject: [PATCH] link to AssemblyAI explanation --- README.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 0449fdb..df31c77 100644 --- a/README.md +++ b/README.md @@ -2,7 +2,9 @@ ## DALL-E 2 - Pytorch (wip) -Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. Yannic Kilcher summary +Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. + +Yannic Kilcher summary | AssemblyAI explainer The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network 😂)