mirror of
https://github.com/lucidrains/DALLE2-pytorch.git
synced 2026-01-04 01:04:20 +01:00
allow for decoder conditioning with the text encodings from CLIP, if it is passed in. use lazy linear to avoid researchers having to worry about text encoding dimensions, but remove later if it does not work well
This commit is contained in:
@@ -276,7 +276,7 @@ decoder = Decoder(
|
||||
cond_drop_prob = 0.2
|
||||
).cuda()
|
||||
|
||||
loss = decoder(images)
|
||||
loss = decoder(images) # this can optionally be decoder(images, text) if you wish to condition on the text encodings as well, though it was hinted in the paper it didn't do much
|
||||
loss.backward()
|
||||
|
||||
# do above for many steps
|
||||
|
||||
Reference in New Issue
Block a user