Phil Wang
|
fbba0f9aaf
|
bring in prediction of v objective, combining the findings from progressive distillation paper and imagen-video to the eventual extension of dalle2 to make-a-video
|
2022-10-28 18:21:07 -07:00 |
|
Phil Wang
|
b39653cf96
|
fix readme dataloader example
|
2022-09-20 08:39:52 -07:00 |
|
Phil Wang
|
39f8b6cf16
|
show example of using SOTA open sourced open clip
|
2022-09-19 10:45:20 -07:00 |
|
Phil Wang
|
22019fddeb
|
todo
|
2022-08-31 13:36:05 -07:00 |
|
Phil Wang
|
1cc5d0afa7
|
upgrade to best downsample
|
2022-08-25 10:37:02 -07:00 |
|
Phil Wang
|
de5e628773
|
cite einops
|
2022-08-17 08:58:41 -07:00 |
|
Phil Wang
|
1b4046b039
|
gratitude
|
2022-08-17 08:57:33 -07:00 |
|
Phil Wang
|
44e09d5a4d
|
add weight standardization behind feature flag, which may potentially work well with group norm
|
2022-08-14 11:34:45 -07:00 |
|
Phil Wang
|
7c5477b26d
|
bet on the new self-conditioning technique out of geoffrey hintons group
|
2022-08-12 11:36:08 -07:00 |
|
Phil Wang
|
f22e8c8741
|
make open clip available for use with dalle2 pytorch
|
2022-07-30 09:02:31 -07:00 |
|
Phil Wang
|
80046334ad
|
make sure entire readme runs without errors
|
2022-07-28 10:17:43 -07:00 |
|
Phil Wang
|
36fb46a95e
|
fix readme and a small bug in DALLE2 class
|
2022-07-28 08:33:51 -07:00 |
|
Phil Wang
|
2e35a9967d
|
product management
|
2022-07-26 11:10:16 -07:00 |
|
Phil Wang
|
406e75043f
|
add upsample combiner feature for the unets
|
2022-07-26 10:46:04 -07:00 |
|
Phil Wang
|
7f120a8b56
|
cleanup, CLI no longer necessary since Zion + Aidan have https://github.com/LAION-AI/dalle2-laion and colab notebook going
|
2022-07-19 09:47:44 -07:00 |
|
Phil Wang
|
8c003ab1e1
|
readme and citation
|
2022-07-19 09:36:45 -07:00 |
|
Phil Wang
|
723bf0abba
|
complete inpainting ability using inpaint_image and inpaint_mask passed into sample function for decoder
|
2022-07-19 09:26:55 -07:00 |
|
Phil Wang
|
c7fe4f2f44
|
project management
|
2022-07-17 17:27:44 -07:00 |
|
Phil Wang
|
e76e89f9eb
|
remove text masking altogether in favor of deriving from text encodings (padded text encodings must be pad value of 0.)
|
2022-07-12 15:40:31 -07:00 |
|
Phil Wang
|
e055793e5d
|
shoutout for @MalumaDev
|
2022-07-11 16:12:35 -07:00 |
|
Phil Wang
|
4173e88121
|
more accurate readme
|
2022-07-09 20:57:26 -07:00 |
|
Phil Wang
|
b7e22f7da0
|
complete ddim integration of diffusion prior as well as decoder for each unet, feature complete for https://github.com/lucidrains/DALLE2-pytorch/issues/157
|
2022-07-09 17:25:34 -07:00 |
|
Phil Wang
|
46be8c32d3
|
fix a potential issue in the low resolution conditioner, when downsampling and then upsampling using resize right, thanks to @marunine
|
2022-07-07 09:41:49 -07:00 |
|
Phil Wang
|
900f086a6d
|
fix condition_on_text_encodings in dalle2 orchestrator class, fix readme
|
2022-07-07 07:43:41 -07:00 |
|
zion
|
b3e646fd3b
|
add readme for prior (#159)
* add readme for prior
* offload prior info in main readme
* typos
|
2022-07-06 20:50:52 -07:00 |
|
Phil Wang
|
6d477d7654
|
link to dalle2 laion
|
2022-07-05 11:43:07 -07:00 |
|
Phil Wang
|
531fe4b62f
|
status
|
2022-07-05 10:46:55 -07:00 |
|
Phil Wang
|
a922a539de
|
bring back convtranspose2d upsampling, allow for nearest upsample with hyperparam, change kernel size of last conv to 1, make configurable, cleanup
|
2022-07-01 09:21:47 -07:00 |
|
Phil Wang
|
6a11b9678b
|
bring in the skip connection scaling factor, used by imagen in their unets, cite original paper using it
|
2022-06-26 21:59:55 -07:00 |
|
Phil Wang
|
b90364695d
|
fix remaining issues with deriving cond_on_text_encodings from child unet settings
|
2022-06-26 21:07:42 -07:00 |
|
Phil Wang
|
a5b9fd6ca8
|
product management
|
2022-06-24 08:15:05 -07:00 |
|
Phil Wang
|
56883910fb
|
cleanup
|
2022-06-20 11:14:55 -07:00 |
|
Phil Wang
|
893f270012
|
project management
|
2022-06-20 10:00:22 -07:00 |
|
Phil Wang
|
0215237fc6
|
update status
|
2022-06-19 09:42:24 -07:00 |
|
Phil Wang
|
41ca896413
|
depend on huggingface accelerate, move appreciation thread up for visibility
|
2022-06-19 08:50:35 -07:00 |
|
Phil Wang
|
9eea9b9862
|
add p2 loss reweighting for decoder training as an option
|
2022-06-14 10:58:57 -07:00 |
|
Ryan Russell
|
1cc288af39
|
Improve Readability (#133)
Signed-off-by: Ryan Russell <git@ryanrussell.org>
|
2022-06-01 13:28:02 -07:00 |
|
Phil Wang
|
f12a7589c5
|
commit to trying out grid attention
|
2022-05-26 12:56:10 -07:00 |
|
Phil Wang
|
645e207441
|
credit assignment
|
2022-05-26 08:16:03 -07:00 |
|
Phil Wang
|
00743b3a0b
|
update
|
2022-05-26 08:12:25 -07:00 |
|
Phil Wang
|
01589aff6a
|
cite maxvit properly
|
2022-05-26 07:12:25 -07:00 |
|
Phil Wang
|
8864fd0aa7
|
bring in the dynamic thresholding technique from the Imagen paper, which purportedly improves classifier free guidance for the cascading ddpm
|
2022-05-24 18:15:14 -07:00 |
|
Phil Wang
|
72bf159331
|
update
|
2022-05-24 08:25:40 -07:00 |
|
Phil Wang
|
e5e47cfecb
|
link to aidan's test run
|
2022-05-23 12:41:46 -07:00 |
|
Phil Wang
|
4d346e98d9
|
allow for config driven creation of clip-less diffusion prior
|
2022-05-22 20:36:20 -07:00 |
|
Phil Wang
|
2b1fd1ad2e
|
product management
|
2022-05-22 19:23:40 -07:00 |
|
zion
|
82a2ef37d9
|
Update README.md (#109)
block in a section that links to available pre-trained models for those who are interested
|
2022-05-22 19:22:30 -07:00 |
|
Phil Wang
|
4e49373fc5
|
project management
|
2022-05-22 15:27:40 -07:00 |
|
Phil Wang
|
e527002472
|
take care of saving and loading functions on the diffusion prior and decoder training classes
|
2022-05-22 15:10:15 -07:00 |
|
Phil Wang
|
a1ef023193
|
use pydantic to manage decoder training configs + defaults and refactor training script
|
2022-05-22 14:27:40 -07:00 |
|