Commit Graph

499 Commits

Author SHA1 Message Date
Phil Wang
3ee3c56d2a add learned padding tokens, same strategy as dalle1, for diffusion prior, and get rid of masking in causal transformer v0.23.1 2022-07-12 17:33:14 -07:00
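A minimal sketch of the learned-padding idea above, assuming a hypothetical module and argument names (not the repo's exact code): padded positions are swapped for learned pad tokens instead of being masked out of attention.

```python
import torch
import torch.nn as nn

class LearnedPadding(nn.Module):
    # hypothetical module: replace padded positions with learned per-position
    # pad tokens, so the causal transformer needs no attention mask at all
    def __init__(self, max_text_len, dim):
        super().__init__()
        self.pad_tokens = nn.Parameter(torch.randn(max_text_len, dim))

    def forward(self, text_encodings, mask):
        # text_encodings: (batch, seq, dim); mask: (batch, seq) bool, True = real token
        b, n, _ = text_encodings.shape
        pad = self.pad_tokens[:n].unsqueeze(0).expand(b, -1, -1)
        return torch.where(mask.unsqueeze(-1), text_encodings, pad)
```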
Phil Wang
cd26c6b17d 0.22.3 v0.22.3 2022-07-12 17:08:31 -07:00
Phil Wang
775abc4df6 add setting to attend to all text encodings regardless of padding, for diffusion prior 2022-07-12 17:08:12 -07:00
Phil Wang
11b1d533a0 make sure text encodings being passed in have the correct batch dimension v0.22.1 2022-07-12 16:00:19 -07:00
Phil Wang
e76e89f9eb remove text masking altogether in favor of deriving it from the text encodings (padded text encodings must use a pad value of 0) v0.22.2 2022-07-12 15:40:31 -07:00
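The zero-padding convention lends itself to deriving the mask directly from the encodings; a minimal sketch (illustrative helper name):

```python
import torch

def derive_text_mask(text_encodings):
    # (batch, seq, dim) -> (batch, seq) bool; a position counts as real text
    # if any feature of its encoding is nonzero
    return (text_encodings != 0).any(dim=-1)
```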
Phil Wang
bb3ff0ac67 protect against bad text mask being passed into decoder v0.21.3 2022-07-12 15:33:13 -07:00
Phil Wang
1ec4dbe64f one more fix for text mask; if the length of the text encoding exceeds max_text_len, add an assert for a better error message v0.21.2 2022-07-12 15:01:46 -07:00
Phil Wang
e0835acca9 generate the text mask from the text encodings within the unet and diffusion prior themselves, if not given v0.21.1 2022-07-12 12:54:59 -07:00
Phil Wang
e055793e5d shoutout for @MalumaDev 2022-07-11 16:12:35 -07:00
Phil Wang
1d9ef99288 add PixelShuffleUpsample thanks to @MalumaDev and @marunine for running the experiment and verifying absence of checkerboard artifacts v0.21.0 2022-07-11 16:07:23 -07:00
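A rough sketch of a pixel-shuffle upsampler (illustrative, with assumed names; the repo's module may differ in details such as weight init): a 1x1 conv expands channels 4x, then PixelShuffle rearranges them into a 2x larger feature map, sidestepping the checkerboard artifacts transposed convolutions can produce.

```python
import torch.nn as nn

def PixelShuffleUpsample(dim, dim_out=None):
    dim_out = dim_out if dim_out is not None else dim
    return nn.Sequential(
        nn.Conv2d(dim, dim_out * 4, 1),   # expand channels by 4
        nn.SiLU(),
        nn.PixelShuffle(2),               # (b, 4c, h, w) -> (b, c, 2h, 2w)
    )
```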
Phil Wang
bdd62c24b3 zero init final projection in unet, since openai and @crowsonkb are both doing it v0.20.1 2022-07-11 13:22:06 -07:00
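The zero-init trick, sketched with example sizes (assumed values): with a zeroed final projection, the unet initially predicts zeros, which tends to stabilize early training.

```python
import torch.nn as nn

dim, channels_out = 128, 3                    # example sizes
final_conv = nn.Conv2d(dim, channels_out, 1)
nn.init.zeros_(final_conv.weight)             # output starts at exactly zero
nn.init.zeros_(final_conv.bias)
```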
Phil Wang
1f1557c614 make it so that even if the text mask is omitted, it will be derived from whether the text encodings are all 0s, simplifying dataloading v0.20.0 2022-07-11 10:56:19 -07:00
Aidan Dempster
1a217e99e3 Unet parameter count is now shown (#202) 2022-07-10 16:45:59 -07:00
Phil Wang
7ea314e2f0 allow for final l2norm clamping of the sampled image embed v0.19.6 2022-07-10 09:44:38 -07:00
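One plausible reading of the l2norm clamp, sketched (hypothetical helper): since CLIP image embeddings lie on the unit sphere, projecting the sampled embedding back to unit l2 norm keeps it in distribution.

```python
import torch.nn.functional as F

def l2norm_clamp(image_embed):
    # renormalize the sampled embedding to unit l2 norm along the feature dim
    return F.normalize(image_embed, dim=-1)
```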
Phil Wang
4173e88121 more accurate readme 2022-07-09 20:57:26 -07:00
Phil Wang
3dae43fa0e fix misnamed variable, thanks to @nousr v0.19.5 2022-07-09 19:01:37 -07:00
Phil Wang
a598820012 do not noise for the last step in ddim v0.19.4 2022-07-09 18:38:40 -07:00
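A simplified sketch of the fix (hypothetical argument names, scalar alpha/sigma for brevity): the stochastic term is skipped on the final step so the sample is returned deterministically.

```python
import math
import torch

def ddim_step(x_t, pred_x0, pred_noise, alpha_next, sigma, is_last_step):
    # x_next = sqrt(a_next) * x0 + sqrt(1 - a_next - sigma^2) * eps + sigma * z
    coef = math.sqrt(max(1.0 - alpha_next - sigma ** 2, 0.0))
    x_next = math.sqrt(alpha_next) * pred_x0 + coef * pred_noise
    if not is_last_step:                       # the fix: no noise on the last step
        x_next = x_next + sigma * torch.randn_like(x_t)
    return x_next
```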
Phil Wang
4878762627 fix for small validation bug for sampling steps v0.19.3 2022-07-09 17:31:54 -07:00
Phil Wang
47ae17b36e more informative error for something that tripped me up v0.19.2 2022-07-09 17:28:14 -07:00
Phil Wang
b7e22f7da0 complete ddim integration of diffusion prior as well as decoder for each unet, feature complete for https://github.com/lucidrains/DALLE2-pytorch/issues/157 v0.19.1 2022-07-09 17:25:34 -07:00
Romain Beaumont
68de937aac Fix decoder test by fixing the resizing output size (#197) 2022-07-09 07:48:07 -07:00
Phil Wang
097afda606 0.18.0 v0.18.0 2022-07-08 18:18:38 -07:00
Aidan Dempster
5c520db825 Added deepspeed support (#195) 2022-07-08 18:18:08 -07:00
Phil Wang
3070610231 just force it so the researcher can never pass in an image smaller than the size required by CLIP or CoCa v0.17.1 2022-07-08 18:17:29 -07:00
Aidan Dempster
870aeeca62 Fixed issue where evaluation would error when large image was loaded (#194) 2022-07-08 17:11:34 -07:00
Romain Beaumont
f28dc6dc01 set up simple ci (#193) 2022-07-08 16:51:56 -07:00
Phil Wang
081d8d3484 0.17.0 v0.17.0 2022-07-08 13:36:26 -07:00
Aidan Dempster
a71f693a26 Add the ability to auto restart the last run when started after a crash (#191)
* Added autoresume after crash functionality to the trackers

* Updated documentation

* Clarified what goes in the autorestart object

* Fixed style issues

* Unraveled conditional block

* Changed to using a helper function to get the step count
2022-07-08 13:35:40 -07:00
Phil Wang
d7bc5fbedd expose num_steps_taken helper method on trainer to retrieve number of training steps of each unet v0.16.19 2022-07-08 13:00:56 -07:00
Phil Wang
8c823affff allow for control over the use of the nearest-neighbor interpolation method for downsampling the low-res conditioning, in addition to being able to turn it off v0.16.18 2022-07-08 11:44:43 -07:00
Phil Wang
ec7cab01d9 extra insurance that the diffusion prior is on the correct device, when using the trainer with an accelerator or when a device was given v0.16.17 2022-07-07 10:08:33 -07:00
Phil Wang
46be8c32d3 fix a potential issue in the low resolution conditioner, when downsampling and then upsampling using resize right, thanks to @marunine v0.16.16 2022-07-07 09:41:49 -07:00
Phil Wang
900f086a6d fix condition_on_text_encodings in dalle2 orchestrator class, fix readme 2022-07-07 07:43:41 -07:00
zion
b3e646fd3b add readme for prior (#159)
* add readme for prior

* offload prior info in main readme

* typos
2022-07-06 20:50:52 -07:00
Phil Wang
6a59c7093d more shots in the dark regarding fp16 with learned variance for deepspeed issue v0.16.14 2022-07-06 19:05:50 -07:00
Phil Wang
a6cdbe0b9c relax learning rate constraint, as @rom1504 wants to try a higher one v0.16.13 2022-07-06 18:09:11 -07:00
Phil Wang
e928ae5c34 default the device to the device that the diffusion prior parameters are on, if the trainer was given neither an accelerator nor a device v0.16.12 2022-07-06 12:47:48 -07:00
Phil Wang
1bd8a7835a attempting to fix issue with deepspeed fp16 seeing overflowing gradient v0.16.10 2022-07-06 08:27:34 -07:00
Phil Wang
f33453df9f debugging with Aidan v0.16.9 2022-07-05 18:22:43 -07:00
Phil Wang
1e4bb2bafb cast long as float before deriving sinusoidal pos emb v0.16.8 2022-07-05 18:01:22 -07:00
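A sketch of why the cast matters (illustrative sinusoidal embedding): timesteps arrive as integer (long) tensors, but exp/sin/cos need floating point inputs.

```python
import math
import torch

def sinusoidal_emb(t, dim):
    t = t.float()                              # the fix: cast long -> float first
    half = dim // 2
    freqs = torch.exp(torch.arange(half, device=t.device) * (-math.log(10000.0) / (half - 1)))
    args = t[:, None] * freqs[None, :]
    return torch.cat((args.sin(), args.cos()), dim=-1)
```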
Phil Wang
ee75515c7d remove forcing of softmax in f32, in case it is interfering with deepspeed v0.16.7 2022-07-05 16:53:58 -07:00
Phil Wang
ec68243479 add ability to do warmup steps for each unet during training v0.16.6 2022-07-05 16:24:16 -07:00
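One common way to implement per-unet warmup, sketched (a hypothetical linear schedule, not necessarily the repo's approach): each unet gets its own scheduler, so each tracks its own step count.

```python
from torch.optim.lr_scheduler import LambdaLR

def warmup_scheduler(optimizer, warmup_steps):
    # scale the lr linearly from ~0 up to its base value over warmup_steps
    return LambdaLR(optimizer, lambda step: min(1.0, (step + 1) / warmup_steps))
```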
Phil Wang
3afdcdfe86 need to keep track of training steps separately for each unet in decoder trainer v0.16.3 2022-07-05 15:17:59 -07:00
Phil Wang
b9a908ff75 bring in two tricks from the cogview paper for reducing the chances of overflow, for attention and layernorm v0.16.2 2022-07-05 14:27:04 -07:00
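Roughly, the two CogView-style stabilizers, sketched with illustrative names: softmax is shift-invariant, so subtracting the detached rowwise max changes nothing mathematically but keeps exp() in fp16 range; likewise layernorm's output is invariant to rescaling its input, so dividing by the detached max magnitude first keeps the intermediate statistics small.

```python
import torch

def stable_attend(sim):
    # subtract the detached rowwise max before softmax (result is unchanged)
    sim = sim - sim.amax(dim=-1, keepdim=True).detach()
    return sim.softmax(dim=-1)

def stable_prenorm(x):
    # shrink activations by their max before layernorm (layernorm undoes the scale)
    return x / x.amax(dim=-1, keepdim=True).detach()
```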
Phil Wang
e1fe3089df do bias-less layernorm manually v0.16.0 2022-07-05 13:09:58 -07:00
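Written out by hand, a bias-less layernorm might look like the following (a sketch, not necessarily the repo's exact module): only a learned gain, no learned shift.

```python
import torch
import torch.nn as nn

class LayerNorm(nn.Module):
    def __init__(self, dim, eps=1e-5):
        super().__init__()
        self.eps = eps
        self.g = nn.Parameter(torch.ones(dim))   # gain only, no bias

    def forward(self, x):
        var = torch.var(x, dim=-1, unbiased=False, keepdim=True)
        mean = torch.mean(x, dim=-1, keepdim=True)
        return (x - mean) * (var + self.eps).rsqrt() * self.g
```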
Phil Wang
6d477d7654 link to dalle2 laion 2022-07-05 11:43:07 -07:00
Phil Wang
531fe4b62f status 2022-07-05 10:46:55 -07:00
Phil Wang
ec5a77fc55 0.15.4 v0.15.4 2022-07-02 08:56:34 -07:00
Aidan Dempster
fac63c61bc Fixed variable naming issue (#183) 2022-07-02 08:56:03 -07:00
Phil Wang
3d23ba4aa5 add ability to specify full self attention on specific stages in the unet v0.15.3 2022-07-01 10:22:07 -07:00