mirror of
https://github.com/lucidrains/DALLE2-pytorch.git
synced 2025-12-19 09:44:19 +01:00
bring in the simple tokenizer released by OpenAI, but also plan on leaving room for a custom tokenizer with yttm
262145
dalle2_pytorch/data/bpe_simple_vocab_16e6.txt
Normal file
File diff suppressed because it is too large