The multimodal space is heating up, earlier this week, a research group announced their own version of DALL-E which is 4 billion parameters and takes in simplified Chinese and generates images. If you don’t know what DALL-E is, DALL-E is a model OpenAI published about earlier this year which can generate entire images just with a text description.
CogView - DALL-E in simplified Chinese
CogView - DALL-E in simplified Chinese
CogView - DALL-E in simplified Chinese
The multimodal space is heating up, earlier this week, a research group announced their own version of DALL-E which is 4 billion parameters and takes in simplified Chinese and generates images. If you don’t know what DALL-E is, DALL-E is a model OpenAI published about earlier this year which can generate entire images just with a text description.