Pushing ChatGPT’s Image Generation to the Extreme

When a technology company makes a revolutionary, somewhat terrifying innovation, it is a law of humanity that the rest of us will try to make a mockery of it. The latest iteration of that trend involves OpenAI’s ChatGPT and DALL-E 3 artificial intelligence programmes, with users demanding that they ‘make it more’.

OpenAI has recently released DALL-E 3, the third evolution of its image generation programme. DALL-E 3 is more advanced, more responsive, and better able to incorporate all prompt concepts into its creations. The results are often visually stunning, bordering on lifelike. What’s more, it’s now also integrated into ChatGPT, meaning users can ask ChatGPT to create images.

Because the generated images are based on concepts, they can be extrapolated, and users have been playing around with the idea of pushing the generation software to its limits.

The ‘make it more’ trend sees people asking ChatGPT to create an image and then increasingly enhance some aspect of it.

For example, Don Caldwell, editor-in-chief of the site Know Your Meme, asked ChatGPT to generate an image of a ‘meme lord’. He then proceeded to demand the software ‘make it more of a meme lord’ over and over again.

Reddit user @dulipat asked ChatGPT for a bowl of spicy ramen and then asked it to generate increasingly spicy versions.

Modulos AI CEO Kevin Schawinski asked for an image of Switzerland and then pushed it to be ‘more Swiss’.

Typically, as ChatGPT is generating the images, it attempts to push back on the prompts of the user. “I’m sorry, but I’ve reached the limit of how much I can exaggerate the hamster’s hunger in a cartoon-style while still maintaining a suitable and sensible depiction,” it told one Reddit user. Of course, the programme can still be forced to create further, even more exaggerated images even when it protests.

What many users notice is that the third iteration of their prompt tends to be the best. Beyond that, ChatGPT and DALL-E 3 begin depicting images or concepts in space, similar to the artwork of Alex Grey or the ‘galaxy brain’ meme. The training data appears to suggest that a more extreme version of a thing encompasses outer space and spiritualism.

The release of DALL-E 3 last month has caused concern among artists. Visual creators have already expressed fears that AI image generation will ultimately replace them, something that is already happening in a range of industries, not least in media.

There is widespread concern that AI programmes like DALL-E, and a host of others, do not pay artists or image creators for the images they are trained on. A group of artists, including illustrator Kelly McKernan, is currently suing Stability AI, Midjourney, and DeviantArt, companies behind image-generation tools, for copyright infringement.

OpenAI has implemented a system whereby image owners can opt out of having their images used in training data. However, the process has been criticised for being time-consuming while still not preventing the programme from copying artistic style.

How Can I Use ChatGPT to ‘Make It More’?

Access to DALL-E 3 is integrated through ChatGPT. However, the service is only available to subscribers of ChatGPT Plus and Enterprise. Non-paying users will have to make do with the freely accessible DALL-E 2.

That said, there is a workaround. DALL-E 3 image generation is integrated into Bing Chat. Microsoft’s own AI assistant search programme can generate images from user prompts, powered by DALL-E 3. Bing Chat is absolutely free to use, although Microsoft will limit how much you can use the chat without signing in and using its internet browser, Edge.

The responses don’t seem quite as funny or as characteristic as those generated through ChatGPT, and Bing Chat will generate four images at a time, even when asked not to, but it’s still a free way to use the cutting-edge image generator.


