Ryan Haines / Android Authority
AI-generated images are more impressive than ever, with some even winning photography awards and fooling experts in the process. The best part? You don’t need to be a professional artist or have any technical skills to create them. But not all AI image generators are created equal: some excel at realism, while others are riddled with easy-to-spot errors. One thing is for sure: very few can generate text reliably. To find the best one, I pushed each AI image generator with successively more challenging prompts. Here are my findings.
Which is the best AI image generator?
C. Scott Brown / Android Authority
Finding the best AI image generator is difficult since the results can vary wildly from one prompt to another. However, we know that generative AI tends to struggle in certain areas more than others, so we can tailor our prompts to highlight these weaknesses and see where each one shines or fails. Virtually all image generators can handle simpler art styles, so I’ll limit testing to realistic scenes this time.
If you ever need to stress test an AI image generator, try asking for images with intricate details like hands, hair, or text. Only a handful of them can handle these well, with others often producing distorted or unrealistic results. Another good test is complex scenes with multiple subjects or unusual perspectives, which tend to trip up even the best models.
With that in mind, I decided to test a handful of different AI image generators. Specifically, I picked Google’s Imagen 3, Meta’s Imagine, DALL-E 3 via Microsoft Designer and ChatGPT, and Grok. For my first prompt, I asked for an image of a person crying. This request may seem too basic, but the variance in the results was fascinating.
Prompt 1: A person crying, with tears streaming down their face
As you can already tell, images from different AI models look nothing alike. While part of this is because my prompt was rather vague, each image generator I tested was also trained on a different dataset. Meta used public images from Facebook and Instagram, for example, while it’s less clear how most other companies obtained their training datasets.
Replicating anatomy has always been challenging for AI image generators and these results only prove that fact. Google’s Imagen 3 produced an extremely convincing result, while others, like Meta’s Imagine, generated less convincing images. I retested this prompt with minor variations to increase the sample size, but Imagen 3 won every single time.
Microsoft Designer uses OpenAI’s DALL-E 3 under the hood, meaning it should produce results similar to ChatGPT’s. That proved to be true in my testing, with both services delivering decent results.
Winner: Imagen 3, followed by DALL-E 3
Prompt 2: An action-packed scene of two dancers mid-performance in a rain-soaked street…
I increased the complexity and detail of my prompt this time, while keeping human subjects in the frame. Imagen 3 yielded an excellent result once again, only faltering with one subject’s fingers. On the other hand, Meta’s Imagine botched one dancer’s limbs and face entirely, and I would consider the result unusable.
Microsoft Designer offered cartoon-style results, which looked adequate but weren’t what I was looking for. ChatGPT’s attempt was much worse, with an extra limb sprouting out of one dancer. Thankfully, Grok swung the pendulum back with a reasonable result, apart from the dancers’ interlocked fingers.
Prompt 3: Generate an image of an Airbus A380…taxiing down a runway with tropical trees in the background.
I may sound like a broken record at this point, but Imagen 3 continues to decimate the competition. Even though this prompt requires the AI to generate text on the fuselage, Google’s model handled it with ease. The airline’s name is replicated perfectly and, aside from the odd runway and taxiway markings, it’s nearly impossible to tell that the image is AI-generated.
Grok delivered a similarly impressive result, though not on the first try, and it still garbled some windows on the plane’s upper deck. The chatbot uses a relatively new image generator called Flux, created by the researchers who developed Stable Diffusion. Given the latter’s popularity in the image generator space, it’s no surprise that Grok can produce excellent results.
Unfortunately, the other AI image generators delivered sub-par to comically bad results here. Meta’s Imagine spat out garbled text and the wrong plane. DALL-E 3 via ChatGPT almost nailed the text on the side of the plane but generated malformed runway markings. Microsoft Designer uses the same DALL-E 3 model but somehow delivered even worse-looking, unrealistic images.
It’s worth noting that adding words like “photorealistic” or “HD” did little to make the AI-generated results any more authentic-looking or lifelike. The impact was minimal at best, even though it’s standard practice to include these words as part of good prompting.
Winner: Imagen 3, followed by Grok
Prompt 4: Famous personalities
A lot has been said about the dark side of AI image generators and their potential to sway public opinion through false narratives. To combat this problem, most generative AI platforms now have guardrails preventing you from requesting images that mimic a specific person.
Unsurprisingly then, my prompt was turned down by every single AI image generator except Grok. Elon Musk created Grok as a maximally “truth-seeking” AI, which is just marketing speak for a chatbot with fewer guardrails than its rivals. This lack of restrictions extends to AI-generated images as well, which means you could technically generate images of world leaders, celebrities, and even Musk himself in questionable settings.
Which AI image generator do I recommend?
Many of the AI image generators I tested have unique strengths that set them apart from the rest, so here’s my top pick depending on your priorities.
- Quality: Google’s Imagen 3 may not have the most recognizable brand name of all the AI image generators on this list, but it stands out for delivering realistic images and extremely believable results. The only downside is that you only get one image at a time and the AI processing can take several seconds each time you send in a prompt.
- Speed: Meta Imagine stands out if you need a quick image since you don’t even need to hit the Enter key to see a result. The tool generates an image within a second of typing in a prompt, which feels almost instantaneous compared to other options on this list.
- Cost: With so many AI image generators available today, is paying for one even worth it? Doing so will unlock some nice features, since AI image editing is often locked behind subscription services like Midjourney, Adobe Firefly, and DALL-E 3. For simple AI image generation, though, I’d recommend Imagen 3, Meta Imagine, and Microsoft Designer.
- Censorship: Grok offers one of the best AI image generators with some of the fewest restrictions, so it’s worth a try. The only downside is that you’ll need an X Premium (formerly Twitter Blue) subscription to use the service.
From a practical standpoint, though, the best AI image generator may very well be the one already on your device. For example, Meta AI is already integrated within WhatsApp and Facebook Messenger. If you already use either app, Meta Imagine should serve you for basic image generation needs.
Likewise, the Pixel 9 series ships with Google’s new Pixel Studio app powered by Imagen 3. Alternatively, you can also request AI-generated images via the Gemini app on any Android device. The latter still uses the last-gen Imagen 2 for now, but it will move up to Google’s latest model soon.