Every month, someone launches a new AI image generator and claims it’s “the best.” Instead of taking anyone’s word for it, I ran the same 50 prompts across Midjourney v7, DALL-E 3.5, Flux Pro, and Google Gemini’s Imagen 4.
200 images. Same prompts. Blind evaluation by 30 judges. Here are the results.
Methodology
To make this fair:
- 50 prompts across 10 categories (portraits, landscapes, product shots, illustrations, abstract, architecture, food, fashion, sci-fi, photorealism)
- Default settings on each platform — no custom style tuning
- 30 blind evaluators rated each image on: quality, prompt adherence, aesthetics, and usability
- Same day — all images generated within a 6-hour window to avoid version differences
Overall Rankings
1. Midjourney v7 — Overall Score: 8.7/10
Midjourney remains the king of aesthetic quality. Images have a polished, almost cinematic quality that the other generators can’t quite match. V7 brought significant improvements in:
- Hand and finger rendering — finally mostly correct
- Text in images — readable about 70% of the time
- Photorealism — borderline indistinguishable from photos
- Style consistency — multiple generations maintain a coherent look
Where it falls short: Prompt adherence. Midjourney has a “house style” it gravitates toward, and it sometimes ignores specific details in favor of what it thinks looks better. If you want exactly what you described, this can be frustrating.
2. Flux Pro — Overall Score: 8.4/10
The dark horse of the competition. Flux Pro has improved dramatically since its launch and now competes directly with Midjourney on quality. Key strengths:
- Prompt adherence — the best of any generator. It does what you ask.
- Text rendering — near-perfect text in images, which is a game-changer for design work
- Composition control — spatial reasoning and object placement are excellent
- Speed — fastest generation times of any premium option
Where it falls short: Sometimes lacks the “magic” that Midjourney adds. Images are technically correct but can feel clinical.
3. DALL-E 3.5 — Overall Score: 7.9/10
OpenAI’s latest image generator is solid but not spectacular:
- ChatGPT integration — the seamless conversational editing is unmatched
- Safety filters — the most restrictive of all, which is a pro for commercial use
- Illustration style — particularly good at cartoon and illustration styles
Where it falls short: Photorealism lags behind Midjourney and Flux. Images often have a “AI-generated” look that’s hard to pin down but easy to spot.
4. Gemini Imagen 4 — Overall Score: 7.6/10
Google’s entry is improving fast but still trails:
- Multi-modal understanding — best at understanding complex scene descriptions
- Reference image integration — excellent at style transfer and image editing
- Accessibility — free tier is the most generous
Where it falls short: Aesthetic quality is a step behind. Images often look slightly flat or oversaturated.
Comments · 0
No comments yet. Be the first to share your thoughts.