AI Model Performance Results

See how top AI models perform across creative and technical criteria, as evaluated by expert judges.

AI ModelSetsEvaluationsOverall
Flux 1.1 Pro Ultra000.0
Imagen000.0
Recraft v3000.0
Stable Diffusion 3.53200.0
Dall-E 3000.0
Reve6400.0
Leonardo Phoenix000.0
GPT-Image000.0
Midjourney000.0
Ideogram 3.02213.0
Some columns are hidden on mobile for readability.

About the Evaluation Process

AI-generated images were evaluated by qualified judges across multiple criteria on a scale of 1-5:

  • Prompt Adherence: How well the images follow the given prompt
  • Technical Quality: Overall technical execution
  • Artistic Merit: Aesthetic value and artistic qualities
  • Creativity: Originality and creative interpretation
  • Consistency: Uniformity across the set of 4 images
  • Detail Richness: Level of detail in the generated images
  • Style Accuracy: Appropriateness of style for the genre
  • Overall Score: General impression and quality

Each image set was evaluated by multiple judges to ensure fair assessment.