Model Comparison

Grok Imagine Image vs GPT Image 1.5: 5 Prompt Tests

Grok Imagine Image vs GPT Image 1.5: 5 Prompt Tests

Grok Imagine Image vs GPT Image 1.5 becomes useful once both models face the same prompts for text, counting, reflections, and poster layouts under equal conditions.

Models in this comparison

Test setup

  • 5 prompts. Same text for both models.
  • 1 sample per prompt.
  • Portrait outputs: Grok used 9:16 at 2K. GPT used 2:3 at medium quality.
  • Run time measured from task completion times.
Model Avg time per image (5 runs) Notes
Grok Imagine Image ~10s Strong cinematic contrast. Very clean reflections.
GPT Image 1.5 ~22s Sharp detail. Often a bit more literal on typography.

Prompt tests

Prompt 1: Product photo with label text

Prompt: Studio product photo of a clear glass cold brew bottle on a white seamless background. The label shows the exact text NIGHT SHIFT on the first line and COLD BREW on the second line. Clean sans serif, centered, crisp. Softbox reflections, 85mm lens, sharp focus, high detail.

Grok Imagine Image

Grok Imagine Image output of a studio product photo of a glass cold brew bottle with the label NIGHT SHIFT and COLD BREW
Prompt: Studio product photo of a clear glass cold brew bottle on a white seamless background. The label shows the exact text NIGHT SHIFT on the first line and COLD BREW on the second line. Clean sans serif, centered, crisp. Softbox reflections, 85mm lens, sharp focus, high detail.

GPT Image 1.5

GPT Image 1.5 output of a clean product photo of a cold brew bottle with the label NIGHT SHIFT and COLD BREW
Prompt: Studio product photo of a clear glass cold brew bottle on a white seamless background. The label shows the exact text NIGHT SHIFT on the first line and COLD BREW on the second line. Clean sans serif, centered, crisp. Softbox reflections, 85mm lens, sharp focus, high detail.

Both nailed legible text. Grok looked more like a premium studio shoot. GPT went for a flatter, catalog style.

Prompt 2: Exact object list and counting

Prompt: Top-down flat lay photo on a pale stone table. Exactly 2 red pencils, 3 silver paper clips, 1 lemon, and 4 yellow sticky notes. Arrange them in a spiral. Soft natural light, sharp focus, realistic.

Grok Imagine Image

Grok Imagine Image output of a top-down flat lay with red pencils, paper clips, a lemon, and yellow sticky notes arranged in a spiral
Prompt: Top-down flat lay photo on a pale stone table. Exactly 2 red pencils, 3 silver paper clips, 1 lemon, and 4 yellow sticky notes. Arrange them in a spiral. Soft natural light, sharp focus, realistic.

GPT Image 1.5

GPT Image 1.5 output of a top-down flat lay with pencils, paper clips, a lemon, and sticky notes in a spiral arrangement
Prompt: Top-down flat lay photo on a pale stone table. Exactly 2 red pencils, 3 silver paper clips, 1 lemon, and 4 yellow sticky notes. Arrange them in a spiral. Soft natural light, sharp focus, realistic.

Grok stayed physically believable. GPT introduced bent pencils that look impossible. This prompt punishes small geometry mistakes.

Prompt 3: Rainy neon street action

Prompt: Photorealistic scene of a cyclist crossing a rainy neon-lit street at night. Puddles with colorful reflections, motion blur on the wheels, sharp raindrops in streetlights, cinematic framing, high detail.

Grok Imagine Image

Grok Imagine Image output of a cyclist crossing a rainy neon-lit street at night with reflections in puddles
Prompt: Photorealistic scene of a cyclist crossing a rainy neon-lit street at night. Puddles with colorful reflections, motion blur on the wheels, sharp raindrops in streetlights, cinematic framing, high detail.

GPT Image 1.5

GPT Image 1.5 output of a cyclist in rain at night on a wet street with neon lights
Prompt: Photorealistic scene of a cyclist crossing a rainy neon-lit street at night. Puddles with colorful reflections, motion blur on the wheels, sharp raindrops in streetlights, cinematic framing, high detail.

Both hit the brief. Grok delivered the stronger movie look. GPT looked real but less dramatic.

Prompt 4: Poster layout with typography

Prompt: Minimalist travel poster illustration for REYKJAVIK. Background: snowy mountains and a small town under the northern lights. Big title text REYKJAVIK at top, small tagline RING ROAD WINTER below. Clean layout, paper texture, limited palette.

Grok Imagine Image

Grok Imagine Image output of a minimalist Reykjavik travel poster with aurora and the text REYKJAVIK and RING ROAD WINTER
Prompt: Minimalist travel poster illustration for REYKJAVIK. Background: snowy mountains and a small town under the northern lights. Big title text REYKJAVIK at top, small tagline RING ROAD WINTER below. Clean layout, paper texture, limited palette.

GPT Image 1.5

GPT Image 1.5 output of a vintage style Reykjavik travel poster with aurora and readable title and tagline
Prompt: Minimalist travel poster illustration for REYKJAVIK. Background: snowy mountains and a small town under the northern lights. Big title text REYKJAVIK at top, small tagline RING ROAD WINTER below. Clean layout, paper texture, limited palette.

Both handled title and subtitle well. GPT leaned into a classic printed-poster look. Grok looked cleaner and more modern.

Prompt 5: Hard reflections and tiny engraving

Prompt: Ultra realistic photo of a transparent glass chess set on a mirrored table in a sunlit room. Caustic light patterns, soft depth of field, a curious cat in the background. The king piece has a tiny engraving that reads E4. High detail, natural colors.

Grok Imagine Image

Grok Imagine Image output of a glass chess set on a mirrored table with a cat in the background and an E4 engraving on the king
Prompt: Ultra realistic photo of a transparent glass chess set on a mirrored table in a sunlit room. Caustic light patterns, soft depth of field, a curious cat in the background. The king piece has a tiny engraving that reads E4. High detail, natural colors.

GPT Image 1.5

GPT Image 1.5 output of a close-up glass chess set with a cat watching and a visible E4 engraving
Prompt: Ultra realistic photo of a transparent glass chess set on a mirrored table in a sunlit room. Caustic light patterns, soft depth of field, a curious cat in the background. The king piece has a tiny engraving that reads E4. High detail, natural colors.

GPT pulled ahead on sharp micro-detail. Grok looked more atmospheric and wide-angle. Both kept the E4 engraving readable.

Takeaways

  • Grok Imagine Image won 3 out of 5 prompts here, mostly on realism and cinematic lighting.
  • GPT Image 1.5 looked strongest on poster styling and close-up detail work.
  • For strict physical plausibility, Grok stayed cleaner on this set.
  • For typography-driven designs, GPT produced a more cohesive printed poster vibe.

Try both models here: Grok Imagine Image and GPT Image 1.5.


Leave a Comment

Your email address will not be published. Required fields are marked *