Sana 1600M (1024px): 6 Prompt Tests

Sana is a text-to-image framework built for efficient high-resolution generation. The paper attributes its speed to design choices such as a deep-compression autoencoder and linear attention in place of standard attention, which let it scale to larger image sizes while staying fast.

Test setup

  • Model variant: Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers
  • Size: 1024×1024
  • Steps: 22 (Test 1 used 25)
  • Guidance scale: 3.5
  • Negative prompt: “bad, ugly, low quality, watermark, blurry, deformed”
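
For readers who want to reproduce this setup, here is a minimal sketch of how the settings above might map onto the Hugging Face diffusers `SanaPipeline`. The helper names (`SanaSettings`, `call_kwargs`, `generate`), the CUDA device, and the seed are assumptions for illustration; the review does not state which script, hardware, or seed was used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SanaSettings:
    """Values copied from the test setup list; the class itself is hypothetical."""
    model_id: str = "Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers"
    width: int = 1024
    height: int = 1024
    steps: int = 22
    guidance_scale: float = 3.5
    negative_prompt: str = "bad, ugly, low quality, watermark, blurry, deformed"


def call_kwargs(prompt, settings=SanaSettings(), steps=None):
    """Build the keyword arguments for one pipeline call.

    Pass steps to override the default for a single test (Test 1 used 25).
    """
    return {
        "prompt": prompt,
        "negative_prompt": settings.negative_prompt,
        "width": settings.width,
        "height": settings.height,
        "num_inference_steps": settings.steps if steps is None else steps,
        "guidance_scale": settings.guidance_scale,
    }


def generate(prompt, settings=SanaSettings(), steps=None, seed=0):
    """Load the model and return one PIL image for the prompt."""
    # Heavy imports stay inside the function so the settings helpers
    # can be used without torch or diffusers installed.
    import torch
    from diffusers import SanaPipeline

    pipe = SanaPipeline.from_pretrained(settings.model_id, torch_dtype=torch.bfloat16)
    pipe.to("cuda")  # assumption: a CUDA GPU is available
    # Assumption: a fixed seed for repeatability; the review does not give one.
    generator = torch.Generator(device="cuda").manual_seed(seed)
    result = pipe(**call_kwargs(prompt, settings, steps), generator=generator)
    return result.images[0]
```

Calling `generate(prompt, steps=25)` with the Test 1 prompt would mirror that test's step count, while the other prompts would use the default of 22. Loading the pipeline once and reusing it across all six prompts would be faster than this per-call sketch.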

Test 1: Studio product shot + simple label text

Studio product photo of a matte black water bottle with a label
Prompt: Studio product photo of a matte black reusable water bottle on a white seamless background. The bottle has a clean white label with the exact text ‘AURORA’ on top line and ‘SPARKLING WATER’ on second line. Sans serif, centered. Softbox reflections. 85mm lens, sharp, high detail.

The lighting and reflections land well for a clean product look. The label placement looks plausible, but the text itself does not stay perfectly crisp. For branding work, expect to add text in post.

Test 2: Pixel art scene (style control)

Pixel art seaside town at night with a lighthouse
Prompt: Pixel art scene of a tiny seaside town at night. A lighthouse beam sweeps across the ocean. Small boats in the harbor. 16-bit era style. Crisp edges. Limited palette of navy, cyan, and warm yellow.

This prompt checks whether the model can lock into a constrained style. The result reads as retro pixel art with clean shapes and a coherent palette. Small details can still blur into painterly texture if the prompt asks for too much realism.

Test 3: Double exposure portrait (composition and blending)

Double exposure portrait with city skyline inside silhouette
Prompt: Double exposure portrait of a cyclist wearing a helmet. Inside the silhouette, a modern city skyline at sunset. Teal and orange color grade. Soft film grain. Clean edges. High detail.

Double exposure stresses masking and edge handling. The silhouette stays readable and the skyline blend looks intentional. This type of prompt benefits from simple subject shapes and a strong color grade callout.

Test 4: Minimal movie poster + typography

Minimalist foggy forest movie poster
Prompt: Minimalist movie poster. Background: foggy pine forest at dawn. Big title text reads ‘SANA’. Small tagline below reads ‘FAST HIGH RES’. Clean layout. Centered. Subtle paper texture.

The layout and mood look poster-like, but the letters may come out imperfect or stylized. If the goal is production typography, generate the background first, then set type with a design tool.
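
As one way to set type after generation, the sketch below overlays a centered title on a saved background using Pillow. The function names, font path, font size, and vertical position are all hypothetical; a design tool gives far more control, and this only illustrates the "background first, text second" workflow.

```python
def centered_origin(canvas_width, center_y, bbox):
    """Return the draw origin that centers a text box on the canvas.

    bbox is (left, top, right, bottom), measured with the text origin at (0, 0).
    """
    left, top, right, bottom = bbox
    x = (canvas_width - (right - left)) // 2 - left
    y = center_y - (bottom - top) // 2 - top
    return x, y


def set_title(background_path, out_path, title, font_path, size=160, center_y=300):
    """Draw a horizontally centered title onto a generated background."""
    # Pillow is imported lazily; font_path must point to a real .ttf or .otf file.
    from PIL import Image, ImageDraw, ImageFont

    image = Image.open(background_path).convert("RGB")
    draw = ImageDraw.Draw(image)
    font = ImageFont.truetype(font_path, size)
    # Measure the rendered text first, then shift the origin so it is centered.
    bbox = draw.textbbox((0, 0), title, font=font)
    origin = centered_origin(image.width, center_y, bbox)
    draw.text(origin, title, font=font, fill="white")
    image.save(out_path)
```

For this workflow, the poster prompt would drop the title and tagline instructions so the model renders only the foggy forest plate.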

Test 5: Counting + exact object list

Top-down breakfast flat lay with croissant and fruit
Prompt: Top-down flat lay breakfast photo on a pale stone table. Exactly 1 croissant, 2 strawberries, 3 blueberries, and 4 almonds. Arrange them in a spiral. Soft natural light. Sharp focus.

This checks if the model respects exact counts. The image often gets close, but it can still miss the exact number when objects overlap or look similar. Keep objects large and separated if counting matters.

Test 6: Cinematic wildlife photo (texture + lighting)

Cinematic black panther in a rainforest at night
Prompt: Cinematic photo of a black panther in a rainforest at night. Wet fur with small water droplets. Rim light from behind. Shallow depth of field. Ultra-detailed. Realistic.

This is a good fit for Sana: a clear subject, strong lighting direction, and a tight scene. Fur texture and rim light read well, and the background stays soft enough to keep focus on the subject.

Quick takeaways

What was tested           | What worked               | What to watch
--------------------------|---------------------------|---------------------------------------
Product lighting          | Clean studio reflections  | Text on labels needs cleanup
Style control (pixel art) | Strong palette and shapes | Over-detailed prompts can soften edges
Poster composition        | Readable layout and mood  | Legible typography remains hard
Exact counting            | Close on simple layouts   | Small repeated objects drift in count

Where Sana fits

  • Fast iteration on 1024px concepts with solid style control
  • Background plates for posters, ads, and product comps
  • High-resolution pipelines when paired with post tools for text and strict counts
