Model Reviews

Kolors Text-to-Image: 6 Prompt Tests (1024px)

Kolors Text-to-Image: 6 Prompt Tests (1024px)

Kolors Text-to-Image: 6 Prompt Tests (1024px)

Kolors is a diffusion-based text-to-image model from the Kuaishou Kolors team. The project highlights strong Chinese and English text rendering and solid prompt understanding. This post runs six prompts that stress text, portraits, product photos, and big sci-fi scenes.

Model link

Test setup

  • Size: 1024×1024
  • Steps: 30
  • Guidance scale: 3.5
  • Samples: 1
  • Negative prompt: “bad, blurry, watermark”

Test 1: Chinese text rendering

Ladybug holding a sign with Chinese characters
Prompt: Macro photo of a ladybug holding a small sign with the Chinese characters 可图. Ultra-detailed. Shallow depth of field. Natural light.

This prompt checks small text legibility plus macro detail. If characters warp, try fewer style adjectives and ask for a larger sign.

Test 2: Neon sign typography in a complex scene

Cyberpunk rainy street with a neon WIRO sign
Prompt: Cinematic cyberpunk street at night in the rain. Neon sign reads WIRO in bold letters. Reflections on wet pavement. High detail.

Text inside busy lighting often breaks in subtle ways. A good result keeps the word readable while preserving reflections and rain mood.

Test 3: Portrait + sunset lighting

Woman in a red dress on a rooftop at sunset
Prompt: A woman with long black hair, wearing a red dress, stands on a rooftop, gazing at the golden sunset. City skyline in the background. Cinematic lighting. Realistic.

Portrait prompts show how well the model handles skin, hair, and color grading. Keep the prompt simple if faces drift.

Test 4: Epic sci-fi scale

Gas giant planet with rings and floating cities
Prompt: A colossal gas giant with swirling orange and blue storms dominates the sky, surrounded by a massive ring system. Floating cities hover above the clouds. Epic scale. Ultra-HD.

This checks composition and worldbuilding. The best outputs keep coherent lighting and readable structures even with lots of elements.

Test 5: Clean product photo

Japanese bento box product photo
Prompt: Studio product photo of a Japanese bento box on a clean white table. Chopsticks beside it. Soft shadows. High detail. 85mm lens.

Product-style prompts rely on believable shadows and materials. If props drift, state exact counts and keep the scene minimal.

Test 6: Minimal poster typography

Minimalist poster with KOLORS text
Prompt: Minimalist poster design. Big title text: KOLORS. Small subtitle text: TEXT TO IMAGE. Clean layout, high contrast, subtle grain.

Poster prompts are a fast way to test layout and text. If the letters are not perfect, generate the background and add type in a design tool.

Quick takeaways

What was tested What worked What to watch
Chinese and English text Readable layouts in many cases Exact typography can still deform
Complex scenes Strong mood and composition Busy prompts can introduce artifacts
Product lighting Soft studio shadows Prop counts can drift without anchors

Try it


Leave a Comment

Your email address will not be published. Required fields are marked *