LongCat-Image: Multilingual Text Rendering in 6 Tests

LongCat-Image targets a tricky combo: photoreal images plus readable text in multiple languages. This review runs six prompts that stress posters, labels, neon signs, diagrams, embroidered patches, and tiny engraving.

Model link

https://wiro.ai/models/meituan-longcat/longcat-image

Test settings

Setting	Value
Aspect ratio	1:1 (1024×1024)
Steps	28
Guidance scale	4.5
Samples	1 per prompt

Results

Test 1: clean poster layout (English + Chinese)

Minimalist poster with WIRO API and Chinese text — Prompt: Minimalist poster on white background, Swiss grid layout, clean typography. Main headline text “WIRO API” and small subtitle text “Build fast”. Add a small Chinese line “你好世界”. Print texture.

The layout stays controlled. The big headline looks close to real type. Small text remains the first thing to blur or drift.

Test 2: product label on glass (short word)

Perfume bottle on black slate with label text AURA — Prompt: Studio product photo of a transparent glass perfume bottle on black slate. Label text “AURA” in a clean serif font. Dramatic spotlights, realistic caustics, shallow depth of field.

Short label text tends to work better than sentences. This test also shows whether glass turns into plastic under harsh highlights.

Test 3: neon sign in a busy scene

Rainy futuristic night market with a neon sign that says MERHABA — Prompt: Rainy futuristic night market, neon reflections, people with umbrellas, cinematic lighting. One clear neon sign says “MERHABA”.

Busy scenes usually break first. A good run keeps depth and readable silhouettes without turning faces into noise.

Test 4: diagram style with labels

Isometric diagram of an API gateway with labels GPU API DB — Prompt: Isometric technical illustration on white background. An API gateway connected to GPU servers and a database. Add simple labels “GPU”, “API”, “DB”. Sharp vector lines.

Vector style works when lines stay clean. Exact label text still needs a quick edit pass in many cases.

Test 5: embroidered patch (texture + text)

Portrait of a person with a jacket patch reading LONGCAT — Prompt: Ultra realistic portrait photo, 85mm lens look, soft rim light. A person wearing a black jacket with an embroidered patch that reads “LONGCAT”. Neutral background, detailed skin texture.

Embroidery pushes the model to keep both texture and letters. This prompt also exposes common face issues like waxy skin and misaligned eyes.

Test 6: tiny engraving on glass (hard)

Glass chess set with engraved text on the king piece — Prompt: Transparent glass chess set on a reflective black table. Dramatic spotlights, realistic refraction. Tiny engraving text “FLUX” on the king piece.

Tiny text stays hard for most models. Even when the lighting looks right, the letters often melt into the material.

What it does well

Handles short text better than typical diffusion baselines.
Keeps a clean look at 1024×1024 without heavy artifacts.
Scene lighting stays stable across very different prompts.

What still needs work

Small text and labels still drift.
Ultra tiny engraving remains unreliable.
Some prompts can over-smooth fine texture.

Try it

LongCat-Image on Wiro