Qwen3.5-27B: 6 Quick Tests on Reasoning, Parsing, and Code
Qwen3.5-27B shows how a 27B multimodal model handles long-context reasoning and mixed tasks. This short review runs six focused tests: product copy, invoice parsing, translation, a Python utility, a JSON Schema, and a short summary task. Each test includes the exact prompt and the raw model output.
Model
Quick summary
- Model type: 27B multimodal LLM with extended context.
- Strengths: instruction following, structured outputs, long-context handling.
- Observed limits: verbose “thinking” sections appear by default; prompts must force concise output for clean machine-readable results.
Tests and outputs
Test 1 — Product copy (4 sentences)
Prompt: Write a 4 sentence product description for noise cancelling headphones. Plain language. No hype. Mention battery life and comfort.
These headphones reduce outside sounds to help you focus on your audio. The padded ear cups are designed to stay comfortable during extended use. A single charge provides up to thirty hours of continuous listening time. They connect wirelessly to your devices for easy setup.
Test 2 — Invoice → JSON
Prompt: Convert the following messy invoice into JSON with keys invoice_id, date, vendor, total_usd, items (array of {name, qty, unit_price_usd}). Return JSON only, no extra text.
Invoice #A-1049
Date: 2026/03/01
Vendor: Blue Finch Tools
– 2x Torque wrench (1/2 inch drive) $39.50 each
– 1x Safety glasses (clear) $12.00
Tax: $7.41
Total: $98.41
{
"invoice_id": "A-1049",
"date": "2026/03/01",
"vendor": "Blue Finch Tools",
"total_usd": 98.41,
"items": [
{ "name": "Torque wrench (1/2 inch drive)", "qty": 2, "unit_price_usd": 39.5 },
{ "name": "Safety glasses (clear)", "qty": 1, "unit_price_usd": 12.0 }
]
}
Test 3 — Turkish → English (short translation)
Prompt: Translate to English. Return English only.
Turkish: Bu modeli API ile denedim ama cevaplarda <think> bolumu geliyor. Kapatmanin yolu var mi?
I tried this model with the API, but the <think> section appears in the responses. Is there a way to disable it?
Test 4 — Python utility: parse_duration
Prompt: Write a Python function parse_duration(s) that accepts strings like 2h 15m, 90m, 45s, 1h and returns total seconds as an int. Handle spaces. Assume lowercase h m s only. Include 4 example calls as comments. Return code only.
import re
def parse_duration(s):
total_seconds = 0
for num_str, unit in re.findall(r"(\d+)([hms])", s):
num = int(num_str)
if unit == 'h':
total_seconds += num * 3600
elif unit == 'm':
total_seconds += num * 60
elif unit == 's':
total_seconds += num
return total_seconds
# Examples:
# print(parse_duration("2h 15m")) # 8100
# print(parse_duration("90m")) # 5400
# print(parse_duration("45s")) # 45
# print(parse_duration("1h")) # 3600
Test 5 — JSON Schema (draft 2020-12)
Prompt: Create a minimal JSON Schema (draft 2020-12) for an event object with fields: id (string), startTime (string, date-time), endTime (string, date-time), title (string), location (string, optional), attendees (array of strings, optional). Return JSON only.
{
"$schema": "https://json-schema.org/draft/2020-12/schema",
"type": "object",
"properties": {
"id": { "type": "string" },
"startTime": { "type": "string", "format": "date-time" },
"endTime": { "type": "string", "format": "date-time" },
"title": { "type": "string" },
"location": { "type": "string" },
"attendees": { "type": "array", "items": { "type": "string" } }
},
"required": ["id", "startTime", "endTime", "title"]
}
Test 6 — Short summary + Best for
Prompt: Summarize the text in 5 bullet points, then add a 1-line Best for: statement. Keep it short.
- Qwen3.5-27B uses a thinking stage before final answers.
- Thinking content appears in a separate block by default.
- Direct responses require API flags to disable thinking.
- The model supports very long context windows (262K tokens).
- Context can be extended further depending on serving stack.
Best for: complex reasoning tasks that need very long context windows.
Short assessment
The model follows instructions and returns structured outputs when prompts require them. The default thinking block adds verbosity; set API flags for direct answers when machine-readable output is needed. For tasks that need large context or detailed reasoning, Qwen3.5-27B performs well.
Try it
Try the model on Wiro: https://wiro.ai/models/qwen/qwen3-5-27b