Qwen3.5-4B vs Qwen3.5-4B-heretic: 5 tests

Qwen3.5-4B vs Qwen3.5-4B-heretic is easiest to judge when the same short tasks test JSON accuracy, coding discipline, and translation stability side by side.

Qwen3.5-4B vs Qwen3.5-4B-heretic: what stands out

Model links

qwen/qwen3-5-4b on Wiro

qwen/qwen3-5-4b-heretic on Wiro

Cover art (generated)

Abstract split tech background with teal circuit patterns on the left and red glitch texture on the right — Prompt: Abstract tech cover background. Split composition: left side cool teal circuit patterns, right side warm red glitch texture. Soft gradient lighting, minimal, lots of negative space for a title on the left. No text.

What this comparison covers

Qwen3.5-4B is a compact chat model. Qwen3.5-4B-heretic is described as a decensored checkpoint. This post runs five practical prompts and shows the raw outputs side by side.

Test setup

Thinking	Requested: Off
Temperature	0.2
Top-p	0.95
Notes	Some responses include wrapper tags; those are stripped only for display.

5 prompt tests (side by side)

1) JSON-only response

Prompt: Return a JSON object with keys a and b. a=2, b=3. No extra text.

Qwen3.5-4B	Qwen3.5-4B-heretic
{"a": 2, "b": 3}	{"a": 2, "b": 3}

2) Small coding task (slugify)

Prompt: Write Python code only. Implement slugify(s): lowercase, convert spaces and underscores to hyphens, remove all characters except a-z 0-9 and hyphen, collapse multiple hyphens into one, strip leading and trailing hyphens. Include exactly 3 assert tests. No explanations.

Qwen3.5-4B	Qwen3.5-4B-heretic
def slugify(s): s = s.lower() s = s.replace(' ', '-') s = s.replace('_', '-') s = ''.join(c for c in s if c.isalnum() or c == '-') s = s.strip('-') s = re.sub(r'-+', '-', s) return s import re assert slugify("Hello World") == "hello-world" assert slugify("Hello_World") == "hello-world" assert slugify("Hello---World!!!") == "hello-world"	def slugify(s): s = s.lower() s = s.replace(' ', '-') s = s.replace('_', '-') s = ''.join(c for c in s if c.isalnum() or c == '-') s = s.strip('-') s = re.sub(r'-+', '-', s) return s import re assert slugify("Hello World") == "hello-world" assert slugify("Hello_World") == "hello-world" assert slugify("Hello---World") == "hello-world"

Qwen3.5-4B

Qwen3.5-4B-heretic

def slugify(s):
    s = s.lower()
    s = s.replace(' ', '-')
    s = s.replace('_', '-')
    s = ''.join(c for c in s if c.isalnum() or c == '-')
    s = s.strip('-')
    s = re.sub(r'-+', '-', s)
    return s

import re
assert slugify("Hello World") == "hello-world"
assert slugify("Hello_World") == "hello-world"
assert slugify("Hello---World!!!") == "hello-world"

def slugify(s):
    s = s.lower()
    s = s.replace(' ', '-')
    s = s.replace('_', '-')
    s = ''.join(c for c in s if c.isalnum() or c == '-')
    s = s.strip('-')
    s = re.sub(r'-+', '-', s)
    return s

import re

assert slugify("Hello World") == "hello-world"
assert slugify("Hello_World") == "hello-world"
assert slugify("Hello---World") == "hello-world"

3) Field extraction to JSON

Prompt: Extract the fields into JSON. Return JSON only.

Ticket: #1842
Name: Ava Stone
Email: [email protected]
Plan: Pro Annual
Issue: charged twice on 2026-04-16
Action requested: refund the duplicate charge

Keys: id, name, email, plan, issue, requested_action.

Qwen3.5-4B	Qwen3.5-4B-heretic
{ "id": "#1842", "name": "Ava Stone", "email": "[email protected]", "plan": "Pro Annual", "issue": "charged twice on 2026-04-16", "requested_action": "refund the duplicate charge" }	{ "id": "#1842", "name": "Ava Stone", "email": "[email protected]", "plan": "Pro Annual", "issue": "charged twice on 2026-04-16", "requested_action": "refund the duplicate charge" }

Qwen3.5-4B

Qwen3.5-4B-heretic

{
  "id": "#1842",
  "name": "Ava Stone",
  "email": "[email protected]",
  "plan": "Pro Annual",
  "issue": "charged twice on 2026-04-16",
  "requested_action": "refund the duplicate charge"
}

{
  "id": "#1842",
  "name": "Ava Stone",
  "email": "[email protected]",
  "plan": "Pro Annual",
  "issue": "charged twice on 2026-04-16",
  "requested_action": "refund the duplicate charge"
}

4) Constraint following (word counts)

Prompt: Write exactly 5 sentences. Each sentence must be exactly 8 words. Topic: debugging a production incident. No numbering. No extra text.

Qwen3.5-4B	Qwen3.5-4B-heretic
Identify the error quickly to stop the issue. Analyze logs to find the root cause. Fix the code and deploy the patch. Test the fix in a staging environment. Verify the system is stable before closing.	Identify the root cause quickly. Reproduce the error in a lab. Apply a targeted fix to the code. Deploy the fix to production carefully. Monitor the system for regressions.

Qwen3.5-4B

Qwen3.5-4B-heretic

Identify the error quickly to stop the issue.
Analyze logs to find the root cause.
Fix the code and deploy the patch.
Test the fix in a staging environment.
Verify the system is stable before closing.

Identify the root cause quickly.
Reproduce the error in a lab.
Apply a targeted fix to the code.
Deploy the fix to production carefully.
Monitor the system for regressions.

5) Simple arithmetic check

Prompt: A shop sells pens at $1.20 each. You have $10.00. You want to buy as many pens as possible and still have at least $0.40 left. How many pens can you buy? Show the calculation. Keep it under 3 lines.

Qwen3.5-4B	Qwen3.5-4B-heretic
You can buy 7 pens. Calculation: $10.00 - (7 pens × $1.20/pen) = $1.60 remaining, which is greater than $0.40. Buying 8 pens would leave only $0.40, meeting the minimum requirement exactly.	You can buy 8 pens. Calculation: $10.00 - (8 pens × $1.20/pen) = $0.80 left, which is more than $0.40. Buying 9 pens would leave only $0.00.

Qwen3.5-4B

Qwen3.5-4B-heretic

You can buy 7 pens. Calculation: $10.00 - (7 pens × $1.20/pen) = $1.60 remaining, which is greater than $0.40. Buying 8 pens would leave only $0.40, meeting the minimum requirement exactly.

You can buy 8 pens. Calculation: $10.00 - (8 pens × $1.20/pen) = $0.80 left, which is more than $0.40. Buying 9 pens would leave only $0.00.

Quick takeaways

Both models handle short JSON formatting and extraction cleanly.
Both can write small utility code quickly.
Strict constraint prompts (exact word counts) still need validation.
Even simple math prompts can produce small arithmetic slips, so add checks for production use.

Try them

Run Qwen3.5-4B | Run Qwen3.5-4B-heretic

Qwen3.5-4B vs Qwen3.5-4B-heretic: 5 Side-by-Side Tests

Qwen3.5-4B vs Qwen3.5-4B-heretic: what stands out

Model links

Cover art (generated)

What this comparison covers

Test setup

5 prompt tests (side by side)

1) JSON-only response

2) Small coding task (slugify)

3) Field extraction to JSON

4) Constraint following (word counts)

5) Simple arithmetic check

Quick takeaways

Try them

Leave a Comment Cancel reply

Qwen3.5-4B vs Qwen3.5-4B-heretic: what stands out

Model links

Cover art (generated)

What this comparison covers

Test setup

5 prompt tests (side by side)

1) JSON-only response

2) Small coding task (slugify)

3) Field extraction to JSON

4) Constraint following (word counts)

5) Simple arithmetic check

Quick takeaways

Try them

Leave a Comment Cancel reply

Related Posts

Seedream V5 Pro vs Nano Banana Pro: 3 Prompt Tests

FireRed Image Edit vs Seedream V5 Lite: 5 Before and After Tests

LTX-Video vs Kling vs Seedance: 5 Text-to-Video Tests with the Same Prompts

Stay in the Loop