{"id":1235,"date":"2026-03-01T21:39:14","date_gmt":"2026-03-01T21:39:14","guid":{"rendered":"https:\/\/wiro.ai\/blog\/?p=1235"},"modified":"2026-02-25T21:54:25","modified_gmt":"2026-02-25T21:54:25","slug":"qwen3-asr-1-7b-speech-to-text-in-6-audio-tests","status":"publish","type":"post","link":"https:\/\/wiro.ai\/blog\/qwen3-asr-1-7b-speech-to-text-in-6-audio-tests\/","title":{"rendered":"Qwen3-ASR-1.7B: Speech-to-Text in 6 Audio Tests"},"content":{"rendered":"<p>Qwen3-ASR-1.7B is a lightweight speech-to-text model on Wiro. This post runs 6 repeatable audio tests and shows the transcripts it returned.<\/p>\n<p>Model link: <a href=\"https:\/\/wiro.ai\/models\/qwen\/qwen3-asr-1-7b\">https:\/\/wiro.ai\/models\/qwen\/qwen3-asr-1-7b<\/a><\/p>\n<h2>What was tested<\/h2>\n<ul>\n<li>English dictation with numbers and a tracking code<\/li>\n<li>English dictation with URL and token-like strings<\/li>\n<li>Turkish e-commerce style sentence<\/li>\n<li>Spanish customer support style sentence<\/li>\n<li>The same English dictation with added white noise<\/li>\n<li>The token-heavy English clip sped up 1.35x<\/li>\n<\/ul>\n<h2>Test setup<\/h2>\n<p>All input audio clips were generated with <a href=\"https:\/\/wiro.ai\/models\/resemble-ai\/chatterbox-multilingual\">https:\/\/wiro.ai\/models\/resemble-ai\/chatterbox-multilingual<\/a> so the scripts stay consistent across reruns. Then the audio was uploaded to WordPress and used as the ASR input URL. Qwen3-ASR-1.7B was run with language selection plus batchSize=32 and newTokens=256.<\/p>\n<h2>Results<\/h2>\n<table>\n<thead>\n<tr>\n<th>Test<\/th>\n<th>Language<\/th>\n<th>Condition<\/th>\n<th>Notes<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>1<\/td>\n<td>English<\/td>\n<td>Clean<\/td>\n<td>Strong punctuation and numbers<\/td>\n<\/tr>\n<tr>\n<td>2<\/td>\n<td>English<\/td>\n<td>Clean<\/td>\n<td>Struggled with domain and token spelling<\/td>\n<\/tr>\n<tr>\n<td>3<\/td>\n<td>Turkish<\/td>\n<td>Clean<\/td>\n<td>Major errors in amounts and codes<\/td>\n<\/tr>\n<tr>\n<td>4<\/td>\n<td>Spanish<\/td>\n<td>Clean<\/td>\n<td>Digits and time were unstable<\/td>\n<\/tr>\n<tr>\n<td>5<\/td>\n<td>English<\/td>\n<td>White noise mix<\/td>\n<td>Held up well on this clip<\/td>\n<\/tr>\n<tr>\n<td>6<\/td>\n<td>English<\/td>\n<td>1.35x speed<\/td>\n<td>Similar errors to test 2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Test 1: English clean dictation<\/h3>\n<figure>\n  <audio controls preload=\"metadata\" src=\"https:\/\/wiro.ai\/blog\/wp-content\/uploads\/2026\/02\/qwen3-asr-test-01-en-clean.mp3\"><\/audio><figcaption>Prompt script: For the shipping audit, order 48219 shipped on February 14 at 9:05 AM. Total weight 3.7 kilograms. Tracking code Z X dash 9 1 dash Delta.<\/figcaption><\/figure>\n<p>Qwen3-ASR output:<\/p>\n<pre>For the shipping audit, order 48219 shipped on February 14 at 9:05 a.m. Total weight 3.7 kilograms. Tracking code ZX-91-DELTA.<\/pre>\n<h3>Test 2: English with URL and token-like strings<\/h3>\n<figure>\n  <audio controls preload=\"metadata\" src=\"https:\/\/wiro.ai\/blog\/wp-content\/uploads\/2026\/02\/qwen3-asr-test-02-en-tokens.mp3\"><\/audio><figcaption>Prompt script: Email support plus wiro at acme dot dev. URL https colon slash slash api dot example dot com slash v1 slash run question mark mode equals fast ampersand retry equals 2. Error code E underscore C O N N underscore R E S E T. Commit seven f three a nine c one.<\/figcaption><\/figure>\n<p>Qwen3-ASR output:<\/p>\n<pre>Email support plus yiro at acme.dev. URL s colon slash slash api.example.com slash v1 slash run question mark mode equals fast ampersand retry equals two. Error code e underscore c o n n underscore r e s e t. Commit seven f three a nine c one.<\/pre>\n<h3>Test 3: Turkish<\/h3>\n<figure>\n  <audio controls preload=\"metadata\" src=\"https:\/\/wiro.ai\/blog\/wp-content\/uploads\/2026\/02\/qwen3-asr-test-03-tr.mp3\"><\/audio><figcaption>Prompt script: Sepet tutari 1.249,90 TL. Kargo kodu T R dash 508 dash A B. Teslimat 3 gun icinde. Iade suresi 14 gundur.<\/figcaption><\/figure>\n<p>Qwen3-ASR output:<\/p>\n<pre>Sepet tutar\u0131 180000 komisiki 16 TL kargo kodu TR016<\/pre>\n<h3>Test 4: Spanish<\/h3>\n<figure>\n  <audio controls preload=\"metadata\" src=\"https:\/\/wiro.ai\/blog\/wp-content\/uploads\/2026\/02\/qwen3-asr-test-04-es.mp3\"><\/audio><figcaption>Prompt script: El pedido numero 1740 llego el martes a las 18:30. El codigo postal es 28013. Gracias por llamar.<\/figcaption><\/figure>\n<p>Qwen3-ASR output:<\/p>\n<pre>El pedido n\u00famero B 740 lleg\u00f3 el martes a las de 8:40. El c\u00f3digo postal es 2800 S. Gracias por llamar. Hay fiends en tu.<\/pre>\n<h3>Test 5: English with added white noise<\/h3>\n<figure>\n  <audio controls preload=\"metadata\" src=\"https:\/\/wiro.ai\/blog\/wp-content\/uploads\/2026\/02\/qwen3-asr-test-05-en-noisy.mp3\"><\/audio><figcaption>Prompt script: Same as test 1, mixed with white noise.<\/figcaption><\/figure>\n<p>Qwen3-ASR output:<\/p>\n<pre>For the shipping audit, order 48219 shipped on February 14 at 9:05 a.m. Total weight 3.7 kilograms. Tracking code ZX-91-DELTA.<\/pre>\n<h3>Test 6: English token clip at 1.35x speed<\/h3>\n<figure>\n  <audio controls preload=\"metadata\" src=\"https:\/\/wiro.ai\/blog\/wp-content\/uploads\/2026\/02\/qwen3-asr-test-06-en-fast.mp3\"><\/audio><figcaption>Prompt script: Same as test 2, audio sped up 1.35x.<\/figcaption><\/figure>\n<p>Qwen3-ASR output:<\/p>\n<pre>Email support plus wireo at acme.dev. URL s colon slash slash api.example.com slash v1 slash run question mark mode equals fast ampersand retry equals two error code e underscore c o n n underscore r e s e t commit seven f three a nine c one<\/pre>\n<h2>Quick takeaways<\/h2>\n<ul>\n<li>Clean English dictation looked solid, including numbers and punctuation.<\/li>\n<li>Token-heavy text (URLs, underscores, commit hashes) needs cleanup rules on the client side.<\/li>\n<li>This Turkish and Spanish sample did not hold up. More language-specific testing is needed before relying on it.<\/li>\n<\/ul>\n<h2>Try it<\/h2>\n<p><a href=\"https:\/\/wiro.ai\/models\/qwen\/qwen3-asr-1-7b\">Qwen3-ASR-1.7B on Wiro<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Qwen3-ASR-1.7B is a lightweight speech-to-text model on Wiro. This post runs 6 repeatable audio tests and shows the transcripts it returned. Model&hellip;<\/p>\n","protected":false},"author":4,"featured_media":1234,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[52],"tags":[101,94,100,63],"class_list":["post-1235","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-model-reviews","tag-asr","tag-audio","tag-qwen","tag-speech-to-text"],"_links":{"self":[{"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/posts\/1235","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/comments?post=1235"}],"version-history":[{"count":1,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/posts\/1235\/revisions"}],"predecessor-version":[{"id":1236,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/posts\/1235\/revisions\/1236"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/media\/1234"}],"wp:attachment":[{"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/media?parent=1235"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/categories?post=1235"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wiro.ai\/blog\/wp-json\/wp\/v2\/tags?post=1235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}