Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You can try it out on https://chat.qwen.ai/ - sign in with Google or GitHub (signed out users can't use the voice mode) and then click on the voice icon.

It has an entertaining selection of different voices, including:

*Dylan* - A teenager who grew up in Beijing's hutongs

*Peter* - Tianjin crosstalk, professionally supporting others

*Cherry* - A sunny, positive, friendly, and natural young lady

*Ethan* - A sunny, warm, energetic, and vigorous boy

*Eric* - A Sichuan Chengdu man who stands out from the crowd

*Jada* - The fiery older sister from Shanghai



Many of these voices are especially hillarious when you switch the language.

In Russian, Ryan sounds like a westerner who started reading Russian words a month ago.

Dylan sounds somewhat authentic, while everyone else is a different degree of heavy-asian-accented Russian.


The voices are really fun, thanks for the laughs :)


I only see Omni Flash, is that the one?


same, did you figure it out?


I think so, you need to click the big jagged audio icon to start a voice session.


Is the Qwen3-Omni-Flash the same as Qwen3-Omni-30B-A3B, or is the Omni-Flash a different closed-source model?


In Section 5 of their [technical report](https://arxiv.org/pdf/2509.17765v1) they mention them

"... A comprehensive evaluation was performed on a suite of models, including Qwen3-Omni-30B-A3B- Instruct, Qwen3-Omni-30B-A3B-Thinking, and two in-house developed variants, designated Qwen3- Omni-Flash-Instruct and Qwen3-Omni-Flash-Thinking. These “Flash” models were designed to improve both computational efficiency and performance efficacy, integrating new functionalities, notably the support for various dialects. ..."


My question too




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: