Qwen 3 is an open model in the Qwen series.There are multiple releases available including a 30/3 billion parameter MoE model.

Qwen3 models are hybrid reasoning models that can be prompted to think or not by passing special tokens /think or /no_think in the prompt. Thinking is apparently enabled by default.

Qwen3 can be easily run within ollama and are supported in v0.8.5 of vllm

Resources