Qwen3-32B, a dense model developed by Alibaba, is now available on Lambda's Inference API, offering advanced capabilities such as hybrid reasoning, multilingual support, and agentic capacities. With its STEM and logical reasoning proficiency, Qwen3-32B can process complex tasks that require human intervention, including coding, math, and logic. The model features two problem-solving modes, thinking mode and non-thinking mode, allowing developers to switch between sequential processing and instant responses. It also supports 119 languages and dialects, creative writing, role-playing, instruction following, and multi-turn dialogue. Qwen3-32B can execute agentic actions, enabling developers to call their tools of choice and modify Model Context Protocol configuration files. The model was pre-trained with 36 trillion tokens from online sources and PDF documents, delivering similar performance to the Qwen2.5 base models while operating more efficiently due to improved pre-training processes. Qwen3-32B is available on Lambda's Inference API for $0.10 per million input tokens and $0.30 per million output tokens, with no rate limits or cost-efficient pricing.