Qwen3 32B is available on Lambda's Inference API

Post Details

Company

Lambda

Date Published

May 29, 2025

Author

Anthony Walsh

Word Count

815

Language

English

Hacker News Points

-

Source URL

lambda.ai/blog/qwen3-32b-is-available-on-lambdas-inference-api

Summary

Qwen3-32B, a dense model developed by Alibaba, is now available on Lambda's Inference API, offering advanced capabilities such as hybrid reasoning, multilingual support, and agentic capacities. With its STEM and logical reasoning proficiency, Qwen3-32B can process complex tasks that require human intervention, including coding, math, and logic. The model features two problem-solving modes, thinking mode and non-thinking mode, allowing developers to switch between sequential processing and instant responses. It also supports 119 languages and dialects, creative writing, role-playing, instruction following, and multi-turn dialogue. Qwen3-32B can execute agentic actions, enabling developers to call their tools of choice and modify Model Context Protocol configuration files. The model was pre-trained with 36 trillion tokens from online sources and PDF documents, delivering similar performance to the Qwen2.5 base models while operating more efficiently due to improved pre-training processes. Qwen3-32B is available on Lambda's Inference API for $0.10 per million input tokens and $0.30 per million output tokens, with no rate limits or cost-efficient pricing.