Mellum2 Goes Open Source: A Fast Model for AI Workflows

Post Details

Company

JetBrains

Date Published

June 1, 2026

Author

Anton Semenkin Nikita Pavlichenko Read this post in other languages: Français, 한국어

Word Count

566

Company Posts That Month

53

Language

American English

Hacker News Points

-

Post removed?

No

Source URL

blog.jetbrains.com/ai/2026/06/mellum2-goes-open-source-a-fast-model-for-ai-workflows

Summary

Mellum2, an AI model developed by JetBrains and now open-sourced under the Apache 2.0 license, is designed to address core challenges in AI production systems, such as latency, throughput, and cost, primarily in software engineering environments. With a Mixture-of-Experts architecture, Mellum2 utilizes 12 billion parameters but activates only 2.5 billion per token, leading to lower compute costs and faster inference times. This model is specialized in natural language and code, making it highly suitable for tasks such as routing, summarization, and intermediate reasoning in AI workflows. Mellum2's design philosophy emphasizes the importance of "focal models," which are fast, specialized components that perform high-frequency tasks efficiently, as opposed to relying solely on large, multimodal models. It offers a cost-effective and high-performance solution for routing AI workloads, building low-latency retrieval-augmented generation (RAG) pipelines, and enabling private, local AI deployments, thereby providing a significant competitive edge in production-grade AI applications.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
RAG	2	992	256	104	-53%
Local AI	1	65	36	18	+38%
Real-time	1	5,515	1,316	255	-4%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.

Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog