38 blog posts published by month since the start of 2025. Start from a different year:

Posts year-to-date
38 (22 posts by this month last year.)
Average posts per month since 2025
3.2

Post details (2025 to today)

Title Author Date Word count HN points
Build ultra low latency voice AI applications with Together AI and Cartesia Sonic Together AI Jan 23, 2025 829 -
How to deploy DeepSeek-R1 and distilled models securely on Together AI Together AI Jan 31, 2025 1004 -
Mistral Small 3 API now available on Together AI: A new category leader in small models Together AI Jan 30, 2025 712 -
Generate images with specific styles using Flux LoRAs on Together AI Together AI Jan 27, 2025 891 -
Deploy DeepSeek-R1 at scale: Fast, secure serverless APIs and large-scale Together Reasoning Clusters Together AI Feb 12, 2025 984 -
Together AI Achieves 90% Faster BF16 Training with NVIDIA Blackwell Platform and Together Kernel Collection Together AI Feb 13, 2025 1422 -
Minions: embracing small LMs, shifting compute on-device, and cutting cloud costs in the process Avanika Narayan*, Dan Biderman*, Sabri Eyuboglu*, Avner May, Scott Linderman, James Zou, Christopher Ré Feb 25, 2025 1257 -
Together AI Announces $305M Series B to Scale AI Acceleration Cloud for Open Source and Enterprise AI Together AI Feb 20, 2025 808 -
Together AI becomes NVIDIA Cloud Partner to bolster accelerated AI offerings Together AI Mar 11, 2025 744 -
ThunderKittens Now Optimized for NVIDIA Blackwell GPUs Benjamin Spector, Aaryan Singhal, Dan Fu, Chris Ré Mar 15, 2025 1573 -
On-demand dedicated endpoints: run inference with unmatched price-performance & control at scale Together AI Mar 13, 2025 1191 -
Introducing Together Instant GPU Clusters Accelerated by NVIDIA GPUs, with Self-Service Provisioning in Minutes Together AI Mar 18, 2025 800 -
Together AI Powers Pioneers at GTC: NVIDIA Blackwell GPUs, Instant GPU Clusters, and A Full-Stack for AI Innovation Together AI Mar 18, 2025 1836 -
Deploy Leading AI Models Accelerated by NVIDIA NIM on Together AI Together AI Mar 18, 2025 744 -
Introducing Together Chat: use DeepSeek R1 for free, hosted in North America Hassan El Mghari Mar 24, 2025 648 -
Together AI Awarded ClusterMAX™ Gold Rating by SemiAnalysis Together AI Mar 27, 2025 973 -
Together AI partners with Meta to offer Llama 4: SOTA Multimodal MoE Models Together AI Apr 05, 2025 608 -
Scaling AI Companions: How Dippy AI Reached Over 4 Million Tokens/Minute with Together Dedicated Endpoints Together AI Apr 01, 2025 1074 -
DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level Michael Luo*, Sijun Tan*, Roy Huang*, Ameen Patel*, Alpay Ariyak*, Qingyang Wu*, Xiaoxiang Shi, Rachel Xin, Colin Cai, Maurice Weber, Ce Zhang, Li Erran Li, Raluca Ada Popa, Ion Stoica Apr 08, 2025 2870 31
Together Fine-Tuning Platform, Now With Preference Optimization and Continued Training Anirudh Jain, Ivan Provilkov, Artem Chumachenko, Alex Moldovan, George Grigorev, Gleb Vazhenin, Arsh Zahed, Avner May, Tristan Dubbeld, Max Ryabinin Apr 17, 2025 1360 -
Direct Preference Optimization Ivan Provilkov, Zain Hasan, Max Ryabinin Apr 17, 2025 1472 37
Continued Fine-tuning of LLMs Artem Chumachenko, Zain Hasan, Max Ryabinin Apr 17, 2025 1292 -
Open Deep Research Together AI Apr 16, 2025 3100 -
Chipmunk: Training-Free Acceleration of Diffusion Transformers with Dynamic Column-Sparse Deltas Austin Silveria, Soham Govande, Dan Fu Apr 21, 2025 1550 -
Salesforce, Zoom, InVideo Train Faster with Together AI Turbocharged with NVIDIA Blackwell Together AI Apr 24, 2025 1145 -
Together AI acquires Refuel.ai to unlock data for developers and businesses building production-grade AI applications Together AI May 15, 2025 792 -
Together Code Sandbox: the most robust infrastructure for building AI coding products at scale Together AI May 20, 2025 920 1
Together Code Interpreter: execute LLM-generated code seamlessly with a simple API call Together AI May 20, 2025 946 -
Introducing Together Code Sandbox & Together Code Interpreter: SOTA code execution for AI Together AI May 20, 2025 1112 -
Boosting DeepSeek-R1’s Speed with Customized Speculative Decoding Wai Tong Chung, Dan Waters, Avner May, Ben Athiwaratkun May 12, 2025 1284 -
FLUX.1 Kontext models: Character consistency and precise image editing without fine-tuning Together AI May 29, 2025 734 -
From AWS to Together Dedicated Endpoints: Arcee AI's journey to greater inference flexibility Together AI May 05, 2025 1448 -
Mixture-of-Agents Alignment: Harnessing the Collective Intelligence of Open-Source LLMs to Improve Post-Training Junlin Wang, Roy Xie, Shang Zhu, Jue Wang, Ben Athiwaratkun, Bhuwan Dhingra, Shuaiwen Leon Song, Ce Zhang, James Zou May 28, 2025 1394 -
Model-Preserving Adaptive Rounding with YAQA Albert Tseng, Zhaofeng Sun, and Chris De Sa Jun 05, 2025 2091 -
How to Build a Coding Agent from Scratch: A Practical Guide for Developers Zain Hasan Jun 12, 2025 1060 -
Introducing the Together AI Batch API: Process Thousands of LLM Requests at 50% Lower Cost TOGETHER AI Jun 11, 2025 637 -
The Frontier is Open Charles Zedlewski Jun 09, 2025 1351 1
Bringing 100,000 GPUs to Europe Together AI Jun 12, 2025 731 -