Together AI at ICML 2026: frontier research across the full stack
Blog post from Together AI
Frontier AI research spans multiple layers, from agent development to GPU kernel optimization, ensuring improvements at each stage contribute effectively to the overall system, as demonstrated by the Together platform. Research efforts have produced notable advancements: the DSGym framework standardizes evaluation for data-science agents across diverse tasks, while ThunderAgent enhances agent inference efficiency and TTT-Discover achieves state-of-the-art discoveries using open models. Model shaping techniques like RARO and V1 enhance reasoning capabilities without relying on traditional verifiers, and algorithmic optimizations such as Aurora improve adaptive speculative decoding for dynamic traffic. Systems optimizations, including Untied Ulysses and OEA, address memory efficiency and MoE decode latency, and the ParallelKernelBench sets new benchmarks for multi-GPU kernel generation, emphasizing the importance of communication over computation. These innovations are presented at ICML 2026 in Seoul, where Together AI showcases its comprehensive, multi-layered research approach.
No tracked trend matches for this post yet.