Company
Date Published
Author
Michelle Chen Ashish Datta
Word count
784
Language
English
Hacker News points
None

Summary

OpenAI has released its latest open-weight models, a 120 billion parameter model and a 20 billion parameter model, as part of a collaboration with Cloudflare, which is making these models available through its Workers AI platform. These Mixture-of-Experts models, which run natively at FP4 quantization, offer improved efficiency and speed compared to traditional dense models and are designed for text-only applications with features like reasoning capabilities, tool calling, and Code Interpreter. Cloudflare is utilizing its infrastructure, including its Sandbox product, to support these models' capabilities, allowing for stateful code execution and rapid deployment. The models are integrated with the new Responses API format recommended by OpenAI, and Cloudflare is providing support for developers to build applications using these models on its platform, emphasizing transparency, customizability, and data security.