Company
Date Published
Author
Together AI
Word count
1191
Language
English
Hacker News points
None

Summary

Together AI is announcing the availability of on-demand Dedicated Endpoints, which offer unmatched price-performance for scaling AI inference in production. This new service provides a balance between flexibility and affordability, making it an ideal solution for startups and large companies alike. With up to 43% lower pricing than previous offerings, Dedicated Endpoints deliver high performance, full control and customizability over the deployment hardware and configuration, support for custom fine-tuned models, no minimum commitments, and no upfront costs. The service allows users to spin up on-demand Dedicated Endpoints for popular open-source models or upload their own custom fine-tuned model from Hugging Face, deploy it instantly, and start running inference without any additional storage or upload fees. This update is expected to provide significant cost savings at scale, making it an attractive option for mission-critical AI applications that require reliable QPS, predictable availability, and seamless handling of surges without performance dips.