New in January 2024

Post Details

Company

Baseten

Date Published

Jan. 31, 2024

Author

Baseten

Word Count

580

Language

English

Hacker News Points

-

Source URL

www.baseten.co/blog/new-in-january-2024

Summary

The new model library, launched in January 2024, aims to simplify the comparison and exploration of open-source machine learning models by categorizing them based on task, family, and publisher. The library provides detailed information about each model's version, variant, size, optimizations, license, and other essential properties. This is achieved through a more intuitive taxonomy that makes it easier for developers to find the right models for their needs. Additionally, NVIDIA's L4 GPU has been made available for model inference on Baseten, offering a cost-effective alternative to A10G-based instances with improved performance for compute-bound workloads. The library also includes a new introduction to quantizing ML models, providing insights into its advantages and risks for improving model performance without compromising output quality.