New in July 2023 - Plushcap

Post Details

Company

Baseten

Date Published

Aug. 2, 2023

Author

Baseten

Word Count

514

Language

English

Hacker News Points

-

Source URL

www.baseten.co/blog/new-in-july-2023

Summary

Llama 2 is a new state of the art (SOTA) in open-source large language models (OSS LLMs), offering three variants with varying sizes and capabilities, including a 4k-token context window. Stable Diffusion XL 1.0 creates high-quality images from shorter prompts, allowing users to specify exactly what they want to see without appending lengthy descriptions. Model autoscaling is now available, enabling cost-effective throughput by automatically creating and deleting replicas of the model server in response to incoming traffic, with a focus on scale-to-zero deployments that pay zero dollars when not in use.