Llama 2 sets a new state of the art (SOTA) among open-source large language models (OSS LLMs). It comes in three sizes (7B, 13B, and 70B parameters), each with a 4k-token context window. Stable Diffusion XL 1.0 creates high-quality images from shorter prompts, so users can specify what they want to see without appending lengthy descriptions. Model autoscaling is now available: replicas of the model server are created and deleted automatically in response to incoming traffic, which keeps throughput cost-effective. Autoscaling also supports scale-to-zero deployments, which cost nothing while they are not in use.
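To make the autoscaling idea concrete, here is a minimal sketch of how a replica count might be derived from traffic. It is an illustration only, not the platform's actual algorithm: the function name `desired_replicas` and the parameters `in_flight_requests`, `concurrency_target`, `min_replicas`, and `max_replicas` are all hypothetical names chosen for this example.

```python
import math


def desired_replicas(
    in_flight_requests: int,
    concurrency_target: int,
    min_replicas: int = 0,
    max_replicas: int = 3,
) -> int:
    """Return how many replicas to run for the current load.

    Hypothetical helper: all names and defaults are assumptions
    made for illustration, not a real autoscaler API.
    """
    if concurrency_target <= 0:
        raise ValueError("concurrency_target must be positive")
    if min_replicas < 0 or max_replicas < min_replicas:
        raise ValueError("need 0 <= min_replicas <= max_replicas")

    # Replicas needed so that no replica handles more than
    # concurrency_target requests at once.
    needed = math.ceil(in_flight_requests / concurrency_target)

    # Clamp to the configured bounds. With min_replicas=0, zero
    # traffic yields zero replicas, i.e. scale-to-zero.
    return max(min_replicas, min(max_replicas, needed))
```

With `min_replicas=0`, an idle deployment resolves to zero replicas, which is the scale-to-zero case described above. Setting `min_replicas=1` instead would keep one replica warm at all times, trading idle cost for the absence of a cold start.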