Deploy StableLM with Truss

Post Details

Company

Baseten

Date Published

April 20, 2023

Author

Tuhin Srivastava

Word Count

423

Language

English

Hacker News Points

-

Source URL

www.baseten.co/blog/deploy-stablelm-with-baseten-and-truss

Summary

Deploy StableLM with Truss Stability AI recently announced the ongoing development of the StableLM series of language models, which were released alongside a number of checkpoints for this model. These models are ideal for conversational and coding-related tasks and can be deployed using Baseten and Truss infrastructure to provide scalable and cost-efficient performance. The deployment process is made easy through the truss push command, allowing users to deploy StableLM behind a REST API for immediate use in production. With auto-scaling resources, StableLM ensures efficient and low-latency performance even in high-traffic scenarios. The models can be modified by changing the load method in model.py or by configuring GPU resources in config.yaml, and system prompts can be added for use in chatbots. Users can get started with $30 of free credits.