How to deploy Databricks Dolly v2 12b, instruction tuned casual language model.

Post Details

Company

Deepinfra

Date Published

April 12, 2023

Author

Yessen Kanapin

Word Count

349

Company Posts That Month

3

Language

English

Hacker News Points

-

Source URL

deepinfra.com/blog/databricks_dolly

Summary

Databricks Dolly v2 12b is a 12 billion parameter instruction-tuned casual language model derived from EleutherAI's pythia-12b and pretrained on The Pile and GPT-J's pretraining corpus. This model is optimized using the open-source databricks-dolly-15k instruction-following dataset and can be deployed through the DeepInfra web dashboard or API, with the deployment triggered upon the first inference request. Inference requests can be made via a REST API, and usage is priced at $0.0005 per second, running on Nvidia A100 cards. DeepInfra provides a fully managed GPU infrastructure for scalable model hosting, and users can access further documentation and support through their website and Discord server.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	3	668	124	62	-20%