Content Deep Dive
How to deploy Databricks Dolly v2 12b, an instruction-tuned causal language model.
Blog post from DeepInfra
Post Details
Company
Date Published
Author
Yessen Kanapin
Word Count
349
Language
English
Hacker News Points
-
Source URL
Summary
Databricks Dolly v2 12b is a 12-billion-parameter instruction-tuned causal language model derived from EleutherAI's pythia-12b, which was pretrained on The Pile (also GPT-J's pretraining corpus). The model is fine-tuned on the open-source databricks-dolly-15k instruction-following dataset. It can be deployed through the DeepInfra web dashboard or API, and the deployment is triggered by the first inference request. Inference requests are made via a REST API; usage is priced at $0.0005 per second of inference and runs on Nvidia A100 cards. DeepInfra provides fully managed GPU infrastructure for scalable model hosting, and users can find further documentation and support through its website and Discord server.
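As a rough illustration of the REST inference flow and per-second pricing described in the summary, here is a minimal Python sketch. The endpoint layout (`https://api.deepinfra.com/v1/inference/<model>`), the model identifier `databricks/dolly-v2-12b`, the `{"input": ...}` request body, and the bearer-token header are assumptions not stated in the summary, and the helper names are hypothetical; check the DeepInfra documentation before relying on any of them. Only the $0.0005 per second rate comes from the summary.

```python
import json
import urllib.request

# Assumed endpoint layout and model id; verify against the DeepInfra docs.
API_BASE = "https://api.deepinfra.com/v1/inference"
MODEL = "databricks/dolly-v2-12b"

# Rate quoted in the summary: $0.0005 per second of inference.
PRICE_PER_SECOND_USD = 0.0005


def build_request(prompt: str, api_token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for one inference call."""
    body = json.dumps({"input": prompt}).encode("utf-8")  # assumed body shape
    return urllib.request.Request(
        url=f"{API_BASE}/{MODEL}",
        data=body,
        headers={
            "Authorization": f"bearer {api_token}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def estimate_cost_usd(runtime_seconds: float) -> float:
    """Estimate the cost of a call from its inference runtime in seconds."""
    return runtime_seconds * PRICE_PER_SECOND_USD


def run_inference(prompt: str, api_token: str, timeout: float = 120.0) -> dict:
    """Send the request and return the decoded JSON response.

    Per the summary, the first request triggers the deployment, so it may
    take noticeably longer than later ones; hence the generous timeout.
    """
    request = build_request(prompt, api_token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `run_inference("Explain what a causal language model is.", token)` would send the request with a real API token. Building the request separately from sending it keeps the payload easy to inspect and lets the cost helper be used on its own, for example `estimate_cost_usd(10)` for a call that takes ten seconds.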