Company
Date Published
Author
Yessen Kanapin
Word count
349
Language
English
Hacker News points
None

Summary

Databricks Dolly v2 12b is a 12 billion parameter instruction-tuned casual language model derived from EleutherAI's pythia-12b and pretrained on The Pile and GPT-J's pretraining corpus. This model is optimized using the open-source databricks-dolly-15k instruction-following dataset and can be deployed through the DeepInfra web dashboard or API, with the deployment triggered upon the first inference request. Inference requests can be made via a REST API, and usage is priced at $0.0005 per second, running on Nvidia A100 cards. DeepInfra provides a fully managed GPU infrastructure for scalable model hosting, and users can access further documentation and support through their website and Discord server.