Company
Date Published
Author
Yessen Kanapin
Word count
541
Language
English
Hacker News points
None

Summary

Databricks Dolly v2 12b is a 12 billion parameter instruction-tuned casual language model based on EleutherAI's pythia-12b, pretrained on The Pile and GPT-J's corpus, and fine-tuned using the open-source databricks-dolly-15k dataset. DeepInfra facilitates the deployment and inference of this model through its platform, requiring users to obtain an API key for access. The model can be deployed via the web dashboard or API, with inference requests charged per execution time, running on Nvidia A100 cards. Additionally, DeepInfra integrates with LlamaIndex for document indexing and searching, and offers various other AI models and tools, available via their platform.