Language models are on Replicate
Blog post from Replicate
Replicate now lets users deploy, run, and fine-tune large language models such as FLAN-T5, GPT-J, and LLaMA, and push their own custom models to the platform, either publicly or privately. These models can be run with a few lines of code from Python, Node.js, or an HTTP API, with no servers or GPUs to set up. The platform also offers a preview of cloud fine-tuning, which lets users tailor a model to a specific task, such as building a product-specific support bot or writing emails in a user's style. Access to fine-tuning is limited at first; Replicate plans to expand availability and to publish further guides and examples showing what open-source language models can do.
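To make the "minimal code" claim concrete, here is a rough sketch of what a call through the HTTP API might look like from Python, using only the standard library. It builds the request that would start a prediction but does not send it. Several details are assumptions rather than anything stated in the post: the helper name `build_prediction_request` is hypothetical, the version string is a placeholder, the `prompt` input field varies from model to model, and the `Token` authorization scheme should be checked against Replicate's current API documentation.

```python
import json
import urllib.request

# Endpoint for creating a prediction over Replicate's HTTP API.
API_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(version, prompt, token):
    """Build (but do not send) a POST request that would start a prediction.

    Hypothetical helper for illustration only.
    version: ID of the model version to run (placeholder in this sketch).
    prompt:  text passed to the model; the input field name "prompt" is an
             assumption, since each model defines its own input schema.
    token:   a Replicate API token.
    """
    body = json.dumps({"version": version, "input": {"prompt": prompt}})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        headers={
            # Assumed auth scheme; confirm against the current API reference.
            "Authorization": f"Token {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values: substitute a real version ID and API token.
    request = build_prediction_request(
        "<model-version-id>", "Write a haiku about llamas", "<api-token>"
    )
    # Passing `request` to urllib.request.urlopen would actually send it.
    print(request.get_method(), request.full_url)
```

With a real token and version ID, passing the request to `urllib.request.urlopen` would submit it; the official Python and Node.js clients wrap this same step so that no server or GPU setup is needed on the caller's side.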