Company
Date Published
Author
Philip Kiely
Word count
339
Language
English
Hacker News points
None

Summary

When you deploy a model on Baseten, you can call it through an API endpoint with zero configuration, which saves you the effort of building and deploying your own model server. This makes it faster to start interacting with a model, though integrating a generative AI model into an application may still require some glue code. An API endpoint sits behind every model in your Baseten workspace. It is accessible through the "Call model" button on the model dashboard, and you can call it from the command line, from your programming language of choice, or from an API tool like Postman. Once you can call the endpoint, you can parse the model output and use it in production, with instructions provided for calling the model from Python, cURL, JavaScript, and Truss.
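
As a rough illustration of the glue code mentioned above, here is a minimal Python sketch of calling a deployed model over HTTP and parsing the JSON response. The URL pattern, the `Api-Key` authorization scheme, and the function names `build_predict_request` and `call_model` are assumptions made for this example rather than details confirmed by the summary; the values shown by the "Call model" button on the model dashboard should take precedence.

```python
import json
import urllib.request

# Assumption: this URL pattern is illustrative only. Copy the real endpoint
# from the "Call model" dialog on your model dashboard.
PREDICT_URL_TEMPLATE = "https://model-{model_id}.api.baseten.co/production/predict"


def build_predict_request(model_id: str, api_key: str, model_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the model input as JSON."""
    return urllib.request.Request(
        url=PREDICT_URL_TEMPLATE.format(model_id=model_id),
        data=json.dumps(model_input).encode("utf-8"),
        headers={
            # Assumption: the API key is passed with an "Api-Key" scheme.
            "Authorization": f"Api-Key {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def call_model(model_id: str, api_key: str, model_input: dict, timeout: float = 60.0):
    """Send the request and return the model output parsed from JSON."""
    request = build_predict_request(model_id, api_key, model_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a real model ID and API key, something like `call_model("your_model_id", "your_api_key", {"prompt": "Hello"})` would return the parsed output, whose shape depends on the model. Keeping request construction separate from sending makes the glue code easy to check without touching the network.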