AudioGen: deploy and build today!

Post Details

Company

Baseten

Date Published

Aug. 4, 2023

Author

Jesse Mostipak

Word Count

340

Language

English

Hacker News Points

-

Source URL

www.baseten.co/blog/audiogen-deploy-and-build-today

Summary

AudioGen is a breakthrough text-to-audio model from the AudioCraft family of models by Meta AI, capable of generating an array of sounds based on simple text inputs after being trained on publicly available sound effects. It is now available in the Baseten model library and can be deployed with just two clicks, utilizing efficient GPUs for optimal performance. Once deployed, users can run inference to generate one clip per prompt as a base64 encoded WAV file, producing impressive results such as footsteps on a wooden floor, small dog barking, and man talking with an emergency vehicle siren.