Company
Date Published
Author
Jesse Mostipak
Word count
340
Language
English
Hacker News points
None

Summary

AudioGen is a breakthrough text-to-audio model from the AudioCraft family of models by Meta AI, capable of generating an array of sounds based on simple text inputs after being trained on publicly available sound effects. It is now available in the Baseten model library and can be deployed with just two clicks, utilizing efficient GPUs for optimal performance. Once deployed, users can run inference to generate one clip per prompt as a base64 encoded WAV file, producing impressive results such as footsteps on a wooden floor, small dog barking, and man talking with an emergency vehicle siren.