/plushcap/analysis/deepgram/comic-books-videos-yack

Create Comic Books From Videos with yack!

What's this blog post about?

The team behind yack! has developed an automatic video-to-comic book generator using Deepgram's Speech Recognition API and computer vision. The process involves generating a transcript with Deepgram, selecting keyframes in the video, applying comic book styling to the images, overlaying captions as speech bubbles, and placing each 'tile' in a dynamic SVG element. The project leverages Deepgram's utterances feature for understanding keyframes and diarization for color-coded text when different speakers are detected.

Company
Deepgram

Date published
March 9, 2022

Author(s)
Kevin Lewis

Word count
432

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.