As more and more apps record audio and video files we need to start thinking about what to do with those files. Playing them back isn't enough. Media files are full of data that developers can start exploiting thanks to an emergent category of signal and natural language processing APIs.
There are only 3 options for processing the words embedded in these files:
- Transcribe them yourself, manually.
- Find a transcript made by someone else.
- Use a library that extracts the words for you.
As the developer of a python library that automates the extraction and processing of words in media files, I'll demonstrate how easy it is to make audio and video libraries fully searchable, create a word cloud of keywords from a recorded phone call, and extract topics from news broadcast.
I'll show coding examples as well as products using this API.