Speech to Text using Assembly AI

#devchallenge #assemblyaichallenge #ai #api

This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.

What I Built

I built a Speech-to-Text Application that showcases the power of Universal-2, AssemblyAI’s latest speech-to-text model. The application:

Supports Multilingual Transcription Users can choose from multiple languages, ensuring global accessibility.
Outputs with Formatting and Timestamps application delivers well-structured transcripts, complete with proper nouns, punctuation, and timestamps.
User-Friendly Interface built using Streamlit, the app features an intuitive frontend for easy navigation and interaction.

Incorporating Universal-2:
The application utilizes Universal-2 through AssemblyAI’s robust API. The backend:

Uploads audio files using AssemblyAI's upload endpoint.
Submits transcription requests, including optional parameters like language_code and punctuate.
Polls transcription progress until completion and fetches the final transcript with timestamps, and word-by-word breakdown.

Team Submission:
I worked on this project independently-Sunder Kumar