Omkar tripathi

Posted on Nov 23

DialogueAI: Interactive Playground for AssemblyAI with Automatic Code Generation

#devchallenge #assemblyaichallenge #ai #developertool

This is a submission for the AssemblyAI Challenge: Sophisticated Speech-to-Text and No More Monkey Business.

What I Built: DialogueAI (GitHub)

I built DialogueAI, an interactive playground for AssemblyAI's speech-to-text API and its LeMUR summarization model. The goal is to help users who are new to these APIs get working results quickly, without the steep learning curve that usually comes with unfamiliar documentation.

Key Features of the Platform:

- Interactive Playground: Users can explore and experiment with the API through an intuitive interface. Input boxes, selection options, model selection, and summary types are all easily adjustable.
- Instant Results: With a single click, users can execute API calls and see the results immediately. This helps bridge the gap between learning and actual implementation.
- Code Generation: For those who prefer to make API calls themselves, the platform generates code snippets that can be run directly on their own systems. This reduces the time and effort needed to understand and use the API.
- Smart Summary Page: Like the main playground, this page offers configuration options and examples to help users generate summaries of transcripts quickly. Users can also copy the generated code for their own use.
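
To illustrate the Code Generation idea, here is a minimal sketch of how a configuration could be turned into a runnable snippet. It is not DialogueAI's actual implementation: the `PlaygroundConfig` shape and the `generateSnippet` function are hypothetical names made up for this example, and the request fields (`audio_url`, `speech_model`, `filter_profanity`, `word_boost`) reflect my understanding of AssemblyAI's transcript endpoint, so check them against the current API reference.

```typescript
// Hypothetical shape of the playground state; field names are illustrative only.
interface PlaygroundConfig {
  audioUrl: string;
  speechModel: string;
  filterProfanity: boolean;
  wordBoost: string[];
}

// Render the configuration as a copy-pasteable fetch() snippet.
// The API key is emitted as a placeholder so it never ends up in shared code.
function generateSnippet(config: PlaygroundConfig): string {
  const body = {
    audio_url: config.audioUrl,
    speech_model: config.speechModel,
    filter_profanity: config.filterProfanity,
    // JSON.stringify drops undefined values, so an empty list is omitted entirely.
    word_boost: config.wordBoost.length > 0 ? config.wordBoost : undefined,
  };
  // Indent the JSON so it lines up inside the generated fetch() call.
  const json = JSON.stringify(body, null, 2).replace(/\n/g, "\n  ");
  return [
    'const response = await fetch("https://api.assemblyai.com/v2/transcript", {',
    '  method: "POST",',
    "  headers: {",
    '    authorization: "<YOUR_API_KEY>",',
    '    "content-type": "application/json",',
    "  },",
    `  body: JSON.stringify(${json}),`,
    "});",
    "const transcript = await response.json();",
    "console.log(transcript.id);",
  ].join("\n");
}
```

Building the snippet from the same object that would be sent as the request body keeps the "run" and "generate code" paths from drifting apart.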

Together, these features let users learn AssemblyAI's APIs by trying them out, which cuts down the time and frustration of navigating complex documentation. That makes the platform useful for developers and anyone else who wants to add speech-to-text and summarization to their projects.

Journey

The inspiration for this platform came from my own experience when I first encountered AssemblyAI's API. I found it a bit confusing to get started with the documentation and the API usage. So, I set out to solve this problem not just for myself but for everyone else who might face the same challenge.

Tech Used

Frontend: React, TypeScript, Tailwind CSS
APIs: AssemblyAI Speech-to-Text, LeMUR summary API
Animations: Framer Motion

Working Features

Interactive Speech-to-Text Configurations:
- Users can easily configure and experiment with various settings.
- Single Click Run: Execute the configuration and see results immediately.
- Single Click Code Generation: Generates the code based on the configuration for users to use directly.

Configurations Available:

API Key
Speech Model
Word Boost
Profanity Filter
Audio Range
Audio Intelligence
Summary Model
Summary Type
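
As a rough sketch, the settings above could map onto a transcription request body like this. The `SpeechToTextSettings` type and `buildTranscriptRequest` function are hypothetical and not taken from the DialogueAI code. The field names and allowed values (`speech_model`, `word_boost`, `filter_profanity`, `audio_start_from`, `audio_end_at`, `summarization`, `summary_model`, `summary_type`) are my assumptions about AssemblyAI's transcript endpoint at the time of writing, and summarization stands in here for the Audio Intelligence options.

```typescript
// Assumed value sets; verify against the current AssemblyAI API reference.
type SummaryModel = "informative" | "conversational" | "catchy";
type SummaryType = "bullets" | "bullets_verbose" | "gist" | "headline" | "paragraph";

// Hypothetical UI state for the speech-to-text playground.
interface SpeechToTextSettings {
  audioUrl: string;
  speechModel: "best" | "nano";
  wordBoost: string[];
  profanityFilter: boolean;
  audioRange?: { startMs: number; endMs: number };
  summary?: { model: SummaryModel; type: SummaryType };
}

// Translate UI settings into a JSON-serializable request body.
// Optional settings are only included when the user has set them.
function buildTranscriptRequest(s: SpeechToTextSettings): Record<string, unknown> {
  if (s.audioRange && s.audioRange.endMs <= s.audioRange.startMs) {
    throw new RangeError("Audio range end must be after its start");
  }
  return {
    audio_url: s.audioUrl,
    speech_model: s.speechModel,
    filter_profanity: s.profanityFilter,
    ...(s.wordBoost.length > 0 ? { word_boost: s.wordBoost } : {}),
    ...(s.audioRange
      ? { audio_start_from: s.audioRange.startMs, audio_end_at: s.audioRange.endMs }
      : {}),
    ...(s.summary
      ? { summarization: true, summary_model: s.summary.model, summary_type: s.summary.type }
      : {}),
  };
}
```

The returned object would be sent as the JSON body of a POST request, with the API key in the `authorization` header.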

Interactive Summary Generation with LeMUR:
- Users can generate summaries with various options and configurations.
- Single Click Run: Instantly generate summaries.
- Single Click Code Generation: Provides the code for generating summaries.

Configurations Available:

API Key
Summary Type (Basic, Custom)
Transcript ID
Model
Prompt
Custom Prompt
Max Output Tokens (Example Pre-coded)
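
To show how the Basic and Custom summary types might differ, here is a small sketch that picks a LeMUR endpoint and builds its request body. `LemurSettings` and `buildLemurRequest` are hypothetical names, not DialogueAI's code. The endpoint paths (`/lemur/v3/generate/summary` and `/lemur/v3/generate/task`) and the fields (`transcript_ids`, `final_model`, `prompt`, `answer_format`, `max_output_size`) are my assumptions about the LeMUR API, so treat them as a starting point to verify.

```typescript
// Hypothetical UI state: "basic" uses the summary endpoint,
// "custom" sends a free-form prompt to the task endpoint.
type LemurSettings =
  | { kind: "basic"; transcriptId: string; model: string; answerFormat?: string; maxOutputSize?: number }
  | { kind: "custom"; transcriptId: string; model: string; prompt: string; maxOutputSize?: number };

interface LemurRequest {
  url: string;
  body: Record<string, unknown>;
}

const LEMUR_BASE = "https://api.assemblyai.com/lemur/v3/generate";

function buildLemurRequest(s: LemurSettings): LemurRequest {
  // Fields shared by both summary types.
  const common = {
    transcript_ids: [s.transcriptId],
    final_model: s.model,
    ...(s.maxOutputSize !== undefined ? { max_output_size: s.maxOutputSize } : {}),
  };
  if (s.kind === "custom") {
    if (s.prompt.trim() === "") {
      throw new Error("A custom summary needs a non-empty prompt");
    }
    return { url: `${LEMUR_BASE}/task`, body: { ...common, prompt: s.prompt } };
  }
  return {
    url: `${LEMUR_BASE}/summary`,
    body: { ...common, ...(s.answerFormat ? { answer_format: s.answerFormat } : {}) },
  };
}
```

Modeling the two summary types as a discriminated union lets the compiler enforce that a prompt is only present, and always present, in the custom case.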

In Development

Chat with the Transcript: Use the LeMUR API to let users ask questions about the generated transcript.
Interactive Quiz Generation: Generate quizzes based on the transcript.

Journey

So far, I've addressed the initial problem for both the speech-to-text API and the LeMUR summary model. This project has been incredibly exciting to work on, on the API side as well as the user interface side.

Looking ahead, I plan to expand the platform to include interactive playgrounds and code generation capabilities for real-time APIs and more sophisticated use cases of LeMUR. This will further streamline the learning and implementation process for developers and enhance the overall user experience.
