AI is not going down anytime soon especially because of free powerful LLMs like GPT-4o and Claude.
A lot of startups are already using it to build cool products and reach massive audiences.
Today, I'm covering 17 powerful LLM tools that went viral after launch. These examples might give you some ideas for your own projects.
Let's jump in.
By the way, I’m part of Latitude and we’re building an open source prompt engineering platform. You can check our public GitHup repo.
You would be able to do a lot of cool stuff like:
⚡ Create and iterate prompt in the platform.
⚡ Run evaluations to check your prompts with LLMs.
⚡ Optimize the output of your LLM-based features automatically.
I'm very confident that you will love it after release!
1. GPT Crawler - create your own custom GPT from a URL.
With GPT Crawler, you can crawl any site to generate knowledge files to make your own custom GPT from one or multiple URLs.
The objective is to make the docs site interactive, people can more simply find the answers they are looking for using a chat interface.
Watch this quick demo!
You will have to configure the crawler and then simply run it. After the crawl is complete, you will have a new output.json
file, which includes the title, URL and extracted text from all the crawled pages.
You can now upload this directly to ChatGPT by creating a new GPT. Once uploaded, this GPT assistant will have all the information from those docs and be able to answer unlimited questions about them.
It's officially a assistant in ChatGPT.
You can read the docs on how to get started. You can find all the instructions on the official blog.
If you are wondering how Mitosis compiles those components, then watch this quick tutorial.
They have 18.4k stars on GitHub.
2. Personal AI - your personal sweet AI.
PI is an AI chatbot (created by Inflection AI) and is built to provide helpful, engaging and fun conversations. I loved the onboarding section!
I have used GPT and Gemini but Pi seems more human-like. You can have conversations about yourself and general things in life, that you would do with a friend.
You can also ask it to convey in voice. The fact I love it because of how calm the User Interface is, it feels so peaceful and you can do all sorts of stuff.
They also offer a desktop app that is free to download.
You can watch the video if you want (AI voice).
It's not open source but very powerful.
3. Vanna - Chat with your SQL database.
Vanna is an MIT-licensed open-source Python RAG (Retrieval-Augmented Generation) framework for SQL generation.
Basically, it's a Python package that uses retrieval augmentation to help you generate accurate SQL queries for your database using LLMs.
It's perfect for developers like me who are not fond of SQL queries!
Vanna works in two easy steps - train a RAG model
on your data, and then ask questions that will return SQL queries that can be set up to run on your database automatically.
You don't need to know how this whole stuff works to use it.
You just have to train
a model, which stores some metadata, and then use it to ask
questions.
Get started with the following command.
pip install vanna
To make things a little easier, they have built user interfaces that you can use as a starting point for your own custom interface. Find all the interfaces including Jupyter Notebook and Flask.
You can read the docs and you can try this Colab notebook in case you want to see how it works after training.
Watch this demo for a complete walkthrough!
They have 9.5k+ stars on GitHub and are built using Python.
4. AI Dungeon - text-based adventure-story game you direct while the AI brings it to life.
AI Dungeon is one of the coolest use case of LLM that I have seen so far. It's creative and has a lot of options.
There are no rules or objectives since they are only focused on adventure.
AI Dungeon is a text-based, AI generated fantasy simulation with infinite possibilities. Unlike most games where you experience worlds created by game designers, with AI Dungeon, you can direct the AI to create worlds, characters and scenarios for your character to interact with.
Watch the official trailer :)
You could lead an army fighting back an alien invasion or be a cool detective investigating the assassination of the queen.
You can discover 200+ playable scenarios at play.aidungeon.com/discover.
If you are not aware, Scenarios are basically templates that players can use to start a new Adventure.
You can filter them based on your choice like playable/readable, multiple choice or 3rd person and so many more.
They have created a complete guidebook to answer all your queries.
You can find the official app on platstore and apple store. They have 1M+ downloads which makes them very credible.
You can even improve this by using tools like Sora (upcoming launch).
They are not open source but worth checking out.
5. Chat2DB - AI-driven data management platform.
Chat2DB is an AI-first data management, development, and analysis tool.
Its core is AIGC (Artificial Intelligence Generation Code), which can convert natural language into SQL, SQL into natural language, and automatically generate reports, taking efficiency to another level.
Even operations that do not understand SQL can use it to quickly query business data and generate reports.
For instance, when you are doing database development, it will help you generate SQL directly in natural language, SQL optimization suggestions, analyze SQL performance, SQL execution plan, and can also help you quickly generate SQL test data, system code and more. Lots of jargon but it's actually very powerful :)
They have excellent support for multi-data sources and can easily integrate up to 17 different database types including PostgreSQL, MySQL, MongoDB, and Redis.
You can download or try it in the browser.
Let's see some of the exciting features:
⚡ Intelligent reports.
⚡ Data Exploration.
⚡ SQL Development.
You can read the quickstart guide on the official docs.
They have 14k+ stars on GitHub and are on the release v3.2
.
6. Mem - magic notes app.
Mem is the AI notes app that keeps you organized.
You can use their AI assistant to summarize insights or draft content based on their notes. There is also a concept of automated tagging.
It's not free (a major drawback) but you can fill out a form for huge discounts.
This is how the dashboard looks. The search feature is very handy and there are lots of useful features.
Let's see what else we can do with Mem.
⚡ You can use Mem Chat to write content by using a deep understanding of your notes.
⚡ You can use AI to create new collections.
⚡ You can pick things from your favorite apps.
⚡ Search beyond your keywords.
I have attached a tutorial to help you on how to use Mem.
It's not open source but you can find everything on their blogs.
7. Character AI - chat with AI characters as if they were humans.
Character AI is a full-stack AI tool that lets users personalize their experience by interacting with AI Characters.
You can chat with those virtual characters as if they were humans.
There are many filters and personas that you can try. Their community is very huge with around 300k+ members.
You can create a character with a simple form.
And of course, add a custom persona if the existing one isn't up to your requirements.
I found a funny official video to show you behind the scenes.
You should explore it yourself to get the overall idea.
It's not open source.
8. Prompt Perfect - optimize your prompts.
It's a tool to optimize your prompt. Using LLM to optimize the input that is again going to use LLM is kind of fancy.
PromptPerfect is a prompt optimization tool designed to enhance the quality of large language models and large model results. To use PromptPerfect, you need to input a prompt and customize various settings according to their needs.
These settings include the maximum prompt length, the number of iterations to be optimized, and the quality of the output. It's a useful one if you use models like GPT-4, Claude and Midjourney.
Some of the exciting features are:
⚡ Concept of auto-tune.
⚡ Convert to precise prompts.
⚡ Improve content based on your target audience.
⚡ Also make images.
There is a whole lot of stuff that you can do. Just explore all the features.
It has a free tier plan of 10 requests per day but it's not open source.
9. Heygen - create awesome videos using AI.
With Heygen, you can create production-ready videos from text in seconds with AI-generated avatars and voices.
HeyGen does multiple things related to video generation and is a leader in generative AI for video production purposes. As such, it sports a great AI Avatar generator, AI-powered Text-to-Speech, and AI voice cloner. These tools are wrapped in a clean User Interface that lets creators get more done in less.
It works in three simple steps.
Watch the quick demo!
It is present in a lot of languages.
They also provide ChatGPT script writer that you can use to create videos faster.
They recently received $60M funding so they're growing fast.
They're not open source but are definitely worth checking out.
10. Micro Agent - AI agent that writes (actually useful) code for you.
AI-assisted coding tools like GitHub Copilot and ChatGPT don't produce very reliable code and they often don't work correctly right out of the box, you find bugs, edge cases, or even references to non-existent APIs.
This can lead to a frustrating loop of trying the generated code, finding issues, going back to the AI for fixes and repeating.
The time spent debugging can negate the total time saved using AI tools in the first place.
Micro Agent uses AI to mitigate the problems of unreliable code generation.
Give it a prompt and it'll generate a test and then iterate on code until all test cases pass.
You can install it using this command.
npm install -g @builder.io/micro-agent
# Next, set your OpenAI API key when prompted or manually using this.
micro-agent config set OPENAI_KEY=<your token>
# Then you can run to start a new coding task
micro-agent
Micro Agent will prompt you to describe the function you want, generate tests, and start writing code in your preferred language to make the tests pass. Once all the tests are green, you'll have a fully functional, test-backed function ready to use.
Let's explore some of the most mind blowing use cases:
⚡ 30-second demo of Micro Agent generating tests and code for a TypeScript function that groups anagrams together from an array of strings.
⚡ Using Micro Agent to generate a simple HTML to AST parser (it was achieved on two iterations).
⚡ Unit test matching.
⚡ Visual matching (experimental).
⚡ Integration with Figma.
Micro Agent can also integrate with Visual Copilot to connect directly with Figma to ensure the highest fidelity possible design to code!
Visual Copilot connects directly to Figma to assist with pixel-perfect conversion, exact design token mapping, and precise usage of your components in the generated output.
Then, Micro Agent can take the output of Visual Copilot and make final adjustments to the code to ensure it passes TSC, lint, tests, and fully matches your design including final tweaks. Amazing right :)
You can read the docs and the official blog where the team discussed everything about the micro agent.
It's open source with 2.1k stars on GitHub.
11. EditGPT - improve your writing with a powerful AI editor.
EditGPT helps you proofread, edit and improve your writing with a powerful AI editor.
There is also an editGPT extension that works directly on the ChatGPT website. It lets you see the difference between your input and ChatGPT's output.
This process doesn't allow you to edit text directly and is a bit of a hassle if you want to resubmit parts you don't like. The new editor allows you to edit text directly and you can even save your prompts.
There are a lot of options available to improve your message clearly.
There is a free tier available for 10k words per month but it's limited to 1k words per request.
It's not open source but they do support 20+ languages.
12. Fireflies - Automate your meeting notes.
As a developer, I'm not fond of meetings but if you attend too many meetings then this is for you!
Fireflies help your team transcribe, summarize, search, and analyze voice conversations. Automate your meeting notes!
Watch this quick demo!
They provide an insane amount of useful features:
⚡ Get automatic meeting notes, action items, & transcripts.
⚡ Find anything with AI-powered search.
⚡ Collaborate with your co-workers.
⚡ Automate workflows from meetings.
⚡ A real-time knowledge base for your entire team
Plus, you can capture & transcribe meetings in multiple ways.
There are a lot of features, but the AI ones are only included in premium plans.
The good thing is that their plans are affordable and they offer a free tier plan with 800mins/storage which is more than enough.
13. Web ChatGPT - ChatGPT with internet access.
This leads to ChatGPT prompts with relevant web search results through web browsing. This means it adds relevant web results to your prompts to ChatGPT for more accurate and up-to-date conversations.
You can find the extension on Chrome, Edge and Firefox.
Watch the quick demo!
It has more than 1.5M users for three simple features:
⚡ Get web results for your queries.
⚡ Also provides a free one-click ChatGPT prompt library with hundreds of high-quality prompts.
⚡ Get AI-powered search answers right beside your search results by using ChatGPT, Claude, Bard and Bing AI.
It's not open source but has an excellent use case.
14. Private GPT - ask questions about your documents without the internet.
PrivateGPT is a production-ready AI project that allows you to ask questions about your documents using the power of Large Language Models (LLMs), even in scenarios without an internet connection.
100% private meaning no data leaves your execution environment at any point.
The API is divided into two logical blocks:
a. High-level API, which abstracts all the complexity of a RAG (Retrieval Augmented Generation) pipeline implementation:
- Ingestion of documents: internally managing document parsing, splitting, metadata extraction, embedding generation, and storage.
- Chat & Completions using context from ingested documents: abstracting the retrieval of context, the prompt engineering, and the response generation.
b. Low-level API, which allows advanced users to implement their complex pipelines:
- Embeddings generation: based on a piece of text.
- Contextual chunks retrieval: given a query, returns the most relevant chunks of text from the ingested documents.
You can read the installation guide to get started.
You can read the docs and the detailed architecture that is involved.
PrivateGPT is now evolving towards becoming a gateway to generative AI models and primitives, including completions, document ingestion, RAG pipelines, and other low-level building blocks.
They have 53k+ stars on GitHub and evolving at a rapid pace.
15. Perplexica - AI-powered search engine.
You would have heard about Perplexity AI somewhere along your journey, Perplexica is an open source alternative to that!
Perplexica is an AI-powered search tool or an AI-powered search engine that goes deep into the internet to find answers. It not only searches the web but also understands your questions.
It uses advanced machine learning algorithms like similarity searching and embeddings to refine results and provides clear answers with sources cited.
Using SearxNG to stay current and fully open source, Perplexica ensures you always get the most up-to-date information without compromising your privacy.
If you're wondering about the architecture, you can read about the key components.
I also recommend reading the more detailed explanation of how these components work together. They've explained it clearly by taking an example of a scenario where a user asks: How does an A.C. work?
.
Some of the features are:
⚡ Some search tools might give you outdated info because they use data from crawling bots convert them into embeddings and store them in an index. Unlike them, Perplexica uses SearxNG, a metasearch engine to get the results and rerank and get the most relevant source out of it, ensuring you always get the latest information without the overhead of daily data updates.
⚡ You can make use of local LLMs such as Llama3 and Mixtral using Ollama.
⚡ It has a copilot mode to search instead of just using the context by SearxNG, it visits the top matches and tries to find relevant sources to the user's query directly from the page.
⚡ Perplexica currently has 6 focus modes:
- YouTube Search Mode.
- Wolfram Alpha Search Mode.
- Reddit Search Mode.
- Academic Search Mode.
- Writing Assistant Mode.
- All Mode (searches the entire web).
You can watch the complete walkthrough on YouTube!
They have 12.5k+ stars on GitHub and are on the v1.8
release.
16. Email Triager - writing emails using AI in your style.
EmailTriager automatically learns how your email and pre-drafts are ready to send replies for you in the background. Watch the demo attached below!
It automatically trains on your past emails and drafts show up directly in Gmail if you're worried about changing your email workflow.
As I said earlier, EmailTriager is the only email assistant that actually learns to triage and write emails like you.
This is how it would show as a result of the old way vs the new way.
It's not open source and there is no free tier which is a major drawback. But $10/month is worth it in my opinion if you write a lot of emails.
17. Udio - generate music using AI.
Udio can help you create music from simple text prompts by specifying topics, genres and other descriptors, then watch them transform into high-quality tracks.
Watch this demo and listen to some impressive music. I loved it! Plug in your earphones :)
Once you have created your masterpiece (or just a song you like), publish it on Udio for everyone to enjoy.
You can, of course, export your creations to a DAW or upload your own audio to experience Udio’s fantastic AI-editing tools.
This is how the dashboard looks.
They also provide a step-by-step tutorial. To get more credits, you just need to watch 10s music and let them know which you prefer. A win-win situation for both of us unless the song is bad.
If you want a YouTube tutorial, then I recommend watching this!
It's not open source but I'm already loving the songs :)
Did you find anything good enough?
It took me a long time to research, so let me know if you have any awesome ideas or tools that you want to share.
AI is already changing the world, developers need to adapt as well.
Have a great day! Till next time.
Follow Latitude for more content like this.
Top comments (13)
Solid work! Thanks!
Thanks for reading Serhiy :)
I’m sure others will find something valuable in this.
for sure!!!
Hi guys, I've tried to do my best. Let me know if you have any feedback.
It's high time for LLMs since every company is trying to use them to add something AI related yk.
If you're wondering why I covered these, it's because we learn from real use cases (or tools) and you might get a new idea for your project by just reading this.
AI Dungeon is pre GPT 3!!
Whoa! I didn’t know that actually.
I recently found out and was really surprised by the unique concept.
Good to see AI Dungeon has evolved!
Thanks for reading Martin.
Why do so many people already know about AI Dungeon? 😅
I only recently found out about it and thought most people wouldn't know. Guess I’m a little late to the party :)
Ahahah, so many of the tools listed, I didn't!
Is there any open source alternative to Udio?
There are alternatives, but they are not as good compared to Udio. Even Suno isn't really better, according to community feedback. You can check these:
I hope this helps!
Thanks!
Hi Anmol, thanks for sharing about voice agents! We're thrilled to introduce TEN(github.com/TEN-framework/TEN-Agent), the world's first real-time multimodal agent framework for next-gen AI agents. It's open-source and lets developers build agents with voice, video, and more in real-time. We'd love your feedback, and if there's anything we can do to make TEN more accessible, just let us know!