Hi all! We open-sourced a framework for testing LLMs, RAGs, and chatbots. The tool automates query generation, completion requests, regression detection, penetration testing, and hallucination assessment. Designed for developers, researchers, and businesses. And we are looking for contributors! Feel free to try it out for yourself and share your feedback!
For further actions, you may consider blocking this person and/or reporting abuse
Top comments (1)
Hi, Nice work. I would love to know why most of the frameworks use only "llm as a judge" for hallucination. Why not perplexity and semantic entropy? dev.to/mayank_laddha_21ef3e061ff/d...