Content Deep Dive
Measuring Hallucinations in RAG Systems
Blog post from Vectara
Post Details
Company: Vectara
Date Published: -
Author: Shane Connelly
Word Count: 1,183
Language: English
Hacker News Points: -
Summary
Vectara has released an open-source Hallucination Evaluation Model (HEM) that provides a FICO-like score for grading how often generative LLMs hallucinate in Retrieval Augmented Generation (RAG) systems. The model helps mitigate the risks of hallucinations, such as factual errors or biases introduced by training data, by quantifying how trustworthy a RAG system's outputs are and identifying which LLMs are best suited for a given use case. The HEM comes with a scorecard that compares various models, including GPT-4, GPT-3.5, and others, on hallucination rate, accuracy, and summary length, allowing users to make informed decisions about their generative AI adoption.
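The core workflow the post describes is scoring a generated summary against the source passages it was produced from. As a rough illustration only (not taken from the post itself), the sketch below assumes the open-source HEM is published on Hugging Face under the name vectara/hallucination_evaluation_model and can be loaded as a sentence-transformers cross-encoder; both the model identifier and the loading API are assumptions, and the actual usage may differ.

```python
# Hypothetical sketch: scoring RAG outputs for factual consistency with a
# cross-encoder-style hallucination evaluation model. The model name and the
# CrossEncoder loading path are assumptions, not confirmed by the post.
from sentence_transformers import CrossEncoder

# Assumed Hugging Face identifier for the open-source HEM.
model = CrossEncoder("vectara/hallucination_evaluation_model")

# Each pair is (retrieved source text, model-generated summary).
pairs = [
    ("The company reported Q3 revenue of $12M, up 8% year over year.",
     "Revenue grew 8% year over year to $12M in Q3."),
    ("The company reported Q3 revenue of $12M, up 8% year over year.",
     "Revenue doubled to $24M in Q3."),
]

# Higher scores indicate the summary is more consistent with its source;
# lower scores suggest hallucinated content.
scores = model.predict(pairs)
for (_, summary), score in zip(pairs, scores):
    print(f"{score:.3f}  {summary}")
```

Aggregating such per-pair scores over a benchmark set of documents is, in spirit, how a leaderboard-style comparison of hallucination rates across LLMs can be assembled.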