Auto-Eval of Question-Answering Tasks

Company

LangChain

Date Published

April 15, 2023

Author

Word count

509

Language

English

Hacker News points

URL

blog.langchain.dev/auto-eval-of-question-answering-tasks

Summary

We introduce auto-evaluator, a simple tool for evaluating question-answering (QA) chains built on large language models such as GPT-3.5-turbo. It lets users assemble different chain configurations and experiment with them to optimize QA performance. The tool uses an LLM to score the quality of both the retrieved documents and the generated answers, and it exposes the grading prompts and results so they can be inspected by hand and compared across tests.
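
The LLM-as-grader idea in the summary can be illustrated with a minimal sketch. This is not the auto-evaluator code: the names `QAExample`, `GradedResult`, `build_grading_prompt`, `evaluate`, and `toy_grader` are all hypothetical, the prompt wording is invented for illustration, and the grader is a plain substring check standing in for a real LLM call so the example runs offline.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class QAExample:
    question: str
    reference_answer: str


@dataclass
class GradedResult:
    question: str
    predicted_answer: str
    grading_prompt: str  # kept so a human can inspect what the grader saw
    verdict: str         # "CORRECT" or "INCORRECT"


def build_grading_prompt(example: QAExample, predicted: str) -> str:
    # Hypothetical prompt wording, not the tool's actual prompt.
    return (
        "Grade the student answer against the reference answer.\n"
        f"Question: {example.question}\n"
        f"Reference answer: {example.reference_answer}\n"
        f"Student answer: {predicted}\n"
        "Reply with CORRECT or INCORRECT."
    )


def evaluate(
    examples: List[QAExample],
    qa_chain: Callable[[str], str],
    grader: Callable[[str], str],
) -> List[GradedResult]:
    """Run the chain on each question, then ask the grader for a verdict."""
    results = []
    for ex in examples:
        predicted = qa_chain(ex.question)
        prompt = build_grading_prompt(ex, predicted)
        verdict = grader(prompt).strip().upper()
        results.append(GradedResult(ex.question, predicted, prompt, verdict))
    return results


def fraction_correct(results: List[GradedResult]) -> float:
    """Share of examples the grader marked CORRECT (0.0 for an empty list)."""
    if not results:
        return 0.0
    return sum(r.verdict == "CORRECT" for r in results) / len(results)


def toy_grader(prompt: str) -> str:
    # Assumption: stand-in for an LLM call. It parses the "Label: value"
    # lines of the prompt and does a case-insensitive substring match.
    fields = dict(
        line.split(": ", 1) for line in prompt.splitlines() if ": " in line
    )
    ok = fields["Reference answer"].lower() in fields["Student answer"].lower()
    return "CORRECT" if ok else "INCORRECT"
```

To compare chain configurations in this sketch, one would call `evaluate` once per configuration with the same examples and grader, then compare `fraction_correct` and read the stored `grading_prompt` values for any surprising verdicts. Swapping `toy_grader` for a function that sends the prompt to a model is left as an assumption about how a real setup would look.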