Reusable Evaluators and Evaluator Templates in LangSmith
Blog post from LangChain
LangSmith has introduced reusable evaluators and evaluator templates to make agent evaluation, a key step in debugging and improving agent performance, easier to set up and maintain. The platform now offers more than 30 templates covering safety, response quality, trajectory, user behavior, and multimodal evaluation; each can be used as-is or customized for specific needs. The templates support both online monitoring and offline experiments, helping teams categorize production traffic and assess experiment results. A centralized Evaluators tab lets users manage evaluators and apply them across multiple tracing projects, which keeps evaluation consistent and removes the need to duplicate an evaluator for each project. The update is part of LangSmith's broader effort to streamline evaluation by letting users build an evaluator once and apply it wherever it is needed. Upcoming features include spend visibility to help track evaluation costs.
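
To make the "build once, apply across projects" idea concrete, here is a minimal sketch in plain Python. It does not use the LangSmith SDK or reflect its API: the names `Evaluator`, `EvaluatorRegistry`, `conciseness_template`, and the project names are all hypothetical and chosen only for illustration. The sketch assumes an evaluator is a named scoring function, a template is a factory with customizable defaults, and a central registry attaches one evaluator definition to several projects.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical stand-in for a traced run; not a LangSmith type.
Run = Dict[str, str]


@dataclass(frozen=True)
class Evaluator:
    """A named scoring function defined once and shared across projects."""
    name: str
    score: Callable[[Run], float]


def conciseness_template(max_words: int = 50) -> Evaluator:
    """Hypothetical template: usable with its default or customized via max_words."""
    def score(run: Run) -> float:
        # 1.0 if the output fits within the word budget, else 0.0.
        return 1.0 if len(run["output"].split()) <= max_words else 0.0

    return Evaluator(name=f"conciseness<={max_words}", score=score)


@dataclass
class EvaluatorRegistry:
    """Central store: one evaluator definition, many project attachments."""
    evaluators: Dict[str, Evaluator] = field(default_factory=dict)
    attachments: Dict[str, List[str]] = field(default_factory=dict)

    def register(self, evaluator: Evaluator) -> None:
        self.evaluators[evaluator.name] = evaluator

    def attach(self, evaluator_name: str, project: str) -> None:
        if evaluator_name not in self.evaluators:
            raise KeyError(f"unknown evaluator: {evaluator_name}")
        self.attachments.setdefault(project, []).append(evaluator_name)

    def evaluate(self, project: str, run: Run) -> Dict[str, float]:
        # Apply every evaluator attached to this project to a single run.
        return {
            name: self.evaluators[name].score(run)
            for name in self.attachments.get(project, [])
        }


# Customize the template once, then reuse it in two (hypothetical) projects.
registry = EvaluatorRegistry()
concise = conciseness_template(max_words=5)
registry.register(concise)
for project in ("support-bot", "research-agent"):
    registry.attach(concise.name, project)

run = {"input": "Hi", "output": "Hello, how can I help?"}
results = registry.evaluate("support-bot", run)
```

Because both projects reference the same registered evaluator rather than holding their own copies, a change to the scoring logic would apply everywhere at once, which is the consistency benefit the post describes.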