Skip to main content

Agent Evaluations

Evaluations help you test your AI agent’s responses and ensure it’s providing accurate, helpful information to candidates.
This documentation is coming soon. Check back for detailed guides on running agent evaluations.

Why evaluate

  • Verify your agent handles common questions correctly
  • Test edge cases and tricky scenarios
  • Measure improvement over time
  • Catch issues before candidates do

Running evaluations

  1. Navigate to Agents > Evaluations in Studio
  2. Create test cases with expected behaviors
  3. Run the evaluation suite
  4. Review results and adjust guidance as needed

Evaluation types

TypeDescription
AccuracyDoes the agent provide correct information?
HelpfulnessDoes the agent address the candidate’s need?
SafetyDoes the agent avoid inappropriate responses?
Policy complianceDoes the agent follow your defined policies?