Best Practices
- Use enough questions on your dataset to represent real-world usage. A dataset that is too small will not expose gaps effectively.
- Include a variety of questions that include common, edge-case, and failure-case questions.
- Cover each knowledge base, prompt instruction, or integration the agent will rely on to ensure you are evenly testing your agent components.

