Smart Agent Evaluation Runner

Instructions:

  1. Clone this space, define your agent logic, tools, packages, etc.
  2. Log in to Hugging Face.
  3. Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see the score.

Questions and Agent Answers

Questions and Agent Answers