Eval format

A portable, tool-agnostic JSON format for eval datasets and run results.

Subpath: @agentskit/core/eval-format.

Dataset

{
  "name": "triage-v1",
  "version": "1.0.0",
  "cases": [
    {
      "id": "refund",
      "input": "How do I get a refund?",
      "expect": { "kind": "regex", "value": "refund policy" }
    }
  ]
}
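
The shape of the document above can be written down as TypeScript types. This is a minimal sketch inferred only from the example: the interface names and the `checkDataset` helper are hypothetical, not exports of `@agentskit/core/eval-format`, and the rules that every field is a string and that case ids are unique are assumptions.

```typescript
// Types inferred from the example dataset; fields not shown there are unknown.
type ExpectationKind = 'literal' | 'regex' | 'normalized' | 'similarity'

interface Expectation {
  kind: ExpectationKind
  value: string
}

interface EvalCase {
  id: string
  input: string
  expect: Expectation
}

interface EvalDataset {
  name: string
  version: string
  cases: EvalCase[]
}

const KNOWN_KINDS: readonly string[] = ['literal', 'regex', 'normalized', 'similarity']

// Hypothetical helper: returns a list of problems, empty when the document
// matches the shape above. Unique ids are an assumption, not a stated rule.
function checkDataset(doc: unknown): string[] {
  if (typeof doc !== 'object' || doc === null) return ['dataset must be an object']
  const { name, version, cases } = doc as Record<string, unknown>
  const problems: string[] = []
  if (typeof name !== 'string') problems.push('name must be a string')
  if (typeof version !== 'string') problems.push('version must be a string')
  if (!Array.isArray(cases)) return [...problems, 'cases must be an array']

  const seen = new Set<string>()
  cases.forEach((entry: unknown, i: number) => {
    const item = (typeof entry === 'object' && entry !== null ? entry : {}) as Record<string, unknown>
    const { id, input, expect } = item
    const exp = (typeof expect === 'object' && expect !== null ? expect : {}) as Record<string, unknown>
    const { kind, value } = exp

    if (typeof id !== 'string') problems.push(`cases[${i}].id must be a string`)
    else if (seen.has(id)) problems.push(`cases[${i}].id "${id}" is duplicated`)
    else seen.add(id)

    if (typeof input !== 'string') problems.push(`cases[${i}].input must be a string`)
    if (typeof kind !== 'string' || !KNOWN_KINDS.includes(kind)) {
      problems.push(`cases[${i}].expect.kind is not a known kind`)
    }
    if (typeof value !== 'string') problems.push(`cases[${i}].expect.value must be a string`)
  })
  return problems
}
```

Calling `checkDataset(JSON.parse(text))` before handing a file to other tooling gives a readable list of what is wrong instead of a failure deep inside a run.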

Expectation kinds

Kind         Match rule
literal      exact string equality
regex        RegExp test
normalized   whitespace + case-insensitive
similarity   cosine ≥ threshold (needs embedder)
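
To make the four rules concrete, here is one way they could be implemented. It is a sketch under stated assumptions, not the library's code: `sketchMatch`, `ExpectationSketch`, and `EmbedFn` are hypothetical names, the `threshold` field and its 0.8 fallback are invented for illustration, the regex is compiled without flags, and "normalized" is read as trimming, collapsing whitespace runs, and lowercasing both sides.

```typescript
// Hypothetical types for this sketch only; `threshold` is an assumed field.
type ExpectationSketch = {
  kind: 'literal' | 'regex' | 'normalized' | 'similarity'
  value: string
  threshold?: number
}
type EmbedFn = (text: string) => number[]

// Assumed reading of "normalized": trim, collapse whitespace, lowercase.
const normalizeText = (s: string): string => s.trim().replace(/\s+/g, ' ').toLowerCase()

// Cosine similarity of two vectors; returns 0 if either has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  const dot = a.reduce((sum, x, i) => sum + x * (b[i] ?? 0), 0)
  const normA = Math.sqrt(a.reduce((sum, x) => sum + x * x, 0))
  const normB = Math.sqrt(b.reduce((sum, x) => sum + x * x, 0))
  return normA === 0 || normB === 0 ? 0 : dot / (normA * normB)
}

function sketchMatch(output: string, expect: ExpectationSketch, embed?: EmbedFn): boolean {
  switch (expect.kind) {
    case 'literal':
      return output === expect.value
    case 'regex':
      // RegExp test: true if the pattern matches anywhere in the output.
      return new RegExp(expect.value).test(output)
    case 'normalized':
      return normalizeText(output) === normalizeText(expect.value)
    case 'similarity': {
      if (!embed) throw new Error('similarity expectations need an embedder')
      const threshold = expect.threshold ?? 0.8 // arbitrary fallback for the sketch
      return cosineSimilarity(embed(output), embed(expect.value)) >= threshold
    }
    default:
      throw new Error('unknown expectation kind')
  }
}
```

Note that `regex` succeeds on a partial match, so the dataset example above passes for any output that contains "refund policy"; anchoring the pattern with `^` and `$` would require the whole output to match.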

API

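import { matchesExpectation, parseEvalSuite } from '@agentskit/core/eval-format'

// `json` is a dataset document in the format above;
// `output` is the text your agent produced for the first case.
const suite = parseEvalSuite(json)
const ok = matchesExpectation(output, suite.cases[0].expect)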
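
The snippet above checks a single case. Scoring a whole suite is a loop over `cases`; the sketch below shows one way to do it. `scoreSuite` and its types are hypothetical helpers, not part of the package, and the returned object is an ad hoc tally rather than the spec's run-result format. The matcher is passed in as a parameter so the block stays self-contained; passing `matchesExpectation` there assumes it is synchronous and returns a boolean, as the snippet above suggests.

```typescript
// Hypothetical shapes for this sketch only.
type CaseLike = { id: string; input: string; expect: { kind: string; value: string } }
type MatcherFn = (output: string, expect: CaseLike['expect']) => boolean
type CaseOutcome = { id: string; passed: boolean }

// Scores already-collected outputs (keyed by case id) against each case's expectation.
// A case with no recorded output counts as failed.
function scoreSuite(
  cases: CaseLike[],
  outputs: Record<string, string | undefined>,
  matches: MatcherFn,
): { outcomes: CaseOutcome[]; passRate: number } {
  const outcomes = cases.map((c) => {
    const output = outputs[c.id]
    return { id: c.id, passed: output !== undefined && matches(output, c.expect) }
  })
  const passedCount = outcomes.filter((o) => o.passed).length
  return { outcomes, passRate: cases.length === 0 ? 0 : passedCount / cases.length }
}
```

Keeping output collection separate from scoring means the same recorded outputs can be re-scored after an expectation changes, without calling the agent again.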