Compare named-entity recognition (NER) model outputs side by side, score them, find disagreements, and build labeled gold datasets by hand or by correcting model predictions. JSON in, JSON out — no account, your data stays in the browser.
modelResponses per example and review them side by side. The tool computes per-model precision, recall, and F1 against any humanAnnotations, surfaces disagreements, and exposes a cross-model label confusion matrix.humanAnnotations; export the file to keep your gold dataset.The full schema is at /data-schema.json. Uploads are validated with Ajv at upload time; mismatches surface a path-keyed error. Minimal example:
{
"schemaVersion": 2,
"examples": [
{
"id": "q1",
"text": "OpenAI was founded in San Francisco in 2015.",
"modelResponses": [
{
"modelName": "Model A",
"inferenceTime": 0.045,
"entities": [
{ "text": "OpenAI", "label": "ORG", "start": 0, "end": 6, "confidence": 0.95 },
{ "text": "San Francisco", "label": "LOC", "start": 22, "end": 35, "confidence": 0.92 }
]
}
],
"humanAnnotations": [
{ "text": "OpenAI", "label": "Organization", "start": 0, "end": 6 }
]
}
],
"modelNames": ["Model A"]
}examples — array of objects.examples[].id — unique string identifier.examples[].text — the source text.examples[].modelResponses — array (may be empty for hand-annotation projects).examples[].modelResponses[].modelName — string.examples[].modelResponses[].entities — array of { text, label }.modelNames — array of model name strings (top-level).schemaVersion — integer; current schema is 2.examples[].humanAnnotations — array of gold entities.examples[].rejectedPredictions — predictions the user has dismissed.entity.start, entity.end — character offsets into text.entity.confidence — number in [0, 1].customLabelColors, savedThemes, labelDefinitions.The Export JSON button writes the same shape back out, plus a scores map keyed by example id and model name (1–5 star ratings, category, notes) and a metadata block with totals and per-model precision / recall / F1.
See also: /llms.txt (markdown mirror for LLM agents) and /example-data.json (full working example).