Quality Testing

Test Runner




Test output will appear here once a test is running.

Baseline

Saved: 2026-03-18 — 17 prompts
88.2%
Classification rate
82.4%
KB hit rate
2.7
Avg excerpts
109
Avg tokens
645
Avg duration (ms)