The following results were produced with the benchmarks directory in this repository. The test documents are real-world messages gathered from the Archipelago client. Benchmark environment: ...
"""EXP-02 / EXP-08: emit routing task JSON files (reproducible, version-controlled)."""

# 19 canonical tasks (v1 disclosure); ground truth aligned with keyword ...