GRAIL PROTOCOL

📂 Drop arc_overseer_run5_results.json here, or click to browse

Or paste the JSON in the textarea below and click LOAD

BASELINE · RAW MATRIX

8.9%

CELL MATCH · BLIND

900 integers → no grounding.
0/10 eval tasks solved.
The wall is the interface,
not the model.

24.3%

CELL MATCH · REPRESENTATION

Objects extracted, semantics live.
3 near-misses (58–71%).
The model sees the board.
Motor controls: ❌

PENDING

CELL MATCH · DSL HANDS

translate_object: wired
recolor_object: wired
is_monochrome: ✅ proven
[ load results to score ]

009d5c81 at 83.7% → 100%. 3-object task, rule perceived in Run 4, DSL gives motor control to finish.

— PENDING

136b0064 / 1ae2feb7 / 16de56c4 improve to >85%. Gap was motor, not perception.

— PENDING

Object complexity ceiling holds. >15-object tasks stay near zero. DSL fixes motor, not combinatorial search.

— PENDING

Overall cell match > 40%. First correct eval task possible.

— PENDING

TASK ID	CELL MATCH	BAR	CORRECT	CMDS	ERRS	LATENCY
— awaiting arc_overseer_run5_results.json —

GRAIL PROTOCOL · ABR-841 · γ₁ = 14.134725141734693 · The score comes from running it.