We built a coding harness that beats frontier models using open ones. It's in open beta.
A memory-first coding harness that beats frontier models is highly novel and technical, and directly relevant to AI agent trends.
Backboard Development Studio launched R-CLI, an open-beta coding harness that reportedly scores 92% on Terminal Bench 2.1 with Codex 5.5 and 70% with the open-source GLM 5.1, using a memory-first rather than model-first architecture. The harness routes across 17,000+ models and supports an /expert mode for multi-model orchestration (e.g., plan with Opus 4.7, execute with DeepSeek V4). Backboard claims up to 30% fewer tokens and 90% lower cost than closed alternatives. The design is stateful by default and backed by memory algorithms ranked #1 on LoCoMo and LongMemEval, which the company says eliminates the maintenance tax of hand-built persistence layers.
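The plan-with-one-model, execute-with-another pattern described above can be illustrated with a minimal sketch. This is not R-CLI's implementation or API: `MemoryStore`, `ExpertHarness`, `fake_planner`, and `fake_executor` are hypothetical names invented here, and the two "models" are plain Python functions standing in for real model calls. The only assumptions are that a planner returns one step per line and that each step's outcome is written back to a shared memory that later runs can read.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical stand-in for a model call: takes a prompt, returns text.
ModelFn = Callable[[str], str]


@dataclass
class MemoryStore:
    """Toy memory: an append-only list of notes (an assumption, not R-CLI's design)."""
    notes: List[str] = field(default_factory=list)

    def recall(self) -> str:
        # Return everything remembered so far as one text block.
        return "\n".join(self.notes)

    def remember(self, note: str) -> None:
        self.notes.append(note)


@dataclass
class ExpertHarness:
    """Routes planning to one model and execution to another, sharing one memory."""
    planner: ModelFn
    executor: ModelFn
    memory: MemoryStore = field(default_factory=MemoryStore)

    def run(self, task: str) -> List[str]:
        # Memory is read first, so the planner sees outcomes of earlier runs.
        context = self.memory.recall()
        plan = self.planner(f"Context:\n{context}\n\nTask: {task}")
        # Assumption: the planner emits one step per non-empty line.
        steps = [line.strip() for line in plan.splitlines() if line.strip()]
        results = []
        for step in steps:
            outcome = self.executor(step)
            results.append(outcome)
            # Write each outcome back so state persists across runs.
            self.memory.remember(f"{step} -> {outcome}")
        return results


def fake_planner(prompt: str) -> str:
    # Placeholder for a stronger, more expensive planning model.
    return "write tests\nimplement function"


def fake_executor(step: str) -> str:
    # Placeholder for a cheaper execution model.
    return f"done: {step}"


harness = ExpertHarness(planner=fake_planner, executor=fake_executor)
results = harness.run("add a slugify helper")
```

Keeping the memory outside both model calls is what makes the sketch stateful by default: swapping either function for a different model leaves the accumulated notes intact.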