Local models triage PRs from the OpenClaw repo for free

June 2026 will be remembered as the moment it became clear that closed models can disappear. With the removal of Claude Fable 5 still fresh in memory, it’s even more obvious why it matters to own your AI stack and know how to run models locally if your business depends on them.

What they did and why

The team behind OpenClaw built a system to triage issues and PRs using local models (for example gemma-4-26b-a4b and qwen3.6-35b-a3b) inside an agent harness called pi. The goal isn’t to replace a traditional classifier like BERT, but to use agents that can ask for context, inspect the repo and return structured labels.

Why use agents instead of a plain classifier? Because running this in the cloud with a paid account brings limits and latency. On local hardware (a GB10 with 128 GB of unified memory in their case) you can get near-real-time notifications and very low costs (just electricity and maintenance).

Metric	gemma-4-26b-a4b	qwen3.6-35b-a3b	DeepSeek-V4-Flash
Precision	0.716 ± 0.010	0.831 ± 0.007	0.938
Recall	0.905 ± 0.004	0.818 ± 0.006	0.714
F1	0.800 ± 0.008	0.824 ± 0.002	0.811
Exact match	0.410 ± 0.014	0.540 ± 0.014	0.509
False positives	227.0 ± 10.5	105.7 ± 6.4	30
False negatives	60.0 ± 2.6	115.3 ± 4.0	181
Wall seconds / row	1.41 ± 0.04	13.51 ± 0.79	144.14
Output tok/s / worker	25	50	13
Concurrency	16	4	1
Total parameters	26B	35B	284B

What they did and why

Arquitectura y flujo técnico

Security: reposhell and limits

Modelos, optimizaciones y resultados

Metodología y auditoría

Casos de uso y límite de aplicabilidad

Lecciones y recomendaciones técnicas

Fuente original

Stay up to date!

Local models triage PRs from the OpenClaw repo for free