You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Independent researcher in auditable AI systems. No-meta governance, observable-only evaluation, deterministic replay. Papers on Zenodo, code on GitHub.
AI agent output checker and workflow verification toolkit for turning agent text, pull requests, external inputs, and messages into auditable JSON reports with evidence routing, residual ledgers, p…
Fixed-budget multi-agent inference benchmark harness for studying when split inference helps or hurts versus a strong single-agent baseline under local context ceilings, using local Ollama gemma3:1…
Fail-closed certification toolkit for small-to-frontier transfer in agentic AI, with typed manifests, replayable evidence, and theory-registry coverage.
Check whether AI-assisted R&D work is truly verified progress. Local-first LOSCR implementation with JSONL evidence ledgers, deterministic replay, service control, and claim checking.