Consulting firms publish thought leadership. We publish working code, reproducible benchmarks, and reference architectures we run ourselves. Labs is where the firm's point of view gets built and tested — not just written down.
Frontier Research
A small standing team that tracks the moving edge of capability — new model families, agentic methods, long-context and tool-use behaviour — and translates it into what changes for the engagements we run. Research here is judged by whether it changes a recommendation, not by citation count.
Open Tooling
The evaluation harnesses, data pipelines, and agent scaffolding we build for ourselves, released as open source where we can. Clients inherit battle-tested tools instead of bespoke one-offs, and the wider community gets to inspect — and improve — how we work.
Reference Architectures
Opinionated, production-grade blueprints for the systems we build most: retrieval over governed data, multi-step agents with human checkpoints, eval-gated deployment. Each is something we have run in production, documented down to the failure modes — not a diagram.
Benchmarks
Task-specific, domain-grounded benchmarks that measure what an executive actually cares about — accuracy on their documents, cost per resolved case, latency under real load — rather than leaderboard scores that rarely survive contact with a client's data.
We publish what worked, what didn’t, and the evidence
Field-tested write-ups: methods papers, post-mortems, and practitioner guides drawn from real engagements (anonymised, with permission). We publish what worked, what didn't, and the evidence — so the claims we make to clients are ones we have shown our work on.
- Eval-Gated Deployment: Shipping Agents You Can Defend to a Risk CommitteePreecursor Labs · Methods2026
- Retrieval Over Governed Data: A Reference Architecture for Regulated IndustriesPreecursor Labs · Reference Architectures2025
- What the Leaderboards Miss: Building Domain Benchmarks That Survive ProductionApplied ML in Practice (Workshop)2025
- Human Checkpoints in Multi-Step Agents: A Field Post-MortemPreecursor Labs · Field Notes2025
- Provenance by Default: Tracing Model Outputs Back to Their SourcesResponsible AI Engineering Symposium2024
The Preecursor Labs Fellowship
Each year we fund a small cohort of researchers and engineers to spend six months inside Labs, working shoulder-to-shoulder with our partners on a problem that matters to a real client. It is not an internship and it is not academic tourism — fellows own a piece of work end to end and ship it.
If you would rather build the methods than read about them, we would like to hear from you.