CTO Pitch
RCF: AI software,
properly.Copy link
A methodology for shipping AI-assisted code
at enterprise grade.
Stravica · 2026
The contract
What I want you
to leave with.Copy link
- A name for the thing you’re probably feeling.
- A way to talk about this with your CEO and colleagues when you have to.
- An honest read on where this all is.
Section 1 · The gap
AI agents are extraordinary
at the first 80%.
They stop being useful
at the 20% that matters.Copy link
The demo runs. The screens look right. Underneath, the agent has made a hundred decisions you never made. Each one quietly wrong in a way that surfaces six weeks later.
AI gets you to demo-ready. Production-ready is a different problem, and the same problem it always was.
Section 1 · The gap
Four blockers, named.Copy link
- Intent drifts. AI produces code without really understanding the business requirement behind it. The output looks right but the intent behind it has quietly slid sideways.
- Hallucinations ship. AI invents methods, misreads schemas, quietly introduces bugs that only surface in prod.
- Trust doesn’t scale. Engineers read every line, which eats back the productivity gain.
- Audit is absent. When something breaks, nobody can trace the code back to the decision that asked for it.
Section 1 · The gap
Demo-ready and production-ready
are different problems.Copy link
The gap AI opened up. Teams plateau at the first; the second is where the senior premium lives.
Section 1 · The gap
You’re not alone in this.Copy link
- 31% of software projects classed as fully successful. Standish CHAOS
- <30% of organisations report material productivity gains from gen-AI in engineering. McKinsey, 2025
- 96% of engineers say they wouldn’t trust unverified AI-generated code in production. Industry survey, 2025
Section 2 · What you’ve tried
The tools you bought.Copy link
Claude Code. Codex. Copilot. Cursor. Cline. They are brilliant at auto-complete, refactors, and scaffolding. They do not fix the chain from intent to evidence.
Section 2 · What you’ve tried
The guidelines you wrote.Copy link
“Use AI responsibly” docs gather dust because they’re advisory, not load-bearing. They sit beside the code, not in the path of the code.
Policy that isn’t on the critical path gets read once and bypassed forever.
Section 2 · What you’ve tried
The PR reviews you mandated.Copy link
Mandating human eyeballs on every AI diff doesn’t fix the problem. It spends the productivity gain on the wrong activity. The reviewer is reading the output, not checking it against the intent. There’s no intent on file to check against.
Section 2 · What you’ve tried
The shape of
the missing piece.Copy link
Not a tool. Not a policy. A methodology.
Section 3 · The principle
Don’t trust the agent.
Trust the chain.Copy link
Section 3 · The principle
The core idea.Copy link
Anchor in requirements. Every requirement breaks into user stories. Every story carries acceptance criteria. Every acceptance criterion maps to a test. Code arrives last; its only job is to make the tests pass.
Every line points up to the decision that justifies it, and down to the test that proves it.
The document chain.Copy link
Intent comes down one arm. Architecture comes down the other. Both meet at the build.
Section 3 · The principle
One acceptance criterion,
one test suite,
no negotiation.Copy link
The central primitive. The contract layer.
Section 3 · The principle
Traceability,
forward and backward.Copy link
From any line of code, trace which AC, which story, which requirement, and who signed it off. From any business decision, trace every test that proves it ships.
The audit becomes a query, not guesswork dressed up as seniority.
The build cycle.Copy link
Define, Build, Review, Test, Finalise. Each stage commits. Each commit is honest.
Section 3 · The principle
When the spec changes,
and it always does.Copy link
The living spec. POs don’t always know what they want until they see it; that’s not a bug. Traceability surfaces the gap; the gap becomes the next FBS.
Same shape whether it’s a bug fix, a new edge case, or a whole new module.
Section 4 · In a regulated context
The audit story.Copy link
Six months after ship, compliance asks: why did this customer get a Tier-2 KYC verdict?
In RCF, the answer is in the AC that scoped the verdict logic, the test suite that asserted it, the FBS that built it, and the human who signed the criterion. The chain holds.
Section 4 · In a regulated context
What regulators are
starting to ask.Copy link
- DORA (in force): operational resilience, traceable change management.
- EU AI Act (high-risk systems): show your working when an AI helped make a decision.
- BCBS 239 (banks): data lineage and aggregation under audit.
- Model risk (SR 11-7, SS1/23): increasingly applied to AI-augmented code paths.
‘The AI did it’ is not a defence any regulator will accept.
The chain is the defence.
Section 4 · In a regulated context
What changes for your
senior engineers.Copy link
Less time writing code. More time specifying ACs and reviewing for theatre. The senior-engineer premium moves up the stack, from typing to judgement. Pay band intact, role shape different.
They will like this.
Section 4 · In a regulated context
What changes for your juniors.Copy link
They ship more, faster, under a structure that makes their work auditable for the first time. The ladder gets clearer: writing crisp ACs is the new “can they spec well?” interview question.
They will like seeing a clearer career path.
Theatre risk and the human signature.Copy link
Standards, AI-assisted extraction, the visible gap, the approval gate.
A signature on an AC has to be a real commitment, not a rubber-stamp.
Section 4 · In a regulated context
This isn’t theory.
It’s running in production.Copy link
One product delivered end-to-end on the RCF methodology, with PoC tooling alongside. A second product on a different stack is in flight. Team adoption is widening.
This documentation site was built using RCF and my Digital Operator, Dave. Our own tooling is being built using RCF.
Section 5
Beyond engineering.Copy link
The same idea, capture how something works and make it load-bearing, generalises beyond writing code.
Section 5 · Three phases
Three phases of adoption.Copy link
- Phase 1: automate the engineer. RCF. Where most teams are trying to be. Working in production today.
- Phase 2: automate the operating engineer. The Digital Operator. Where I am now. The first instance is running.
- Phase 3: automate the roles upstream. PO, architect, compliance, design. Where this is heading.
Section 5 · The Digital Operator
A counterpart,
not an assistant.Copy link
An assistant helps you do your job. A counterpart does the job, and journals how you would have done it while doing it.
The mechanism is closer to a biographer than to a chatbot: a writer who shadows the subject, captures every observation about how they think, and over time becomes able to write in their voice. The Digital Operator does the same, except it also performs the work itself, and the work is how it learns.
The first instance is running. Dave is my engineering Operator. He runs my tech estate, handles my routine work, tightens my drafts, and learns my judgement as he goes.
An assistant helps you. A counterpart replaces your role or function: in your voice, at your standards, with your trail.
Section 5 · The Digital Operator
Capturing the hard-to-formalise.Copy link
Most institutional knowledge isn’t the decisions themselves. It’s the why. Intuition. Decision paths. Justifications. The reason you ranked one option above another.
This kind of knowledge doesn’t get written down because the cost of writing it down has always been higher than the cost of carrying it in your head. The Operator inverts that. Every decision, every justification, every priority call is captured in context, as it happens, by the thing that was already in the loop.
The hardest knowledge to write down, the why,
is what gets captured by default.
Section 5 · The Librarian
The Librarian.Copy link
A journal that nobody reads is a log file. The pattern that makes the Digital Operator more than clever logging is the second role: the Librarian.
The Librarian collates incoming journals across every Operator running across every role. Distills what is repeated. Organises what is distinct. Synthesises across journals into a single institutional store that feeds the next round of work.
What vendors are selling as agent memory is plumbing. The Librarian is a role.
Agents journal. The Librarian remembers, for the whole company, across every role, forever.
Section 5 · The implication
Institutional knowledge
stops being trapped in people.Copy link
When a person moves on, what they knew doesn’t leave with them. When someone new inherits a role, they start from the journal, not from a half-considered legacy handover.
Section 5 · The honest read
Where we are.Copy link
Phase 1 (RCF): one product delivered end-to-end on the methodology, with prototype tooling. A second product on a different stack is in flight.
Phase 2 (Digital Operator): Dave is running. He built the methodology rewrite. He builds and maintains this site. He builds my own tooling. The Librarian function exists; it’s currently small.
Phase 3 (upstream roles): in design.
Early days. The shape is real, the scale is small, what’s possible is being worked out.
Take-home
AI does the typing.
RCF does the governance.Copy link
Resources
Resources, and questions.Copy link
- The methodology, full read.
- The Digital Operator page, the broader operating model.
- This deck, share-link in case anyone wants to revisit.
Speaker notes load here when the deck advances.
Keyboard shortcutsCopy link
- ← →
- Previous / next slide
- ↑ ↓
- Previous / next slide
- Space
- Next slide (with Shift: previous)
- PgUp PgDn
- Previous / next slide
- Home End
- First / last slide
- f
- Toggle fullscreen
- n
- Toggle inline speaker notes
- s
- Open private speaker view in a new window (notes + timer, syncs with this deck)
- t
- Reset timer (speaker view only)
- p
- Toggle present mode (activates click-to-nav)
- ?
- Show / hide this help
- Esc
- Exit fullscreen / close help
Click right half of viewport to advance, left half to go back.
URL hash deep-links to a slide (e.g. #7).