CTO Pitch

RCF: AI software,
properly.Copy link

A methodology for shipping AI-assisted code
at enterprise grade.

Stravica · 2026

Speaker notes

Open cold. No warm-up, no throat-clearing. Land the pattern the industry is living, and let the room map it onto themselves.

I’ll open with what the industry has spent the last year not quite admitting. AI has made software teams faster than anyone has seen, and less sure of what they are shipping than they have been in years. The demo lands. The board is delighted. And the last twenty percent, the part that was always the hard part, is still sitting there unsolved. For an engineering leader, that shows up as a trust problem: code you can’t account for is code you can’t put your name behind. For a product leader, it is a shipping problem: a demo that impresses is not something you can safely put in front of customers. Same problem, two badges. What I want to do today is offer a precise way to think about it, and something you can take back to a CEO or a board when they ask where AI actually stands in engineering.

The contract

What I want you
to leave with.Copy link

A name for the thing you’re probably feeling.
A way to talk about this with your CEO and colleagues when you have to.
An honest read on where this all is.

Section 1 · The gap

AI agents are extraordinary
at the first 80%.
They stop being useful
at the 20% that matters.Copy link

The demo runs. The screens look right. Underneath, the agent has made a hundred decisions you never made. Each one quietly wrong in a way that surfaces six weeks later.

AI gets you to demo-ready. Production-ready is a different problem, and the same problem it always was.

Section 1 · The gap

Four blockers, named.Copy link

Intent drifts. AI produces code without really understanding the business requirement behind it. The output looks right but the intent behind it has quietly slid sideways.
Hallucinations ship. AI invents methods, misreads schemas, quietly introduces bugs that only surface in prod.
Trust doesn’t scale. Engineers read every line, which eats back the productivity gain.
Audit is absent. When something breaks, nobody can trace the code back to the decision that asked for it.

Section 1 · The gap

Demo-ready and production-ready
are different problems.Copy link

The gap AI opened up. Teams plateau at the first; the second is where the senior premium lives.

Section 1 · The gap

You’re not alone in this.Copy link

31% of software projects classed as fully successful. Standish CHAOS
<30% of organisations report material productivity gains from gen-AI in engineering. McKinsey, 2025
96% of engineers say they wouldn’t trust unverified AI-generated code in production. Industry survey, 2025

Section 2 · What you’ve tried

The tools you bought.Copy link

Claude Code. Codex. Copilot. Cursor. Cline. They are brilliant at auto-complete, refactors, and scaffolding. They do not fix the chain from intent to evidence.

Section 2 · What you’ve tried

The guidelines you wrote.Copy link

“Use AI responsibly” docs gather dust because they’re advisory, not load-bearing. They sit beside the code, not in the path of the code.

Policy that isn’t on the critical path gets read once and bypassed forever.

Section 2 · What you’ve tried

The PR reviews you mandated.Copy link

Mandating human eyeballs on every AI diff doesn’t fix the problem. It spends the productivity gain on the wrong activity. The reviewer is reading the output, not checking it against the intent. There’s no intent on file to check against.

Section 2 · What you’ve tried

The shape of
the missing piece.Copy link

Not a tool. Not a policy. A methodology.

Section 3 · The principle

Don’t trust the agent.
Trust the chain.Copy link

Section 3 · The principle

The core idea.Copy link

Anchor in requirements. Every requirement breaks into user stories. Every story carries acceptance criteria. Every acceptance criterion maps to a test. Code arrives last; its only job is to make the tests pass.

Every line points up to the decision that justifies it, and down to the test that proves it.

If this sounds like requirements-driven development, you’re half right. Hold the thought; I’ll place it in a minute.

Where RCF sits.Copy link

Discovery → Shaping → PRD/TAD → Build sequence → FBS → Build cycle → Deploy gates → Operate / Learn

Build-side in scope today. Both ends on the roadmap.

Speaker notes

Where RCF sits in the wider lifecycle.

The shaded band is what’s published today: from an agreed PRD and TAD, through the build sequence and the per-slice build cycle, to a slice that’s tests-green and ready to ship. Both ends of the shaded band are deliberately out of frame for now, and both are on the roadmap. Upstream, the multi-role agreement that turns a vague idea into a structured PRD and TAD is the next workstream. Downstream, the PR, merge, and deploy gates that turn a tests-green slice into a feature actually running in production come after that. Both extend the chain; neither gets a tacked-on chapter. Operate-and-learn beyond that is not on the RCF roadmap. That sits with the digital operator, a separate Stravica concept that lives alongside the methodology rather than inside it.

The full picture is on the scoping page; the digital operator concept lives at /digital-operator/.

The document chain.Copy link

Intent & architecture → FBS → tests

Intent comes down one arm. Architecture comes down the other. Both meet at the build.

Section 3 · The principle

One acceptance criterion,
one test suite,
no negotiation.Copy link

The central primitive. The contract layer.

Section 3 · The principle

Traceability,
forward and backward.Copy link

From any line of code, trace which AC, which story, which requirement, and who signed it off. From any business decision, trace every test that proves it ships.

The audit becomes a query, not guesswork dressed up as seniority.

The build cycle.Copy link

DEFINE → BUILD → REVIEW → TEST → FINALISE

Define, Build, Review, Test, Finalise. Each stage commits. Each commit is honest.

Section 3 · The principle

When the spec changes,
and it always does.Copy link

The living spec. POs don’t always know what they want until they see it; that’s not a bug. Traceability surfaces the gap; the gap becomes the next FBS.

Same shape whether it’s a bug fix, a new edge case, or a whole new module.

A lineage, not a launch.Copy link

Waterfall, RDD, Agile, XP, BDD, DDD, CD, SDD → RCF

Waterfall, RDD, agile, XP, BDD, DDD, continuous delivery, spec-driven development. Each gave a piece. RCF pins them together for the agent era.

Section 3 · The lineage

The two you’ll
compare it to.Copy link

Requirements-driven development. RCF’s ancestor, not its rival. Requirements-first is decades old, with a lifecycle and tooling RCF can’t match. What RCF adds is a tighter chain that lives inside the build, not in a separate tool beside it.
Spec-driven development. This year’s framing for AI-assisted work, and a good one: it makes code correct against the spec. It slots into a single stage of the build cycle, the Build step, generating code against a precise target. What it can’t judge is whether the spec was the right thing to build; the AC-derived test, not the spec, stays the authority. It can’t validate the chain, because it doesn’t know the chain exists.

RCF doesn’t replace these. It holds them together, and holds them against an agent that won’t.

Speaker notes

The two comparisons you’re each making in your head. Let me meet them head-on.

Requirements-driven development got there first, with a defined lifecycle and a tooling ecosystem I can’t match. If you put requirements first and trace them through, you’re already doing the thing it named. RCF’s addition is narrow: keep that chain living inside the build. Spec-driven development is the dominant framing this year, GitHub Spec Kit and the rest; it puts an executable spec at the centre and regenerates code from it, right by construction relative to the spec. The blind spot is the spec itself: SDD has no opinion on whether the spec was the right thing to build, or who is accountable for it. That is the RCF layer. Concretely, that is the Build stage of the five-stage cycle you saw earlier: SDD can generate the code there, but Define wrote the AC-derived tests before it and Test runs them after, so the test, not the spec, is what says yes. What you can’t do is point SDD at the chain and ask it to validate, because it isn’t aware the chain exists. RCF is what gives the spec a reason to exist.

Two pages go deeper: RCF and requirements-driven development and RCF and spec-driven development.

Section 4 · In a regulated context

The audit story.Copy link

Six months after ship, compliance asks: why did this customer get a Tier-2 KYC verdict?

In RCF, the answer is in the AC that scoped the verdict logic, the test suite that asserted it, the FBS that built it, and the human who signed the criterion. The chain holds.

Section 4 · In a regulated context

What regulators are
starting to ask.Copy link

DORA (in force): operational resilience, traceable change management.
EU AI Act (high-risk systems): show your working when an AI helped make a decision.
BCBS 239 (banks): data lineage and aggregation under audit.
Model risk (SR 11-7, SS1/23): increasingly applied to AI-augmented code paths.

‘The AI did it’ is not a defence any regulator will accept.
The chain is the defence.

Section 4 · In a regulated context

What changes for your
senior engineers.Copy link

Less time writing code. More time specifying ACs and reviewing for theatre. The senior-engineer premium moves up the stack, from typing to judgement. Pay band intact, role shape different.

They will like this.

Section 4 · In a regulated context

What changes for your juniors.Copy link

They ship more, faster, under a structure that makes their work auditable for the first time. The ladder gets clearer: writing crisp ACs is the new “can they spec well?” interview question.

They will like seeing a clearer career path.

Theatre risk and the human signature.Copy link

STANDARDS → EXTRACTION → DIFF → APPROVAL → (loops)

Standards, AI-assisted extraction, the visible gap, the approval gate.
A signature on an AC has to be a real commitment, not a rubber-stamp.

Section 4 · In a regulated context

This isn’t theory.
It’s running in production.Copy link

One product delivered end-to-end on the RCF methodology, with PoC tooling alongside. A second product on a different stack is in flight. Team adoption is widening.

This documentation site was built using RCF and my Digital Operator, Dave. Our own tooling is being built using RCF.

Section 5

Beyond engineering.Copy link

The same idea, capture how something works and make it load-bearing, generalises beyond writing code.

Section 5 · Three phases

Three phases of adoption.Copy link

Phase 1: automate the engineer. RCF. Where most teams are trying to be. Working in production today.
Phase 2: automate the operating engineer. The Digital Operator. Where I am now. The first instance is running.
Phase 3: automate the roles upstream. PO, architect, compliance, design. Where this is heading.

Section 5 · The Digital Operator

A counterpart,
not an assistant.Copy link

An assistant helps you do your job. A counterpart does the job, and journals how you would have done it while doing it.

The mechanism is closer to a biographer than to a chatbot: a writer who shadows the subject, captures every observation about how they think, and over time becomes able to write in their voice. The Digital Operator does the same, except it also performs the work itself, and the work is how it learns.

The first instance is running. Dave is my engineering Operator. He runs my tech estate, handles my routine work, tightens my drafts, and learns my judgement as he goes.

An assistant helps you. A counterpart replaces your role or function: in your voice, at your standards, with your trail.

Section 5 · The Digital Operator

Capturing the hard-to-formalise.Copy link

Most institutional knowledge isn’t the decisions themselves. It’s the why. Intuition. Decision paths. Justifications. The reason you ranked one option above another.

This kind of knowledge doesn’t get written down because the cost of writing it down has always been higher than the cost of carrying it in your head. The Operator inverts that. Every decision, every justification, every priority call is captured in context, as it happens, by the thing that was already in the loop.

The hardest knowledge to write down, the why,
is what gets captured by default.

Section 5 · The Librarian

The Librarian.Copy link

A journal that nobody reads is a log file. The pattern that makes the Digital Operator more than clever logging is the second role: the Librarian.

The Librarian collates incoming journals across every Operator running across every role. Distills what is repeated. Organises what is distinct. Synthesises across journals into a single institutional store that feeds the next round of work.

What vendors are selling as agent memory is plumbing. The Librarian is a role.

Agents journal. The Librarian remembers, for the whole company, across every role, forever.

Speaker notes

If you only study one slide from this section, make it this one.

A journal that nobody reads is a log file. The pattern that makes the Digital Operator more than clever logging is the second role: the Librarian. The Librarian collates incoming journals across every Operator running across every role. Distills what is repeated. Organises what is distinct. Synthesises across journals into a single institutional store that feeds the next round of work and improves alignment over time. Distributed, hard-to-collect, often-buried knowledge becomes a single living asset that compounds with use. Vendors are selling something they call agent memory right now: that’s plumbing, an infrastructure tier that lets a single agent remember more. The Librarian is a role: a separate synthesis function whose only job is to make sense of what every other Operator observed, and turn it into knowledge the rest of the team can draw from.

The Digital Operator page goes deeper.

Section 5 · The implication

Institutional knowledge
stops being trapped in people.Copy link

When a person moves on, what they knew doesn’t leave with them. When someone new inherits a role, they start from the journal, not from a half-considered legacy handover.

Section 5 · The honest read

Where we are.Copy link

Phase 1 (RCF): one product delivered end-to-end on the methodology, with prototype tooling. A second product on a different stack is in flight.

Phase 2 (Digital Operator): Dave is running. He built the methodology rewrite. He builds and maintains this site. He builds my own tooling. The Librarian function exists; it’s currently small.

Phase 3 (upstream roles): in design.

Early days. The shape is real, the scale is small, what’s possible is being worked out.

Take-home

AI does the typing.
RCF does the governance.Copy link

Resources

Resources, and questions.Copy link

The methodology, full read.
The Digital Operator page, the broader operating model.
This deck, share-link in case anyone wants to revisit.

RCF: AI software,properly.Copy link

What I want youto leave with.Copy link

AI agents are extraordinaryat the first 80%.They stop being usefulat the 20% that matters.Copy link

Four blockers, named.Copy link

Demo-ready and production-readyare different problems.Copy link

You’re not alone in this.Copy link

The tools you bought.Copy link

The guidelines you wrote.Copy link

The PR reviews you mandated.Copy link

The shape ofthe missing piece.Copy link

Don’t trust the agent.Trust the chain.Copy link

The core idea.Copy link

Where RCF sits.Copy link

The document chain.Copy link

One acceptance criterion,one test suite,no negotiation.Copy link

Traceability,forward and backward.Copy link

The build cycle.Copy link

When the spec changes,and it always does.Copy link

A lineage, not a launch.Copy link

The two you’llcompare it to.Copy link

The audit story.Copy link

What regulators arestarting to ask.Copy link

What changes for yoursenior engineers.Copy link

What changes for your juniors.Copy link

Theatre risk and the human signature.Copy link

This isn’t theory.It’s running in production.Copy link

Beyond engineering.Copy link

Three phases of adoption.Copy link

A counterpart,not an assistant.Copy link

Capturing the hard-to-formalise.Copy link

The Librarian.Copy link

Institutional knowledgestops being trapped in people.Copy link

Where we are.Copy link

AI does the typing.RCF does the governance.Copy link

Resources, and questions.Copy link

RCF: AI software,
properly.Copy link

What I want you
to leave with.Copy link

AI agents are extraordinary
at the first 80%.
They stop being useful
at the 20% that matters.Copy link

Demo-ready and production-ready
are different problems.Copy link

The shape of
the missing piece.Copy link

Don’t trust the agent.
Trust the chain.Copy link

One acceptance criterion,
one test suite,
no negotiation.Copy link

Traceability,
forward and backward.Copy link

When the spec changes,
and it always does.Copy link

The two you’ll
compare it to.Copy link

What regulators are
starting to ask.Copy link

What changes for your
senior engineers.Copy link

This isn’t theory.
It’s running in production.Copy link

A counterpart,
not an assistant.Copy link

Institutional knowledge
stops being trapped in people.Copy link

AI does the typing.
RCF does the governance.Copy link