Machines Cannot Build for Their Own Sessions

1 July 2026 · Tom Cranstoun · 11 min read

Click here to find out what AI provenance looks like

This is a live demo. The author photo further down this page is not a plain image: it carries its MX metadata and a full AI provenance trail embedded inside the file, and that trail names the agent that created it. Drop the photo into the inspector and see what a machine reads. Every company needs this in their CMS. Talk to CogNovaMX.

Ask an AI for a strategic analysis. It'll produce one - thorough, well-structured, properly reasoned. Ask a second AI to build on it. It'll have no idea the first one exists.

This isn't a configuration problem or a product limitation waiting for the next release. It's an architectural condition that applies to almost every AI system in production today, and it has consequences that compound with every AI workflow you ship.

Machines can't build for their own sessions. One AI's output won't be reliably found, parsed, or verified by another without human mediation. The hand-off between machines is broken by design.

The Oral Culture of AI

Before writing, human knowledge was oral. A speaker produced something rich and reasoned. When the session ended, it existed only in human memory. It couldn't be addressed, indexed, verified, or built on systematically. Every new conversation started close to zero.

Writing changed that. A written record has properties that speech doesn't: a fixed location, a persistent form, an addressable identity. One person's work becomes another person's starting point. The discipline compounds.

AI, despite its sophistication, is architecturally oral. An LLM session is rich in the moment and gone when it ends. The reasoning a model performs at 9am isn't available to the next one at 10am. The output from step one can't be verified by a downstream stage unless a human carries it across. Every agent in a multi-agent system is, by default, starting from the same cultural amnesia.

The most capable information-processing machines in history are operating as an oral culture.

What Machines Cannot Do With Their Own Outputs

When an AI system produces an output, the result isn't a governed artefact. It's text. That text typically has none of the six properties that make content machine-actionable.

Property	What is missing from a typical AI output
Identity	No stable identifier. The output can't be reliably found or addressed by another machine.
Structure	No declared structure. The output can't be traversed or parsed without human interpretation.
Semantics	No fixed vocabulary. Two machines reading the same word may not agree on its meaning.
Context	No declared jurisdiction, language, or scope. No instructions for how the output should be interpreted.
Provenance	No author, timestamp, or version. The output can't be traced or compared to prior work.
Trust	No verification path. A downstream machine can't confirm the output is what it claims to be.

The web is the most visible surface, but the same properties are missing from every file type an AI system touches. Every PDF passing through a retrieval pipeline has the same gap. Every image processed by a vision model. Every brand asset, technical document, contract, or report that enters an AI workflow arrives without a stable identifier, without provenance, without machine-readable permissions.

This is concrete. It's the reason enterprise AI pipelines require so much human glue. A language model produces a report. A second system needs to act on it but can't reliably find it, determine whether it's the current version, parse its structure, or verify the source. Humans step in. The promise of autonomous AI workflows stalls at every hand-off.

The systems are extraordinarily capable at producing. Governing what they generate isn't something the current architecture supports.

The Contrast That Clarifies the Problem

Consider what happens when a system does have persistence - a filesystem, memory across sessions, and the ability to write structured artefacts to stable paths.

Such a system can read another machine's work, reconcile it, and save the result to a fixed path with a canonical identifier. It tags the record with session provenance. Future sessions can find it and build on it. The result is a governed artefact, not just content. It has identity. It has structure. Any system that knows where to look can address it.

The contrast with a standard AI chat interface is total. One system produces content for a human reader. What the other system creates, machines can act on directly. Intelligence applied isn't the key difference - the system may be less clever in the narrow sense. The difference is in the properties of what it produces.

One machine produces. Another machine governs.

That gap is what MX is built to address.

Why This Is Structural, Not Incidental

AI chat systems can't govern their own outputs not because their engineers forgot a save button but because the systems were designed around a different model of who the reader is.

Chat interfaces assume a human at the end of the chain: a person who can read the output, judge its quality, copy the relevant parts, and carry the meaning forward. That assumption is reasonable for consumer applications. It's a structural liability for any organisation that needs machines to build on each other's work.

A document designed for human consumption is linear, conversational, implicitly structured. The structure is legible to a reader who knows the conventions. A document designed for machine consumption is explicit, semantically declared, and addressable by a canonical identifier. These aren't the same document. They aren't interchangeable.

Every enterprise deploying AI at scale is discovering this distinction the hard way. Their AI produces richly. Their pipelines fail quietly at every point where one machine's output needs to become the input for the next - without a human in the middle.

Why This Compounds Every Day

This is a production problem that grows with every agent deployed, every workflow extended, every model added to a chain.

HuggingFace hosts millions of models. Every major cloud provider ships its own agent framework. Microsoft, Anthropic, Google, and OpenAI are all building infrastructure for chaining models together. Every one of those deployments creates new hand-off points - places where one machine's output must become another machine's input. Without structured artefacts, the chain depends on a human to carry meaning across.

The scale isn't slowing. More agents are deployed every day. More pipelines are extended. More teams discover the same failure in production and solve it the same way: a human in the middle, doing manually what a governed output would have made automatic.

Regulators have reached the same conclusion from a different direction. The EU AI Act, the UK ICO's forthcoming AI Code of Practice, and equivalent frameworks emerging across regulated markets all point to the same requirement: AI-generated content must be traceable. Not labelled for human readers. Verifiable by machine. Provenance is moving from a design preference to a legal baseline. Companies that have already solved the hand-off problem will satisfy that requirement as a by-product. Those that haven't will need to retrofit it into pipelines never designed to produce governed artefacts.

Note: This page describes regulatory frameworks in general terms only. Nothing here is legal advice. Requirements vary by jurisdiction, organisation type, and use case. Consult qualified legal specialists for guidance specific to your situation.

Compression Is Not Neutral

There is a subtler failure underneath the mechanical one, and it is the one almost nobody designs against. Judging whether an answer actually rests on sound evidence is not a mechanical task; it is an act of interpretation. A machine can retrieve, rank, and summarise at a scale no human matches. What it cannot do, on its own, is weigh sources that disagree and decide which one should carry the day.

So when a model has to produce a single answer, it does the statistically safe thing. It pulls toward whatever the training data repeats most often and smooths the rest away, folding genuine disagreement into one dominant narrative. The rare-but-well-argued objection, the minority reading, the source that sits outside the mainstream - all of it drifts toward the centre and disappears. The answer reads as confident and coherent precisely because the conflict has been averaged out of it.

Older disciplines learned the opposite habit long ago. The work of history is not to crown a single true account but to hold rival interpretations together: to trace dissent, to keep the marginalised voice and the unresolved argument, to protect the small awkward details a tidy summary would throw away. What turns out to matter in a record is often not the agreement but the disagreement. Scale is no excuse to lose it; if anything, scale makes keeping it harder and more important.

This is where the argument of this post meets a much older one. You cannot preserve competing perspectives if the outputs carry no record of who said what, on what evidence, and when. Flattening everything into one narrative is simply what happens when a claim arrives with no provenance and no way to trace it back. A governed artefact resists that by its structure: every claim it carries names its author, its version, and the source that backs it, so a downstream machine can keep two conflicting attributed claims distinct instead of blending them into a single average. Provenance is not only an audit convenience. It is what lets disagreement survive contact with scale.

The Machine Commons Already Being Built

Humans built institutions to solve exactly this problem for human knowledge. Libraries gave it a stable address. Registries gave records a provenance chain. Standards gave documents an interoperable structure. These aren't neutral technical choices - they're governance decisions about how knowledge compounds and stays trusted.

MX-hub, Maxine, and REGINALD are built on exactly that logic, applied to machines.

MX-hub is a governed repository. Every document in it has a canonical identity, a declared version, explicit metadata, and machine-readable permissions. Documents aren't files stored for retrieval - they're COGs: self-describing artefacts that tell any agent what they are, what they permit, and how they relate to everything else in the repository. A machine reading MX-hub doesn't encounter content. It encounters governed meaning.

REGINALD, the public registry that holds and attests MX records, is that layer. It makes provenance verifiable rather than asserted. A COG in MX-hub can be signed and attested - so that when a downstream agent reads it, trust isn't a question of believing the source but of checking the cryptographic record. REGINALD changes "I claim this is authoritative" into something a machine can independently verify. That's the difference between an oral culture and a written one.

Maxine is the agent that operates across this infrastructure. She reads COGs, acts on their instructions, writes back to the repository, and carries context forward across sessions. Maxine isn't a chat interface producing outputs for human readers. She's a machine building for continuity - because every session is governed.

The critical difference is in what she produces. Every document Maxine writes has the metadata embedded from the moment it's created:

a canonical identifier
a declared author, version number, and timestamp
a provenance chain linking it to the COG that authorised the work
machine-readable permissions governing downstream use

The output is a governed artefact from the first character. Any machine that reads it - now or in six months - starts from a complete, verifiable record of what the document is, where it came from, and what it's allowed to do.

This is what breaks the pattern. Most AI systems produce text. Maxine produces provenance. The next agent in the pipeline inherits evidence, not ambiguity.

Together, the three solve the structural problem that breaks every other AI pipeline. MX-hub is the written record. REGINALD is the institution that validates it. Maxine is the agent that can finally build on what came before - and leave something equally solid for what comes next.

Without that infrastructure, AI is the most powerful oral culture in history.

MX-hub, REGINALD, and Maxine are the writing system.

Find Out Where You Stand

Most organisations don't know which of their content - web pages, PDFs, images, assets - is machine-readable and which is opaque. The gap only becomes visible when a pipeline fails or a regulator asks for provenance you can't provide.

CogNovaMX works with organisations at every stage of that discovery:

Content Readiness Assessment. A structured audit of your full content estate: web, PDFs, images, and assets. Maps what machines can read, what they can't, and what closing the gap requires.
Training. Practical workshops for content and product teams on machine-readable documents, governed metadata, and AI-ready architecture.
Consultancy. Hands-on advisory for organisations building AI pipelines, implementing MX standards, or preparing for regulatory content traceability requirements.

Want a first look before you talk to us? Drop a file into the free MX Inspector and see exactly what a machine reads when it opens it.

Get in touch with Tom at CogNovaMX.

Tom Cranstoun is the founder of CogNovaMX and the author of MX: The Handbook and MX: The Protocols. He writes about machine understanding, content governance, and the architecture of AI-readable systems.