Jason Sun - Trust at Scale

The Argument

AI agents are doing real work. Not as demos. In production. Contracts are being reviewed, medical records triaged, incident reports drafted, invoices processed, code written, research synthesized. The shift from “AI as tool” to “AI as worker” is structural, and it is happening faster than the infrastructure can keep up.

When a human consulting firm works on a project, every transaction has paperwork. Engagement letters, subcontracts, NDAs, invoices, deliverable specifications. If the client questions the analysis, the firm traces the chain: the conclusion came from the specialist’s report, which was based on the sub-contractor’s research, which used the data analyst’s findings, which were derived from a licensed dataset. Every step is documented. Every cost is attributable. Every participant is identifiable.

Now do the same thing with AI agents.

An orchestrator agent calls a document processing agent. The document processing agent calls an extraction model. The extraction model calls a classification service. The classification service queries an embedding database. How much did this cost? Nobody knows. Where did the data come from? Nobody knows. Who saw the data? Nobody knows.

The agent economy is running without paperwork.

This is not a temporary gap that the market will resolve on its own. It is a structural absence in the infrastructure: the same kind of absence that existed before double-entry bookkeeping standardized commerce, before SWIFT standardized international banking, before TLS made the internet safe for transactions, before container standards made software deployment reproducible.

Every major technological revolution produces an accountability layer. The accountability layer always comes second. And the accountability layer always outlasts the revolution that created the need for it. We still use double-entry bookkeeping five centuries after Pacioli. SWIFT processes 46 million messages daily, fifty years after its founding. TLS protects every financial transaction on the internet, three decades after Netscape created SSL.

The agent economy has its revolution. It is waiting for its ledger.

The Four Voids

The accountability gap has four components. Solving one without the others leaves the system ungoverned.

Cost Attribution

When an AI system processes a workflow, the cost is either a single number on a monthly invoice (aggregating millions of API calls) or invisible entirely, buried in infrastructure expenses that nobody attributes to specific work. Which API calls served which client? Which steps consumed the most tokens? Which prompts are inefficient? Organizations running AI operations cannot reconstruct the cost tree for a complex workflow from the raw materials available to them.

The consequence is already visible: six-figure surprise bills, pricing done by guesswork, and no ability to distinguish between a $0.50 analysis and a $15.00 one until the monthly invoice arrives.

Data Provenance

Provenance answers three questions: where did this data come from, what happened to it along the way, and who had access at each step? AI systems have no built-in provenance. A model reads a document, processes it, and produces output. Was the document the original or a modified copy? Did the model access other data during processing? Was the output cached, logged, or transmitted to third-party systems?

The Mata v. Avianca case, in which an attorney’s brief contained six fictitious case citations generated by ChatGPT, demonstrated that AI-generated citations have no inherent relationship to reality. The text looks authoritative. The formatting is perfect. The content is invented.

Governance Boundaries

When a consulting firm shares client data with a sub-contractor, there is an NDA. When a hospital shares patient data with a specialist, there is a Business Associate Agreement. When an AI agent passes data to another AI agent, there is nothing. No contract. No access control. No boundary definition. Data flows from agent to agent, potentially from organization to organization, with no framework for controlling, tracking, or auditing the flow.

Calibrated Confidence

This is the fourth void, and perhaps the most insidious.

When a human professional produces work, they communicate uncertainty naturally. “I’m fairly confident in this analysis, but the Southeast Asian data is thin; I’d want to verify those numbers before the board.” The consumer knows where to focus review.

When an AI system produces work, it communicates with uniform certainty. Every statement carries the same typographic weight, the same authoritative tone. A fact drawn from a source document looks identical to a hallucinated statistic. The output is a wall of equal-weight assertions, and the human reviewer has no signal about where to focus attention.

Research on ensemble-based confidence estimation reveals a consistent pattern: models are systematically overconfident. They express 95% certainty on outputs that are correct only 50% of the time. Naive confidence scores (the kind you get from asking a model “how sure are you?”) are worse than useless. They are actively misleading.

Genuine calibration, where a stated 80% confidence means the output is correct 80% of the time, requires infrastructure that does not exist: diverse verification ensembles, historical accuracy tracking, domain-specific thresholds, and systems that learn from human corrections. Without this infrastructure, human-in-the-loop governance is theater. The human is reviewing everything or reviewing nothing, because the system provides no guidance about where attention is most needed.

Calibrated confidence is the routing signal that makes human governance scalable. It tells the accountant which of the 10,000 overnight invoices to review first. It tells the physician how much to trust the model’s 73% probability. It tells the lawyer which contract clause the AI is guessing about. Without it, the other three voids are useful to fill but insufficient.

What This Looks Like in Healthcare

The four voids are most visible, and most consequential, in healthcare, where the stakes are measured in lives.

A physician prescribing blood pressure medication faces extraordinary complexity. Blood pressure fluctuates naturally: sometimes the fluctuation is benign, sometimes it signals a genuine condition. The physician must decide: medicate (risking unnecessary side effects) or wait (risking a cardiac event). The number of available medications is enormous, they belong to overlapping drug classes, and a patient who reacted badly to one medication in a class should not be prescribed another from the same class, a constraint that spans years of history across multiple records.

AI can help. Models can identify which fluctuations are dangerous, cross-reference medication histories, surface relevant studies. But a model that says “73% probability of requiring intervention” is only useful if that 73% is calibrated, if the physician knows whether it actually means 73%, or whether the model is systematically overconfident. The physician also needs to know what the model doesn’t know: is this probability based on a patient population that resembles this patient?

Without calibrated confidence, the physician either trusts the number uncritically (dangerous) or ignores it entirely (wasteful). Human error in medicine is not a technology problem. It is a systems problem. A patient arrives for kidney surgery and receives heart surgery because a label was transposed. A medication is prescribed that interacts fatally with a drug the patient took three years ago, recorded in a different system. These failures demand the same accountability infrastructure the rest of the economy needs.

The data access problem makes it worse. Health systems that hold population-level data across all providers sit on uniquely valuable training sets, but they default to refusal because no trust protocols exist to manage access. The innovators get their data elsewhere, build their products, and the jurisdiction that held the most valuable dataset gains nothing. This is a trust infrastructure failure, not a privacy failure: the data was sensitive enough to require accountability protocols for access, and those protocols did not exist.

The Stack

The accountability infrastructure is a stack of seven layers, each building on the one below, each solving a distinct problem.

7MarketplaceAgent-to-agent commerce

6AgencyIdentity, authorization, consent

5EnforcementRuntime safety verdicts

4TrustReputation from historical transactions

3ExchangeMulti-party settlement

2ComputeDynamic resource allocation

1AccountabilityCost, provenance, governance

The foundation, Layer 1 (Accountability), solves the receipt problem. Every agent operation produces a structured envelope: cost records forming a tree that mirrors the execution tree, provenance chains linking every conclusion to its source data, and governance labels that travel with data through every organizational boundary. Context flows down, results flow up. The envelope is the permanent record of what happened.

Layer 2 (Compute) sits above accountability because every compute operation generates costs and governance events. Dynamic allocation (request, allocate, release, report) replaces static provisioning and feeds directly into the cost tree. Worker self-registration means adding capacity is zero-configuration: start a node, point it at the broker, and it appears in the pool.

Layer 3 (Exchange) is the least mature and one of the most important. When multiple organizations collaborate on a workflow, someone needs to settle the bill. Today this happens through monthly invoices, manual estimates, and disputes that take weeks. The exchange layer automates reconciliation from the exact cost trees that Layer 1 already produces.

Layer 4 (Trust) is the agent economy’s credit bureau, a structured record of accuracy, reliability, compliance, and safety across thousands or millions of transactions rather than a single score. A trust score from 100 transactions is marginal. From 1,000,000, it is authoritative. The value compounds with participation, creating a network effect.

Layer 5 (Enforcement) goes beyond “is this output safe?” to “should this operation proceed, given cost anomalies, data sensitivity, confidence scores, and historical behavior?” The critical innovation is calibrated judgment: diverse ensembles of judge models producing confidence scores where 0.7 actually means correct 70% of the time. Bayesian calibrators on internal model representations distinguish between genuine ambiguity and knowledge gaps. Active learning selectively requests human verification where information gain is highest. The enforcement layer is not a metal detector at an airport. It is a security operations center.

Layer 6 (Agency) solves identity. Today most agents authenticate with shared API keys. The agency layer gives every agent a unique cryptographic identity, explicit authorization chains (who authorized this agent to do this thing), capability-based permissions, and consent management at every organizational boundary.

Layer 7 (Marketplace) is the capstone. Discovery, trust evaluation, terms negotiation, execution, safety monitoring, settlement: every step depends on the layers below. Without the stack, a marketplace is just a directory. With the stack, it is the infrastructure that makes large-scale agent commerce possible.

The Human Shift

A police officer in Ontario finishes a call: a domestic disturbance, a break-and-enter, a collision with injuries. The event is over. The paperwork begins. Forty percent of the shift is administrative. An AI system transcribes the narrative, extracts parties, looks up offense codes, formats the report. The officer reviews, corrects, approves. Time: 10 minutes instead of 45.

The officer did not disappear. The work changed. The human moved from execution to governance.

This pattern is universal. The accountant stops processing invoices and starts reviewing AI-processed invoices. The lawyer stops drafting contracts and starts evaluating AI drafts. The radiologist stops scanning hundreds of X-rays and starts reviewing AI-flagged images. In every case, the skills required change from procedural knowledge (how to do the work) to evaluative knowledge (how to assess whether the AI did the work correctly).

But here is the part that “human in the loop” advocates rarely address: the human governor cannot review everything. The accountant cannot examine every one of the 10,000 invoices the system processed overnight. If the human must review everything, the AI saved no time.

This is where the confidence void becomes a workforce problem, not just an infrastructure problem. Without calibrated confidence (a signal that says “the AI is 95% sure about this invoice but only 60% sure about that one, review that one first”), the human governor is either rubber-stamping everything (dangerous), spot-checking randomly (inefficient), or reviewing everything (pointless).

Interpreting calibrated uncertainty becomes a professional skill, as fundamental to the AI-augmented professional as reading financial statements is to the traditional accountant. The physician needs to understand what it means when the AI is 70% confident in a medication recommendation: what the number says statistically, and what it means clinically, for this patient, with this history.

The education system is not preparing people for this shift. Universities teach students to execute work that AI will increasingly do. Law schools teach contract drafting. Accounting programs teach transaction processing. Medical schools teach scan interpretation. The foundational skills remain essential; you cannot govern what you do not understand. But education stops at execution. It does not teach governance: evaluating AI output, designing AI workflows, measuring AI performance, deciding when to intervene.

The transition from execution to governance is not a demotion. It is a promotion. Governance is harder. It requires deeper expertise, better judgment, and more accountability. The challenge is ensuring enough people have the opportunity to make the transition.

The Convergence

Seven independent forces are converging on the same requirement, and none of them are coordinating.

Regulatory. The EU AI Act, NIST AI frameworks, Canada’s Bill C-27: every jurisdiction is developing requirements for AI transparency, auditability, and governance. These requirements are unmet because the infrastructure to meet them does not exist.

Economic. CFOs discovering six-figure AI bills with no cost attribution. Organizations unable to price their AI-assisted services because they cannot calculate per-workflow costs. The 72% of AI projects that stall at pilot because the integration infrastructure is missing.

Safety. Models making consequential decisions (clinical, legal, financial) without calibrated confidence. Hallucinated citations in court filings. PII leaking across organizational boundaries.

Legal. No liability framework for multi-agent workflows. When Agent A calls Agent B and the output causes harm, who is responsible? Current law has no answer because the audit trail to assign responsibility does not exist.

Environmental. Carbon reporting requirements that cannot be met when compute consumption is unmetered. Organizations required to report environmental impact with no way to attribute energy usage to specific AI operations.

Geopolitical. The CLOUD Act, data sovereignty requirements, sovereign compute strategies. Nations recognizing that AI infrastructure controlled by foreign corporations is a strategic vulnerability.

Enterprise. CISOs responsible for AI governance in organizations deploying agents at scale. AI maturity declining 20% year-over-year because organizations bought the tools without building the operational infrastructure.

These forces do not need to coordinate. They are all demanding the same thing: a way to account for what AI agents do. Cost attribution, data provenance, governance enforcement, and calibrated confidence are not features of a product. They are infrastructure, the same way TCP/IP, HTTPS, and double-entry bookkeeping are infrastructure.

The infrastructure will be built. Someone will build it. The question is whether it will be designed intentionally as a coherent stack, or cobbled together from incompatible patches, each serving a different vendor’s interest, none serving the public interest.

That is the design problem of the decade.

The Architecture Question

The trajectory of AI infrastructure points toward centralization on a scale that has no precedent. The Stargate Project: $500 billion. Microsoft’s data center spend: $80 billion in a single year. The assumption embedded in these investments is that AI compute must be centralized: that scale is the only path to capability, and the future of intelligence belongs to whoever builds the biggest machine.

This assumption confuses training with inference. Training frontier models genuinely requires enormous clusters. But training happens once. Inference, running the trained model, happens billions of times. And inference is where the economic value lives. Every contract reviewed, every report drafted, every invoice processed. All inference.

Inference does not require a $30 billion data center. A $5,000 workstation already runs models that would have required a data center five years ago. GPU performance per dollar improves roughly 40% annually. By 2030, the hardware for a full enterprise AI stack will cost less than a high-end laptop. In ten years, inference-capable hardware is as ubiquitous as Wi-Fi routers.

A critical acceleration: multiple independent groups have converged on the same architectural insight: you can distill a large model’s capabilities into tiny specialized programs that run on sub-billion-parameter models locally. A natural language specification gets compiled into compact weights that execute with no API call, no internet connection, no per-token fee. These compiled programs can be versioned, shared, and composed like software libraries. The convergence is structural, not coordinated. The economics of sending every request to a cloud API are unsustainable at scale.

When every organization can run its own AI on its own hardware, the sovereignty question answers itself. Canadian data stays on Canadian hardware. Patient records never leave the hospital. Defense workloads run air-gapped. Not because of a cloud provider’s contractual promise, but because the data physically never leaves the building. Distribution makes sovereignty a physical reality rather than a legal abstraction.

The distributed architecture (sovereign nodes connected by encrypted mesh, coordinated by protocol) is how the accountability stack gets deployed, as infrastructure that each organization owns rather than a cloud service that concentrates control. The accountability layer, the compute layer, the enforcement layer: all running locally, interoperating through standards. The same architecture that made the internet resilient makes AI infrastructure resilient.

A centralized architecture concentrates power: control over the machines that do cognitive work for the entire economy. A distributed architecture distributes that power to the organizations and communities that use it. These are not equivalent options.

This is the infrastructure AceTeam is building. Not another agent framework or model wrapper, but the accountability layer that sits beneath all of them: distributed, sovereign, designed for a world where AI agents do real work and someone has to account for what happened. The seven-layer stack described above is the architecture. The team, the trust gradient, the human-in-the-loop checkpoints, the calibrated confidence: these are what make it work.

The Complete Argument

This essay is the short version. The book traces the full arc: the structural shift, the accountability void, the historical pattern, the seven-layer stack, the transitions, and what comes next. Available with inline audio.

Read the book 中文版

Jason Sun · jason@aceteam.ai · May 2026

Trust at Scale