The harness decomposes into five interlocking layers. None is sufficient alone; all five must exist for autonomous medical AI to operate in a clinical setting. We treat them as infrastructure primitives - each can be built incrementally, but the system only functions when all five reach minimum viability for a given clinical surface.
Clinical integrations. The pipes between the AI and the clinical environment. EHR connectors (Epic, Cerner, Athenahealth), FHIR adapters for structured data exchange, clinical decision support (CDS) hooks that embed AI output into existing physician workflows, PACS integration for imaging, and HL7 v2 interfaces for legacy systems that will not migrate for a decade. The integration layer has to be bidirectional: the AI reads from the clinical record and writes back into it, with full provenance tracking. This is not an API call - it is a stateful, auditable transaction that must survive interruption, version changes, and regulatory inspection.
Liability architecture.The legal scaffolding that determines who is responsible when an autonomous AI makes a clinical decision. Indemnity contracts between the AI deployer and the health system. Malpractice carve-outs that define whether AI-assisted decisions fall under the physician's existing coverage or require new policy structures. Algorithmic audit trails that reconstruct the AI's reasoning chain for every clinical recommendation - not just the output, but the input data, model version, confidence score, and any human override. This is what makes the liability question tractable: not by eliminating liability but by making it assignable and insurable. The Utah model is the first framework that attempts this at scale - requiring a licensed physician supervisor, an FDA-cleared or exempt algorithm, and a defined scope of autonomous action.
Data standards. Existing health data standards were built for human-generated clinical data. HL7 FHIR handles physician notes, lab results, imaging orders. It does not handle autonomous AI outputs: diagnostic confidence intervals, multi-model consensus scores, reasoning traces, or consent records specific to AI-generated care. The harness requires extensions to FHIR that represent AI clinical outputs as first-class resources - with their own provenance, versioning, and consent semantics. Consent frameworks must address a question that did not exist five years ago: does a patient consent to AI-generated care the same way they consent to physician-directed care? The answer is no, and the data standards have to encode that distinction.
Regulatory scaffolding. The regulatory landscape for autonomous medical AI is moving fast and unevenly. The FDA's SaMD framework was designed for locked algorithms that do not learn; the agency is developing a Predetermined Change Control Plan (PCCP) pathway for adaptive AI, but it is not finalized. State-by-state licensing creates a patchwork: Utah permits autonomous prescribing under supervision; most states have no framework at all. Post-market surveillance for AI differs fundamentally from device surveillance - the “device” changes with every model update. The regulatory scaffolding component of the harness is not lobbying; it is the operational infrastructure to navigate, comply with, and adapt to a regulatory environment that will change materially every twelve to eighteen months for the next decade.
Trust infrastructure. Hospital administrators do not adopt clinical AI because it performs well on benchmarks. They adopt it when they see peer institution evidence, validated clinical studies, physician endorsement, and a risk profile they can present to their board. Trust infrastructure is the systematic production of that evidence: clinical validation studies designed for the specific patient populations a health system serves, evidence packages formatted for hospital board review, physician education programs that build fluency rather than resistance, and transparent reporting of AI performance in production - including failures. Trust is not a marketing exercise. It is an engineering discipline with its own deliverables, timelines, and quality metrics.