Ambient AI Scribes: The Medical Coding Gap You Can't Ignore

Arasu Elango
May 15
5 min read

Ambient AI Scribes: The Medical Coding Gap You Can't Ignore

Ambient AI has become one of the fastest-adopted technologies in healthcare history. According to a 2025 KLAS Research report on ambient speech technology, more than 80 percent of ambulatory organizations now report at least moderate adoption. Clinicians are spending less time typing after hours, notes are more complete, and physician satisfaction scores are up at health systems from Rochester to Roseville.

But there is a problem hiding in plain sight. Most ambient AI tools are coding-naive. They capture the clinical conversation and generate a better note -- but they do not know an ICD-10 code from an E/M level. And when documentation goes into the revenue cycle without coding guidance baked in, health systems pay for it in denials, audit exposure, and rework that nobody budgeted for.

What Ambient AI Actually Does Well

The core value proposition of ambient scribing is real. A physician sees a patient, talks naturally, and the AI converts the conversation into a structured clinical note -- history, physical, assessment, plan, all populated in the EHR. Studies cited by the American Medical Association show meaningful reductions in after-hours documentation time, and many clinicians describe it as transformative for their relationship with patients.

The note quality improvement is genuine. Ambient tools often capture detail that would have been omitted from a rushed, end-of-day entry. Complaints are longer. Histories are more complete. The encounter is more legible.

What ambient AI does not do, in most implementations, is think about how that note will be coded. It does not ask: Does this documentation support the E/M level being billed? Is there a secondary diagnosis here that should be captured for HCC risk adjustment? Is the clinical complexity reflected in a way that survives a payer audit?

The Coding-Naive Problem

In February 2026, HFMA convened a roundtable of health system revenue cycle leaders to examine the coding implications of ambient AI. The finding was blunt: coding-naive tools "expose health systems to denials, audit risk and downstream rework." A better note is necessary but not sufficient. Revenue integrity requires that documentation specifically support the codes being submitted.

The root issue is timing. Traditional medical coding is a retrospective activity -- a coder reviews the completed note after the encounter and assigns codes. Ambient AI tools, even good ones, have generally been designed to serve the documentation workflow, not the coding workflow. They hand off a note to the revenue cycle and consider their job done.

That handoff creates a gap. Revenue cycle teams receive notes that are clinically richer but not necessarily coded correctly. CDI specialists may catch some of the gaps, but most health systems cannot afford to review every encounter. As AKASA Chief Revenue Officer Ben Beadle-Ryby wrote in a May 2026 piece for HFMA, health systems have historically worked within "the practical limits of human scale" -- sampling, auditing the highest-risk encounters, chasing denials after the fact. That is not a failure of expertise. It is a failure of capacity.

What Coding-Aware AI Looks Like

Real-Time CDI Prompts at the Point of Care

The next generation of ambient tools is being built with the revenue cycle as a first-class concern, not an afterthought. Coding-aware platforms do not just listen and transcribe -- they analyze documentation in real time and surface prompts back to the clinician during or immediately after the visit. If a patient's documented symptoms suggest a secondary diagnosis that should be captured for HCC purposes, the system flags it. If the E/M level documented does not match the complexity of care delivered, the physician sees that before the note is signed.

Ambience Healthcare, in September 2025, became the first ambient AI platform to launch inpatient CDI at the point of care, built on OpenAI's models. The system extends coding awareness into inpatient settings -- historically the hardest environment for concurrent documentation improvement -- and integrates directly with Epic and Oracle Cerner. That kind of tight loop between what the clinician says, what goes into the record, and what the coding engine expects is what distinguishes coding-aware ambient AI from standard scribing tools.

From Retrospective Coding to Concurrent Coding

Coding-aware AI also changes the economics of the CDI function itself. When documentation gaps are caught at the point of care -- while the physician is still available to clarify -- the cost of correction is near zero. When those same gaps are caught by a CDI specialist three days later, corrections require queries, physician callbacks, and amended notes. When they are caught by a payer's AI-powered claims review system after submission, they result in denials that cost significantly more to rework per claim than they would have cost to prevent.

The shift from retrospective to concurrent coding is not just about efficiency. It is about accuracy. A physician who is prompted in the moment to document that a patient's hypertension is stage-2 resistant, rather than simply "hypertension," will do so more reliably than a coder who tries to infer the specificity from a completed note days later.

Why the Accuracy Gap Is a Strategic Problem

HFMA's May 2026 analysis of the hospital of the future makes the case that data accuracy is strategic infrastructure, not back-office hygiene. Large language models can now read every clinical record in a health system, not just a statistical sample. They can identify documentation gaps, surface coding opportunities, and help expert teams focus their judgment where it matters most.

For health systems that deployed ambient scribing primarily as a physician-satisfaction tool, that framing is a wake-up call. A tool that improves documentation completeness but leaves coding gaps untouched is not neutral -- it may actually make the problem harder to detect, because the notes look good. The revenue leakage is quieter.

The platforms that will define the next phase of ambient AI in healthcare are the ones treating documentation and coding as a single, integrated workflow. The question for revenue cycle leaders evaluating ambient tools should go beyond "does this reduce physician documentation time?" It should also include:

Does the platform generate ICD-10 diagnosis suggestions and E/M level recommendations grounded in documentation, with audit-defensible rationale?
Does it flag HCC-relevant conditions and prompt documentation specificity at the point of care, before the note is signed?
Does it integrate with CDI workflows so coders and clinicians are working from the same signal?
Can it report on documentation gap rates by physician, specialty, or service line to drive continuous improvement?
Does performance hold across specialties, or is it tuned for primary care and thin elsewhere?

The Bottom Line

Ambient AI is not a passing trend. The productivity gains are real, physician adoption is accelerating, and the technology is getting meaningfully better. But the first wave of deployments has largely been silent on coding -- and that silence has a cost.

The health systems that will get the most out of ambient AI are not just measuring documentation time saved. They are measuring denial rates before and after deployment, CDI query volumes, and coder productivity per chart. Those are the metrics that reveal whether ambient AI is actually improving revenue integrity or simply moving the documentation problem downstream.

If your ambient AI tool is not prompting for coding specificity and CDI issues at the point of care, it is doing half the job. The good news is that the tools to close that gap exist today -- and the case for deploying them is straightforward.

To see how agentic AI can connect documentation and coding into a single, accurate workflow, explore Medikode's automated medical coding platform.

Ambient AI Scribes: The Medical Coding Gap You Can't Ignore