Healthcare providers lose 30-60 minutes per insurance claim error, with the industry wasting over $250 billion annually on administrative overhead. At Kustode, we built a multi-tenant Revenue Cycle Management (RCM) platform processing thousands of daily EDI transactions — 837P claims, 835 remittances, and 270/271 eligibility checks. While we automated the EDI pipes early on, the intelligent workflow orchestration — denial management, prior authorization, claim intervention — remained stubbornly manual. Until we integrated MCP.
This post shares the production lessons from deploying MCP workflows in a HIPAA-compliant healthcare platform. I'll be presenting this in depth at the MCP Dev Summit North America in New York on April 3, 2026.
Healthcare billing is a domain where small errors cascade into massive costs. The numbers are stark:
Behind each of these numbers is a billing specialist navigating between CMS coverage databases, NCCI edit rules, payer-specific policies, and clinical documentation — copying data from one system, cross-referencing in another, and manually constructing appeal letters. The workflows are complex, multi-step, and require domain expertise. They're also perfectly suited for AI agent orchestration.
Before MCP, we evaluated several approaches: hard-coded workflow engines, LangChain-style chains, and custom API orchestration layers. Each had fundamental limitations for our use case:
MCP solved this by separating tool providers (MCP servers) from tool consumers (agents). Each domain knowledge source — CMS coverage, RVU data, medical coding, NCCI edits, denial codes — runs as an independent MCP server. Agents connect to the servers they need for a given task.
Our MCP server topology maps directly to the knowledge domains billing specialists rely on:
┌─────────────────────────────────────────────────────┐
│ Agents │
│ ┌──────────────┐ ┌───────────────┐ ┌─────────────┐ │
│ │ CodeResearch │ │ MedNecessity │ │ DenialResolv│ │
│ └──────┬───────┘ └───────┬───────┘ └──────┬──────┘ │
│ │ MCP tool calls│ │ │
├─────────┼─────────────────┼────────────────┼────────┤
│ MCP Servers │
│ ┌────────────┐ ┌─────────┐ ┌──────────┐ ┌────────┐│
│ │CMS Coverage│ │ CMS RVU │ │Med Coding│ │ NCCI ││
│ └────────────┘ └─────────┘ └──────────┘ └────────┘│
│ ┌──────────────┐ │
│ │ Denial Codes │ │
│ └──────────────┘ │
├─────────────────────────────────────────────────────┤
│ External APIs │
│ CMS.gov │ NIH Clinical Tables │ CMS Downloads │
└─────────────────────────────────────────────────────┘
Each MCP server is built with FastMCP (Python) and exposes domain-specific tools. The CMS Coverage server, for example, provides tools like search_coverage_by_cpt, get_lcd_details, and check_ncd_applicability. The NCCI Edits server provides check_code_pair_edits and get_modifier_requirements.
Here's what happens when a claim denial arrives via an 835 remittance:
cms_coverage.search_coverage_by_cpt("99213") → Gets LCD/NCD coverage criteriancci_edits.check_code_pair("99213", "36415") → Verifies bundling rulesdenial_codes.get_resolution_guidance("CO-4") → Gets payer-specific resolution patternsThe entire workflow that previously took 30-60 minutes of human research completes in under 60 seconds.
Multi-tenancy in healthcare means PHI isolation isn't optional — it's a HIPAA requirement. Our MCP architecture enforces tenant boundaries at three levels:
Healthcare denial resolution isn't a request-response pattern. A denial can take 45+ days to resolve, involving multiple rounds of additional documentation requests, payer communications, and appeals. MCP's tool-call model is inherently stateless — so we needed a state management layer on top:
Deploying MCP in a HIPAA-compliant environment required security patterns beyond standard MCP usage:
We instrument every MCP interaction with OpenTelemetry. Each tool call becomes a span in a distributed trace, carrying:
Prometheus metrics track tool call rates, error rates, and latency distributions per MCP server. Grafana dashboards give us real-time visibility into which tools are being called most, which are slowest, and which have the highest error rates. This observability data feeds into capacity planning for the MCP servers themselves.
After six months of production deployment, clear patterns have emerged for when MCP adds value versus when it's overhead:
MCP wins when:
Traditional APIs win when:
For healthcare billing, the dynamic multi-source research pattern makes MCP a natural fit. Every payer has different rules, every denial has a different resolution path, and the knowledge sources (CMS databases, NCCI edits, payer policies) are constantly being updated.
I'll be presenting this in full detail at the MCP Dev Summit North America on April 3, 2026 in New York. The talk covers production metrics, deployment challenges, live architecture walkthroughs, and lessons learned from integrating MCP into an existing production system (versus greenfield builds).