The Challenge

Medicare pays over $1 trillion annually. Roughly $70 billion of that -- about 7 cents on every dollar -- is estimated to be improper payments. The GAO has repeatedly criticized CMS for lacking quantifiable fraud detection metrics. The existing model is reactive: pay first, chase later, often years after the money is gone.

The ACT-IAC 2026 AI Hackathon posed a direct question: Can AI shift CMS from reactive to proactive program integrity?

We had 13 days.

The Constraint

No protected health information. No data use agreements. No claims-level data. Everything had to work on publicly available CMS datasets -- the same data anyone can download from data.cms.gov.

This is a harder problem than it sounds. Public data is annual aggregates, not individual claims. You can see that a provider billed $2.3M last year, but not whether they billed the same patient twice on the same day. Most academic fraud detection work assumes claims-level access. We didn't have that luxury.

Can AI shift CMS from reactive to proactive program integrity? We had 13 days.

What We Built

A single Go binary that ingests 22 CMS datasets, scores 1.47 million providers, and serves an interactive investigation dashboard.

Integrity is a single Go binary that ingests 22 CMS datasets (548 million rows), scores 1.47 million Medicare providers across 10 years of history, and serves an interactive investigation dashboard -- all from one server.

The Data Pipeline

The system ingests data from CMS, OIG, DEA, Census, and other public sources into an embedded DuckDB database. No external database server. No ETL orchestration platform. Just direct CSV-to-columnar ingestion with SHA-256 deduplication.

MetricValue
Rows ingested548M+
Providers scored1.47M
Coverage2013-2023
Peer groups1,046 (specialty x region x year)
Graph nodes5.8M
Graph edges82.6M

27 Features, 8 Categories

Every provider gets scored on 27 features in eight categories:

  • Billing Volume -- billing far more than specialty peers, sudden spikes, solo practitioners billing like groups.
  • Procedure Mix -- upcoding E&M visits, concentrating in expensive codes, deviating from specialty norms.
  • Drug Prescribing -- opioid prescribing intensity, high-risk medications for elderly, rare drug combinations.
  • Payment Network -- Open Payments manufacturer relationships correlated with prescribing patterns.
  • Sanctions -- LEIE exclusions, Corporate Integrity Agreements, SAM exclusions.
  • Geographic -- billing deviation from county baselines, cross-payer risk signals.
  • Graph Analysis -- six custom network algorithms: prescribing isolation, opioid network contagion, manufacturer concentration, exclusion proximity, geographic deviation, co-prescribing anomalies.
  • Enrichment -- career stage anomalies, controlled substance supply chain deviations from DEA ARCOS data.

Three-Way Ensemble Scoring

1. Weighted Composite

27 features normalized to 0-100, combined with tunable weights. Deterministic, reproducible, transparent.

2. ML Corroboration

Isolation Forest, K-Means, Local Outlier Factor. Corroboration boost up to +30 points.

3. Gradient Boosted Trees with Focal Loss

Trained on LEIE exclusion labels. Validation AUC: 0.965.

The Dashboard

National heatmap, state/county drill-downs, deep provider profiles. Peer comparison radar charts, top contributing factors, 10-year billing trajectories, network visualizations.

Key differentiator: live weight adjustment. Investigators tune feature weights in real-time and watch scores re-rank instantly.

Every score shows exactly how it was computed. The SQL queries are visible. The weights are visible. The peer groups are visible. Not a black box.

Every score shows exactly how it was computed. The SQL queries are visible. The weights are visible. The peer groups are visible. Not a black box.

The Results

Backtest Performance (train 2013-2019, validate 2020-2023)

42%
P@100 Confirmed Fraud
890x
Lift Over Random
0.965
GBT Validation AUC
6.9yr
Avg Detection Lead

42% precision at top 100 means 42 of every 100 highest-flagged providers are confirmed fraud. At base rate 0.047%, that's 890x improvement.

6.9-year lead time means the system would have flagged providers nearly 7 years before OIG excluded them. ~$3.4M prevented waste per provider caught early.

Out-of-Sample Validation

We searched every non-LEIE provider in our top 250 against DOJ press releases, OIG enforcement actions, court records, and state medical boards. 37 confirmed fraud cases. $500M+ total fraud value.

Highlights

RankFinding
#2DOJ settlement $1.2M
#755,000 false claims on deceased patients
#8$14.3M wound care fraud
#22$250M opioid conspiracy
#52$6.5M false claims (independently investigated by ProPublica)

Architecture

Single server. Zero external runtime dependencies. Under $800/month.

Single server: Go binary, DuckDB embedded, GraphWizard in-process, HTMX+Alpine+ECharts frontend embedded, <$800/mo infrastructure.

ComponentChoiceWhy
LanguageGoSingle binary, embedded assets, goroutines for pipeline
DatabaseDuckDB (embedded)No server, columnar analytics on 548M rows
Graph engineGraphWizard (in-process)Replaced 100GB RAM Memgraph dependency
FrontendHTMX + Alpine + EChartsEmbedded in binary, no build step
MLGo ML librariesIsolation Forest, K-Means, LOF, GBT -- all in-process
InfrastructureSingle VPS<$800/mo for the entire system

GraphWizard

Born from necessity. Open-sourced for everyone.

Built during and after the hackathon to replace Memgraph. 40+ algorithms, pure Go, zero dependencies beyond gonum, 97.3% test coverage. MIT licensed, open source.

Replaced a 100GB RAM Memgraph dependency with an in-process library. Full 82.6M-edge graph analysis in ~90 seconds.

MetricValue
Algorithms40+
Dependenciesgonum only
Test coverage97.3%
LicenseMIT
Graph size handled82.6M edges, 5.8M nodes
Analysis time~90 seconds
RAM replaced100GB (Memgraph) to in-process

The Numbers

Hackathon

Duration13 days
Team size4 people
Result1st place, ACT-IAC 2026

Fraud Detection

Confirmed cases37
Total fraud value$500M+
P@10042%
Lift over random890x
Avg lead time6.9 years

Infrastructure

Monthly cost<$800
External runtime deps0

Open Source

ProjectGraphWizard
Algorithms40+
LanguagePure Go