Current Work — Live Analysis

TOCA in Action
DOGE Medicaid Fraud Analysis

A systematic constraint analysis of the largest open-source Medicaid billing dataset in history — demonstrating that a 2,400-year-old structure of knowing can detect what conventional methods cannot.

Dataset: DOGE Medicaid Provider Spending (10.32 GB) · Released February 13, 2026
Methodology: Tetrahedral Ontological Closure Architecture (TOCA) · Patent Pending
Analyst: Steven Easley, CEO — Echosphere.io
Analysis Results — Full Submissions
 Download the DOGE Dataset
40%
Dataset Analyzed
$1.04T
Dollars Covered
805
Findings
289+
NPIs Flagged
$17B+
Flagged Exposure
1
DOJ Validation

Summary of Results

$13.3B
T1015 Phantom Clinics
9,228 providers billing only "clinic visits" with zero other services
$3.07B
New Entity Flooding
16,677 entities created 2023–24 billing $3B in under 24 months
$380M+
DC Behavioral Health
13 providers · 1 DOJ-confirmed · 12 uninvestigated
$190.6M
Drug Testing Mill
28 confirmatory tests per specimen vs. clinical standard of 2–5

DC Behavioral Health Cluster — Synchronized Growth

All providers billing H0034/H0036/H2017/H2019. Prestige Healthcare (marked *) indicted by DOJ August 2024. TOCA identified from data alone.

NPIEntityGrowthRevenueStatus
1619279940Preventive Measure of DC1,805%$10.4MOpen
1033685953Wellness Health Services1,600%$7.4MOpen
1366939225District Healthcare Services1,354%$7.6MOpen
1396048070MBI Health Services1,273%$7.3MOpen
1437628914New Hope Health Services1,053%$5.2MOpen
1144516824Prestige Healthcare Resources *864%$3.3MDOJ Indicted
1447716733NYA Health Services824%$4.6MOpen
1255857405Kinara Health & Home Care736%$3.9MOpen
1982182150Abundant Grace Health Svcs640%$3.6MOpen
1083062871CityCare Health Services446%$2.7MOpen
1417432733Wellness Healthcare ClinicsNEW$3.7MOpen
1720271836PSI Services III102%$801KOpen
1942451216Potential 13th member2,266%$92.5MOpen

Physically Impossible Billing — K-Constraint Violations

Claims-per-beneficiary ratios that exceed physical service delivery limits. These are not outliers — they are impossibilities.

NPIEntity / LocationCodePeak RatioPhysical MaxMultiple
1417409509Daily Home Care, TXS51251,454.6~12012×
1538343983Transport entityT20411,352.9~12011×
1407430143Lifeline Inc, DC995091,237.7~10012×
1720171895Counseling entityH1000854.5~6022×
1225163876SW CT Agency on Aging1286C782.0
1922477975Ogbatue, Riverside CAT1019463.8~96
1457621633JARC, Bloomfield Hills MIH2015494.5

Michigan H2015 — Statewide Pattern

15 providers across Michigan billing exclusively H2015 (Comprehensive Community Support). Many with taxonomy mismatches.

NPIEntityCityMax Ratio
1457621633JARCBloomfield Hills494.5
1144989351Starr's Watchful EyeSouthfield376.1
1619374899Elmira IncInkster204.6
1922157411Community Admin ServicesClinton Twp162.3
1629464227Integrated Living IncSterling Heights124.5
1396068771Quality Choice HCTroy118.5
1891010542Turning LeafLansing117.2
1922181312Community Opportunity CtrLivonia116.7
1871538108Quest IncLivonia113.7
1376633297Angels' PlaceSouthfield107.0
1316059850IMPACTPort Huron105.6
1184836082Kadima Jewish SupportSouthfield91.8

$190.6M Drug Testing Mill — Revenue Trajectory

NPI 1659769446 · 28 confirmatory drug test codes per specimen · Matches DOJ prosecution pattern (Millennium Health, $256M settlement)

YearRevenueMonthly AvgGrowth from 2018
2018$7.1M$592K
2019$11.7M$975K+65%
2020$18.5M$1.55M+160%
2021$28.8M$2.40M+306%
2022$40.4M$3.37M+469%
2023$44.2M$3.68M+522%
2024$39.9M$3.32M+462%

Full-Dataset Projection from 20% Sample

Pattern20% SampleFull Dataset (Est.)
T1015 phantom clinics$13.3B / 9,228 providers$15–20B / 12–15K providers
New entity flooding$3.07B / 16,677 entities$8–15B / 40K+ entities
DC behavioral health cluster$380M+ / 13 providers$400M+ (fully captured)
National H-code patternEmerging$1B+
Drug testing mills$190.6M$500M+ (est.)
Physically impossible billingDozens of casesHundreds of cases
Regional clusters2 confirmed (DC, MI)5–10 estimated
E&M upcoding epidemic534 surges / 78% of findingsUnder quantification ($60B+ in analyzed band)
TOTAL FLAGGED EXPOSURE$17B+$30–50B

Why This Exists

On February 13, 2026, the Department of Government Efficiency (DOGE) and the Department of Health and Human Services released the largest open-source Medicaid billing dataset in history — 10.32 gigabytes of provider spending data spanning 2018–2024. They invited the public to help find fraud.

We accepted.

Not because we expect a bounty. Because this dataset is the first public test of whether the structure we spent fifteen years recovering from Aristotle's ἐπιστήμη can do what we claim it can: detect structural invalidity that conventional methods miss.

Within 48 hours, our methodology independently identified a provider that the Department of Justice had indicted for $10 million in Medicaid fraud — without knowing the indictment existed. We found it from the data alone. Eleven of the twelve co-clustered providers remain apparently uninvestigated.

This page documents everything: what we found, how we found it, and how you can apply the same methodology. We would rather everyone know how to detect fraud than somehow profit from this. That would be more profit than we would ever hope for.

The Methodology: TOCA

The Tetrahedral Ontological Closure Architecture applies four constraints derived from Aristotle's theory of complete knowing. When applied to billing data, these constraints test not whether a claim is statistically unusual, but whether it is structurally possible.

K
Kinetic
Is the billed service volume physically deliverable? 1,454 claims per patient per month isn't an outlier — it's a physical impossibility.
T
Teleological
Is the growth pattern consistent with organic expansion? Twelve synchronized providers growing 500–1,800% in the same codes is not market dynamics.
R
Relational
Does the billing code match the provider's capabilities? A home health agency billing psychiatric community support violates what the entity is.
S
Structural
Do entity networks show coordinated behavior? Synchronized growth, shared codes, geographic clustering — the pattern of organized fraud.

Conventional fraud detection sorts by billing volume and investigates the largest billers. This catches the obvious. TOCA catches the rest — because it tests against limits that cannot be explained by legitimate variance.

Learn the full Hexis architecture →

Methodology Validation

Blind Identification of DOJ-Confirmed Fraud

During analysis of Tranche 6, TOCA flagged a cluster of 12 Washington DC behavioral health providers with synchronized explosive growth. NPI 1144516824 was identified as one of the twelve.

Subsequent investigation revealed that on August 2, 2024 — six months before our analysis — the U.S. Attorney's Office for the District of Columbia indicted the CFO of Prestige Healthcare Resources (that same NPI) and five community support workers for $10M+ in Medicaid fraud.

We found it from data patterns alone. The 11 remaining cluster members show equal or worse patterns and appear uninvestigated. The single largest entity in the cluster ($92.5M) exceeds the indicted entity by 28×.

What We Did

Step 1: Acquisition

The dataset (10.32 GB CSV) was downloaded from the DOGE/HHS public data portal. Each row represents a provider-code-month combination with columns for NPI, HCPCS code, month, beneficiary count, claim count, and total paid amount.

Step 2: Segmentation

The dataset was processed in 50MB tranches (~1 million rows each), sorted by total payment descending. This allowed systematic coverage from highest-billing to lowest-billing providers, with the top tranches capturing the highest-exposure entities first.

Step 3: TOCA Battery

Each tranche was subjected to a four-constraint battery:

K-Constraint scan: Calculate claims-per-beneficiary ratios. Any ratio exceeding physical service limits (accounting for code-specific service durations) is flagged as impossible.

T-Constraint scan: Compare 2018–2019 baseline billing to 2023–2024 billing. Growth exceeding 500% with no corresponding market explanation is flagged.

R-Constraint scan: Identify single-code providers (billing only one HCPCS code with zero diversification) and taxonomy-code mismatches where available.

S-Constraint scan: Cross-reference flagged entities by geography, code family, growth timing, and entity creation date. Clusters of coordinated behavior are escalated.

Step 4: Validation

Every finding was cross-referenced against public DOJ records, state Medicaid fraud reports, and news sources to determine whether patterns had been previously identified.

Step 5: Documentation

Findings were catalogued in sequential submissions, each covering a batch of tranches. This page aggregates all submissions into a single reference.

The Six System-Level Patterns

These are not individual anomalies. These are structural features of the Medicaid billing system that suggest endemic, organized fraud operating at scale.

Pattern 1: T1015 Phantom Clinics ~$13.3 Billion · 9,228 providers

Thousands of providers billing exclusively code T1015 (Clinic Visit) with zero other service codes. No diagnostic imaging, no labs, no prescriptions, no procedures. A legitimate clinic generates diverse codes because patients present with real conditions. An entity billing millions in visits with nothing else is not operating as a clinic.

Status: No public investigation found. Full-dataset projection: $15–20B.
Pattern 2: DC Behavioral Health Cluster $380M+ · 13 providers

Thirteen DC-area behavioral health providers showing synchronized 446–2,266% growth in H0034/H0036/H2017/H2019 codes. One confirmed by DOJ indictment (Prestige Healthcare Resources, August 2024). Twelve apparently uninvestigated. The largest uninvestigated entity ($92.5M) exceeds the indicted entity by 28×.

Status: 1 of 13 confirmed. 12 apparently uninvestigated.
Pattern 3: Michigan H2015 Statewide Multi-million · 15+ providers

Fifteen or more Michigan providers billing exclusively H2015 (Comprehensive Community Support) with >1,000% growth. Several exhibit taxonomy mismatches — home health agencies billing community psychiatric support they cannot deliver.

Status: No public investigation found.
Pattern 4: Multi-State H2015 (MI/IL/NC) Under quantification

The Michigan H2015 pattern extends to Illinois and North Carolina with identical code concentration and growth, suggesting interstate coordination or exploitation of a common billing vulnerability.

Status: No public investigation found.
Pattern 5: New Entity Flooding $3.07 Billion · 16,677 entities

Sixteen thousand entities created in 2023–2024 collectively billing $3.07 billion in under 24 months. Disproportionately clustered in personal care, behavioral health, and transportation codes.

Status: No public investigation found.
Pattern 6: E&M Upcoding Epidemic 534 surges · 78% of all surge findings

Evaluation & Management codes (99213, 99214, 99283, 99284, 99285) account for 78.3% of all surge-growth findings in the 20–40% band. The dominant vector is 99214 (Level 4 office visit): 191 independent providers surging an average of 446%. Providers systematically upcode Level 3 visits to Level 4, and Level 4 ED visits to Level 5. Multiple NPIs surge across 3+ E&M codes simultaneously, indicating practice-wide billing escalation. The pattern spans 2018–2024 — it predates COVID and continues through the most recent data.

Status: Confirmed February 17, 2026. Exposure under quantification — E&M billing in the analyzed band alone exceeds $60 billion.
Pattern 7: National H-Code Behavioral Health Potentially $1B+ (emerging)

H-code explosive growth extends beyond DC nationally. Providers outside the DC cluster show identical code families and growth trajectories, suggesting a national vulnerability in behavioral health Medicaid billing.

Status: Emerging pattern. Under investigation.

Select Findings: The Physically Impossible

These are not statistical outliers. These are billing volumes that violate the laws of physics.

Daily Home Care Services, Brookshire TX NPI 1417409509

1,454 attendant care claims per beneficiary in a single month. At 15-minute units, this equals 363 hours per beneficiary — in a month that contains 720 total hours. This provider claims each patient receives more than half of every hour in existence as attendant care. Even 24-hour care caps at ~120 claims. Actual: 12× the theoretical maximum.

$190.6M Urine Drug Testing Mill NPI 1659769446

A single entity billing $190.6 million across 84 months, including 28 distinct confirmatory urine drug test codes per specimen. Clinical standard: 1 presumptive + 2–5 confirmatory. This entity: 1 presumptive + 28 confirmatory. The textbook pattern prosecuted in United States v. Millennium Health ($256M settlement).

Analysis Log

Each submission covers a batch of tranches. Analysis began February 15, 2026 and is ongoing.

Submission Tranches Findings Key Discovery
001 1–2 1–5 Initial patterns, Columbia Valley dental anomaly
002 3–4 6–12 Michigan H2015 statewide, multi-state H2015
003 3–4 (cont.) 13–17 T1015 $7B concentration, new entity pattern
004 5–6 18–20 DC behavioral health cluster, physically impossible ratios
005 7–20 21–23 10% milestone, expanded patterns validated
006 21–40 24–26 Drug testing mill ($190.6M), new entity explosion
007 20% strategic assessment & Treasury submission
008 Tranche 2 (42 chunks) 27–805 40% milestone. 42.1M rows, $79.5B. 95 R-constraint, 682 T-constraint, 2 K-constraint. Pattern 6 confirmed: E&M upcoding (78% of surges). Filed with OIG Feb 17, 2026.
Full Treasury Submission Document
26 findings · 6 system patterns · $17B+ flagged · Methodology & validation detail
Download PDF
Tranche 2 Report — 40% Milestone
779 findings · 7 system patterns · $79.5B analyzed · Filed with OIG Feb 17, 2026
Download PDF Download TXT

Updates

February 17, 2026
Tranche 2 complete — 40% milestone reached. Bytes 2.0–4.1 GB processed as 42 sequential chunks (42.1 million rows, $79.5 billion). 779 new findings across 209 NPIs: 95 single-code providers confirming T1015/T1017 Pattern 1 across all payment tiers; 682 surge-growth flags revealing E&M codes comprise 78.3% of all surges — now designated Pattern 6 (E&M Upcoding Epidemic); 2 K-constraint volume impossibilities. NPI 1730491945 shows 2,921% surge on 99214. NPI 1962628289 flagged as persistent single-code biller across 13 of 42 chunks. Cumulative: 805 findings, 289+ NPIs, 7 system patterns, ~$1.04 trillion analyzed. Filed with OIG February 17, 2026. Download Tranche 2 Report (PDF)
February 15, 2026
Initial analysis complete. 20.3% of dataset processed (Tranches 1–40, ~28 million rows, ~$960 billion). 26 findings documented across 7 submissions. Treasury submission document compiled. Methodology independently validated against August 2024 DOJ indictment of Prestige Healthcare Resources (NPI 1144516824).

How We Filed with the OIG — A Walkthrough

Transparency is a core principle of this project. Below is the exact text we submitted to the HHS Office of Inspector General Hotline on February 17, 2026. We publish this for three reasons: to document what was filed, to demonstrate that the filing actually occurred, and to give anyone who identifies additional anomalies a template for their own submission.

The OIG does not provide confirmation numbers or receipts for hotline complaints. This public record is our proof of filing.

Important: The OIG portal rejects special characters and emojis. All filed text uses plain ASCII only — double hyphens instead of dashes, spelled-out “dollars” and “percent,” and “and” instead of ampersands. If you use our template, maintain this convention.

Step 1: Category Selection

At tips.oig.hhs.gov, select “Healthcare fraud”“Improper billing”. This routes your complaint to the team that handles Medicaid billing anomalies.

Step 2: Describe the Fraudulent Action (10,000 characters max)

This is the main complaint field. OIG says: provide details about when it happened, where, how it was committed, and how you learned about it. Here is what we submitted:

View Filed Text — Complaint Description (6,806 characters)
SUBJECT: Systematic Medicaid Billing Fraud -- 805 Statistical Anomalies
Across 289+ Providers Identified via Data Analysis of DOGE/HHS Dataset

SUMMARY OF COMPLAINT:

I am reporting systematic Medicaid billing fraud identified through
constraint-based statistical analysis of the medicaid-provider-spending.csv
dataset (10.32 GB), released February 13, 2026, by DOGE/HHS via
opendata.hhs.gov. Analysis of 40 percent of the dataset (approximately
70 million rows covering 1.04 trillion dollars in Medicaid claims) has
identified 805 statistical anomalies across 289+ unique National Provider
Identifiers (NPIs) consistent with indicators of fraud as defined by OIG
(42 CFR 455.2), ACFE, AICPA (AU-C Section 240), GAO Yellow Book standards,
and CMS Program Integrity Manual (Chapter 4).

This is Submission II (Tranche 2), covering bytes 2.0-4.1 GB of the
dataset (42.1 million rows, 79.5 billion dollars). A prior submission
(Submission I, 26 findings, 960 billion dollars, 20 percent coverage) was
filed February 16, 2026.

METHODOLOGY:

Three constraint-based tests were applied systematically:

K-Constraint (Volume Impossibility): Identifies providers whose
claims-per-beneficiary ratios exceed what is physically possible,
for example billing more service hours per patient than exist in a day.

R-Constraint (Single-Code Concentration): Identifies providers billing
exclusively one HCPCS/CPT code above a dollar threshold over sustained
periods. Single-code billing at high volumes is recognized by the ACFE
and OIG as a primary indicator of phantom billing, upcoding, or service
fabrication.

T-Constraint (Temporal Surge): Identifies providers with year-over-year
billing increases exceeding 300 percent, which CMS and OIG recognize as
fraud indicators.

FINDINGS -- SUBMISSION II (40 Percent Milestone):

1. R-CONSTRAINT (Single-Code Providers): 95 findings across 95 NPIs.
Providers billing exclusively one procedure code above 100,000 dollars
per year. T1015 (Medicaid clinic visits) and T1017 (targeted case
management) continue to dominate, confirming Pattern 1 identified in
Submission I. This pattern now extends across all payment tiers, not
just high-volume providers.

2. T-CONSTRAINT (Surge Growth): 682 findings across 209 NPIs.
The dominant finding: Evaluation and Management (E and M) codes account
for 78.3 percent of all surge-growth findings. Designated System Pattern
6: E and M Upcoding Epidemic.
- 99214 (Level 4 office visit): 191 providers surging average 446 percent
- 99213 (Level 3 office visit): 88 providers
- 99285 (Level 5 ED visit): 62 providers
- 99283 and 99284 (Level 3-4 ED visits): 78 providers combined
Highest surge: NPI 1730491945, 2,921 percent increase on code 99214.

3. K-CONSTRAINT (Volume Impossibility): 2 findings.

CONFIRMED SYSTEM PATTERNS (Cumulative):
Pattern 1: T1015/T1017 Phantom Clinic Network -- 13.3 billion dollars+
Pattern 2: DC Behavioral Health Cluster -- 380 million dollars+ / 13 providers
Pattern 3: Drug Testing Mills -- 190.6 million dollars
Pattern 4: Physically Impossible Billing Ratios
Pattern 5: New Entity Flooding -- 3.07 billion dollars / 16,677 entities
Pattern 6: E and M Upcoding Epidemic -- 534 surges / 78 percent of findings (NEW)
Pattern 7: National H-Code Behavioral Health -- emerging

DOJ VALIDATION: Prestige Healthcare Resources, LLC (NPI 1144516824,
Washington, D.C.) validated against DOJ indictment after TOCA flagged it.

SUPPLEMENTAL VERIFICATION: Business license, property, NPPES identity,
and public review checks conducted on subset of flagged providers --
all consistent with detected anomalies.

PROJECTION: 30-50 billion dollars total fraudulent/abusive billing
estimated at full dataset coverage.

Filed in good faith under False Claims Act (31 U.S.C. 3729-3733).
Steven Easley, Founder/CEO, Echosphere.io

Step 3: Attach Supporting Documents (50MB per file)

The portal accepts csv, doc, docx, gif, jpg, jpeg, pdf, png, tif, tiff, txt, xls, and xlsx. We attached:

Step 4: Evidence Description (2,000 characters max)

A secondary text field for listing evidence. Here is what we submitted:

View Filed Text — Evidence Description (1,325 characters)
ATTACHED DOCUMENTS:

1. TOCA_Submission_II.pdf -- Full 10-page report: 779 findings from
40 percent dataset milestone (42.1M rows, 79.5 billion dollars).
Includes methodology, all three constraint analyses (95 single-code,
682 surge-growth, 2 volume-impossible), Pattern 6 confirmation
(E and M Upcoding Epidemic), DOJ validation detail, and
legal/methodological disclosure.

2. TOCA_Submission_II.txt -- Plain text version for archival.

ADDITIONAL EVIDENCE AVAILABLE UPON REQUEST:
- Submission I report (20 percent coverage, 26 findings, filed Feb 16, 2026)
- Full Treasury submission document (17 billion dollars+ flagged exposure)
- Raw analysis scripts and intermediate outputs
- Open-source verification documentation (business licenses, property
  records, NPPES cross-references, patient review screenshots)
- Complete methodology specification for the TOCA constraint architecture

All materials also available at:
https://echosphere.io/pages/proof-medicaid.html

This is Submission II of a planned five-submission series covering the
complete 10.32 GB DOGE/HHS Medicaid dataset. Submissions III-V (60,
80, 100 percent) will be filed as analysis progresses.

What Happens Next

OIG states that every report is reviewed but not every submission results in an investigation. They do not contact every complainant. However, hotline tips are used to build cases, corroborate existing investigations, and identify patterns. Multiple independent reports about the same providers or patterns strengthen the case for formal investigation — which is why your participation matters.

Want to file your own OIG complaint? If you’ve used our data to verify a flagged provider and found corroborating evidence, you can file at tips.oig.hhs.gov. Use the template above as a starting point. Independent reports from multiple sources are more powerful than a single submission. If your evidence suggests fraud exceeding $1 million, consult a qui tam attorney — False Claims Act whistleblowers may recover 15–30% of recovered funds.

Open Investigation — Join the Analysis

The TOCA constraint battery identifies where to look. Confirming what happened requires eyes on the ground. We are recruiting analysts, investigators, journalists, and concerned citizens to help verify and expand these findings using publicly available data.

What We’ve Already Verified Beyond the Data

For a subset of flagged providers, we conducted supplemental open-source verification. These checks are not part of the TOCA statistical methodology — they represent independent corroboration:

How You Can Help

Every flagged NPI is searchable through the CMS NPPES Registry. Here is what independent investigators can check:

Business Registration

Search your state’s Secretary of State registry. Is the entity active? When formed? Do officers appear on other flagged entities?

Physical Address

Use Google Maps, county assessor records, and satellite imagery. Is it a clinical facility? A residence? A vacant lot?

Licensing & Credentials

Check state medical/nursing boards. Is the provider licensed? Any disciplinary actions? License consistent with services billed?

OIG Exclusion List

Search the HHS OIG LEIE database. Excluded individuals sometimes continue billing through new entities.

Public Reviews & Complaints

Google Reviews, BBB, state AG complaints. Patient reports of fabricated visits or phantom billing are direct evidence.

Cross-Dataset Analysis

Join NPPES, CMS Open Payments, state corporate registries, and the DOGE dataset. The new-entity-to-rapid-billing pipeline is one of the strongest fraud predictors.

Report What You Find

  1. Document everything. Screenshots, URLs, dates accessed. Public records can change.
  2. Contact us. Share findings with Echosphere.io for incorporation into cumulative analysis.
  3. Report to authorities. File with the HHS OIG Fraud Hotline (1-800-HHS-TIPS) or your state’s Medicaid Fraud Control Unit.
  4. Protect yourself. Whistleblower protections exist under the False Claims Act. If evidence exceeds $1M, consult a qui tam attorney — you may be entitled to 15–30% of recovered funds.

For Journalists

805 anomalies across 289+ providers. Seven system patterns. One DOJ-validated entity. The dataset is public. The methodology is documented. Every NPI is searchable. Every claim is verifiable. We welcome scrutiny.

For Data Scientists

Our K/R/T constraint battery is deliberately simple — three tests at scale. Network analysis, Benford’s Law, geographic clustering, and cross-program joins would extend these findings. The DOGE dataset is 10.32 GB of structured CSV — manageable with DuckDB on any modern laptop. If you want to contribute, reach out.

The end of knowledge is not power.
It is completeness.

Echosphere.io — Structure for the age of artificial intelligence