Every data source the JIL platform ingests, by LOB and refresh cadence.
JIL is a verification network. The integrity of every CREB we seal traces back to the data we ingested, when we ingested it, where the source is, and which line of business consumed it. This page lists every source in three groups: federal public datasets (no contract required), commercial subscriptions (used when a customer engagement requires depth a public source cannot reach), and customer-supplied records (under a BAA or GLBA basis). Replay-grade transparency, not vague positioning.
Public, free, no contract required.
Every row below pulls from a public data publisher, most of them federal. No subscription, no DUA, no per-record licensing. JIL ingests, hashes the source file for replay, indexes into PostgreSQL, and runs the LOB-specific check pack. These are the sources behind the eight live POC pages and the CMS attestation backbone.
| Source | Provider | Refresh | Format | LOB(s) | Status | Live POC |
|---|---|---|---|---|---|---|
| Fails-to-Deliver Register | SEC FOIA | Monthly (a/b half-files) | pipe-delimited | capmarkets | LIVE | capmarkets-poc (339K rows) |
| USAspending.gov API | Treasury / OMB | Real-time / daily | JSON REST | grants federal-investigator | LIVE | grants-poc (1K awards / $2.96T) |
| DOL OFLC LCA Disclosure | DOL ETA | Quarterly | XLSX | h1b | LIVE | h1b-poc (337K real LCAs) |
| UN Comtrade API | UN Statistics Division | Annual / Quarterly | JSON REST | trade-finance | WIRED | trade-finance-poc (rate-limited; synthetic backstop) |
| USCIS Regional Centers | USCIS | Ad-hoc | HTML / PDF | eb5 federal-investigator | WIRED | eb5-poc (anon-blocked; synthetic backstop) |
| USCIS Data Hub processing-times | USCIS | Quarterly | JSON REST | eb5 | WIRED | eb5-poc |
| NHTSA FARS (Fatality Analysis Reporting System) | NHTSA | Annual | CSV (zipped) | pc | PENDING | pc-poc (build queued) |
| BLS Occupational Injuries (SOII) | Bureau of Labor Statistics | Annual | CSV / API | wc | PENDING | wc-poc (build queued) |
| CMS Medicare Inpatient by Provider+Service | CMS | Annual | CSV | MCO federal-investigator | LIVE | ava-poc (145K rows / $90.94B) |
| CMS Outpatient by Provider+APC | CMS | Annual | CSV | MCO | LIVE | ava-poc (117K rows) |
| CMS DMEPOS by Referring Provider | CMS | Annual | CSV | MCO | LIVE | ava-poc (498K rows) |
| CMS Part D Prescriber by Drug | CMS | Annual | CSV | MCO | LIVE | ava-poc (476K rows) |
| CMS Provider of Services (POS) file | CMS | Quarterly | CSV | MCO federal-investigator | LIVE | ava-poc (44K rows) |
| CMS Hospice utilization | CMS | Annual | CSV | MCO | LIVE | ava-poc (5,772 rows) |
| NPPES (NPI Registry) | CMS | Weekly | CSV bulk | MCO all KYC | LIVE | ava-poc |
| CERT FY2024 detector library | CMS | Annual | internal seed | MCO federal-investigator | LIVE | ava-poc |
| CMS Owners file (regional centers, ownership) | CMS | Quarterly | CSV | MCO | WIRED | UBO graph |
| PECOS (Provider Enrollment Chain & Ownership) | CMS | Quarterly | internal feed | MCO federal-investigator | WIRED | UBO graph |
| MAC jurisdiction map | CMS | Quarterly | internal seed | MCO | LIVE | ava-poc |
| Etherscan public API (token transfers) | Etherscan | Block-level (~12s) | JSON REST | p2p wallet-intel | LIVE | p2p-poc (1K USDC transfers) |
| SEC EDGAR (filings) | SEC | Real-time | JSON / XBRL | capmarkets asset-intel | WIRED | — |
| TreasuryDirect / FFIEC bank financials | Treasury / FFIEC | Quarterly | CSV | capmarkets | WIRED | — |
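
The ingest step described above (hash the source file, then index it) might look roughly like this for a pipe-delimited file such as the Fails-to-Deliver register. This is a minimal sketch, not JIL's pipeline: the function name `ingest_pipe_delimited`, the returned field names, and the choice of SHA-256 are assumptions, and the sample columns are placeholders rather than the SEC layout.

```python
import csv
import hashlib
import io
from datetime import datetime, timezone


def ingest_pipe_delimited(raw: bytes) -> dict:
    """Hash the raw bytes first, then parse the rows (hypothetical helper)."""
    # Hash the exact bytes as published, before any decoding or cleanup,
    # so a later replay can confirm it starts from the same file.
    digest = hashlib.sha256(raw).hexdigest()
    text = raw.decode("utf-8", errors="replace")
    reader = csv.DictReader(io.StringIO(text), delimiter="|")
    rows = list(reader)
    return {
        "sha256": digest,
        "ingested_at": datetime.now(timezone.utc).isoformat(),
        "row_count": len(rows),
        "rows": rows,
    }


# Placeholder data; the column names are illustrative only.
sample = b"DATE|SYMBOL|QUANTITY\n20240102|ABC|1500\n20240102|XYZ|20\n"
result = ingest_pipe_delimited(sample)
print(result["row_count"], result["sha256"][:12])
```

Hashing before decoding matters here: any normalization applied ahead of the hash would make the digest describe the cleaned copy rather than the published file.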
Cross-vertical compliance feeds.
These feed every LOB: identity, sanctions, exclusions, and beneficial-ownership lookups. Most are free; OpenCorporates carries a free tier for low volume and a paid tier for entity resolution at scale.
| Source | Provider | Refresh | Type | LOB(s) | Status |
|---|---|---|---|---|---|
| OFAC SDN List | Treasury OFAC | Daily | Public free | all KYC p2p trade-finance | LIVE |
| UN Consolidated Sanctions | UN Security Council | Ad-hoc | Public free | all KYC | LIVE |
| HMT (UK) Consolidated List | HM Treasury UK | Daily | Public free | all KYC | LIVE |
| EU Consolidated Financial Sanctions | EU Commission | Daily | Public free | all KYC | WIRED |
| OpenSanctions / Yente | OpenSanctions | Daily | Public free | all KYC | LIVE |
| OIG LEIE (excluded individuals) | HHS OIG | Monthly | Public free | MCO federal-investigator | WIRED |
| SAM.gov exclusions | GSA | Daily | Public free | grants federal-investigator all vendor | WIRED |
| GLEIF LEI Registry | GLEIF | Daily | Public free | all institutional | LIVE |
| FinCEN BOI Reporting (when published) | FinCEN | Real-time | Public free | all KYB | PENDING |
| FINRA disciplinary database | FINRA | Real-time | Public free | capmarkets | WIRED |
| CFTC enforcement database | CFTC | Real-time | Public free | capmarkets trade-finance | WIRED |
| DOJ enforcement / qui tam relator records | DOJ | Real-time | Public free | federal-investigator MCO | WIRED |
| OpenCorporates (entity registry) | OpenCorporates | Real-time API | Free + paid tier | eb5 all KYB | WIRED |
| RDAP domain age + WHOIS | ICANN / registrars | Real-time | Public free | all BEC | LIVE |
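
As one illustration of how a feed in this table can become a signal, the sketch below derives domain age from an RDAP response. RDAP responses carry an `events` array whose entries have `eventAction` and `eventDate` members. The function names, the 30-day threshold, and the rule that an unknown age is treated as suspicious are assumptions for illustration, not JIL's check logic.

```python
from datetime import datetime, timezone
from typing import Optional


def domain_age_days(rdap: dict, now: datetime) -> Optional[int]:
    """Return whole days since the RDAP 'registration' event, or None."""
    for event in rdap.get("events", []):
        if event.get("eventAction") == "registration":
            # Older Python versions do not parse a trailing "Z", so normalize it.
            stamp = event["eventDate"].replace("Z", "+00:00")
            registered = datetime.fromisoformat(stamp)
            return (now - registered).days
    return None


def is_young_domain(rdap: dict, now: datetime, min_age_days: int = 30) -> bool:
    """Flag domains younger than the threshold; unknown age is flagged too."""
    age = domain_age_days(rdap, now)
    return age is None or age < min_age_days


# Trimmed, made-up response in the RDAP JSON shape.
response = {
    "ldhName": "example.test",
    "events": [
        {"eventAction": "registration", "eventDate": "2024-05-01T00:00:00Z"},
        {"eventAction": "expiration", "eventDate": "2025-05-01T00:00:00Z"},
    ],
}
now = datetime(2024, 5, 11, tzinfo=timezone.utc)
print(domain_age_days(response, now), is_young_domain(response, now))
```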
Paid feeds for engagement-grade depth.
Tier 2 of the JIL economic model brings these in on a per-engagement basis. We do not carry the subscription cost as a fixed overhead; the customer engagement either funds the data path or chooses a public-data-only Tier 1 baseline. Every paid feed below has a public-data fallback or is optional for the verticals that consume it.
| Source | Provider | Refresh | Cost band | LOB(s) | Status |
|---|---|---|---|---|---|
| Bloomberg Terminal data | Bloomberg | Real-time | $$$ | capmarkets asset-intel | PENDING (engagement-funded) |
| Refinitiv (LSEG) market reference | LSEG | Real-time | $$$ | capmarkets | PENDING (engagement-funded) |
| Chainalysis KYT / Reactor | Chainalysis | Real-time | $$$ | wallet-intel p2p | PENDING (customer uses their own subscription) |
| TRM Labs | TRM Labs | Real-time | $$$ | wallet-intel p2p | PENDING (customer uses their own subscription) |
| ATTOM Property + Address Intelligence | ATTOM Data | Daily | $$ | MCO pc | WIRED |
| Etherscan Pro (higher rate limit) | Etherscan | Real-time | $ | p2p wallet-intel | WIRED (using free tier today) |
| Helius RPC + DAS API | Helius | Real-time | $ | wallet-intel p2p | WIRED |
| Plaid (banking data) | Plaid | Real-time | $$ | Money Passport | PENDING |
| IRS 4506-C IVES | IRS | On-demand | $ per request | Money Passport | PENDING (IVES participant approval) |
Under BAA, GLBA, or comparable basis.
Customer-supplied records never leave the customer's perimeter. Verdict-engine ingestion runs inside the customer's tenant or against a read-only adapter on the customer's side. JIL receives only the signed verdict record and case-file artifacts, not the underlying data.
| Record type | LOB(s) | Contents |
|---|---|---|
| Settlement records | capmarkets | Trade records, SWIFT 5xx messages, FIX, ISO 20022 sese. Custodian / broker / fund-admin sources. Real-time stream when paid engagement is active. |
| Position files | capmarkets | Daily position records from each system that should agree (custodian, broker, fund-admin, CSD). Cross-system reconciliation runs against this set. |
| Bank wire records | all Pre-Settlement | Outbound wire instructions intercepted before release. Sub-2-second YES / NO / REVIEW gate. |
| MCO claim records | MCO | Provider claim files, encounter records, prior-authorization decisions. PHI; under BAA. Tier 2 claim integrity work. |
| H-1B beneficiary documents | h1b | Sponsor-supplied labor condition files, payroll attestations. Optional Tier 2 deepening. |
| Workers' comp claims | wc | Carrier-supplied claim event records, medical bills, employer records. Tier 2 only. |
| P&C claim files | pc | Carrier-supplied claim event records, repair estimates, photos, telematics. Tier 2 only. |
| Trade finance documents | trade-finance | Letters of credit, bills of lading, customs declarations. Bank-supplied under BAA-equivalent for cross-border ops. |
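
The perimeter rule above (only a signed verdict record leaves the customer's side, never the underlying data) can be sketched as follows. Everything here is illustrative: the field names, the helper names, and the use of HMAC-SHA256 as the signature are assumptions. The page does not say which signature scheme JIL uses, and a production system would more plausibly sign with an asymmetric key.

```python
import hashlib
import hmac
import json

ALLOWED_VERDICTS = {"YES", "NO", "REVIEW"}


def _canonical(obj: dict) -> bytes:
    # Sorted keys and fixed separators so equal records serialize identically.
    return json.dumps(obj, sort_keys=True, separators=(",", ":")).encode("utf-8")


def build_verdict_record(customer_record: dict, verdict: str, key: bytes) -> dict:
    """Return a record holding only a digest of the input, the verdict, and a signature."""
    if verdict not in ALLOWED_VERDICTS:
        raise ValueError(f"unknown verdict: {verdict}")
    body = {
        "record_sha256": hashlib.sha256(_canonical(customer_record)).hexdigest(),
        "verdict": verdict,
    }
    body["signature"] = hmac.new(key, _canonical(body), hashlib.sha256).hexdigest()
    return body


def verify_verdict_record(record: dict, key: bytes) -> bool:
    """Recompute the signature over everything except the signature field."""
    body = {k: v for k, v in record.items() if k != "signature"}
    expected = hmac.new(key, _canonical(body), hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, record.get("signature", ""))


# Made-up wire instruction; none of its fields appear in the outgoing record.
wire = {"beneficiary": "ACME LLC", "amount": "125000.00", "currency": "USD"}
record = build_verdict_record(wire, "REVIEW", b"demo-key")
print(sorted(record), verify_verdict_record(record, b"demo-key"))
```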
How fresh the verdict is, by source class.
Real-time / block-level
Etherscan, EDGAR, OFAC SDN delta, USAspending API, OpenSanctions, OpenCorporates API, GLEIF, RDAP. Latency from publication to JIL findings: seconds to minutes.
Daily
OFAC SDN full refresh, HMT UK, EU, NPPES delta, SAM.gov exclusions, ATTOM. Standard cron pulls.
Weekly
NPPES bulk, OIG LEIE delta, sanctions consolidation. Tuesday-night cron.
Monthly
SEC fails-to-deliver register (a/b half-files), OIG LEIE full, MAC jurisdiction. Calendar-month rollover.
Quarterly
DOL OFLC LCA disclosure, USCIS Data Hub processing-times, CMS POS file, PECOS, FFIEC bank financials. Calendar-quarter rollover.
Annual
NHTSA FARS, BLS SOII, CMS Inpatient / Outpatient / DMEPOS / Part D / Hospice / SNF, CERT detector library. Calendar-year rollover; lag of 6-18 months from end of year.
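The cadence classes above lend themselves to a simple staleness check: compare the time since the last successful pull against a tolerance for the source's class. The sketch below assumes the tolerances and the timestamps; the page names the classes but not these numbers, and `is_stale` is a hypothetical helper.

```python
from datetime import datetime, timedelta, timezone

# Assumed tolerance per cadence class (illustrative values only).
MAX_AGE = {
    "real-time": timedelta(minutes=15),
    "daily": timedelta(days=1, hours=6),
    "weekly": timedelta(days=8),
    "monthly": timedelta(days=35),
    "quarterly": timedelta(days=100),
    "annual": timedelta(days=400),
}


def is_stale(cadence: str, last_ingest: datetime, now: datetime) -> bool:
    """True when the last pull is older than the class tolerance."""
    return now - last_ingest > MAX_AGE[cadence]


# Made-up last-pull timestamps for two sources named on this page.
now = datetime(2025, 3, 10, 12, 0, tzinfo=timezone.utc)
last_pulls = {
    "OFAC SDN full refresh": ("daily", datetime(2025, 3, 10, 2, 0, tzinfo=timezone.utc)),
    "NPPES bulk": ("weekly", datetime(2025, 2, 25, 23, 0, tzinfo=timezone.utc)),
}
stale = sorted(
    name for name, (cadence, ts) in last_pulls.items() if is_stale(cadence, ts, now)
)
print(stale)
```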
Which sources each LOB consumes.
capmarkets LIVE
SEC FTD, EDGAR, FFIEC, FINRA, CFTC, GLEIF + customer settlement records. Optional Tier 2: Bloomberg / Refinitiv.
grants LIVE
USAspending.gov, SAM.gov exclusions, OFAC SDN, GLEIF, OpenCorporates + customer-supplied awardee records.
h1b LIVE
DOL OFLC LCA, USCIS, OFAC, GLEIF, OpenCorporates + sponsor-supplied wage records.
eb5 LIVE
USCIS Data Hub, USCIS regional centers, SEC EDGAR, OFAC, OpenCorporates + investor source-of-funds documentation.
p2p LIVE
Etherscan, OFAC SDN crypto-address attribution, OpenSanctions + customer transaction records. Optional: Chainalysis / TRM.
trade-finance LIVE
UN Comtrade, OFAC, GLEIF + bank-supplied trade documents.
pc PENDING
NHTSA FARS + carrier-supplied claim records. Optional ATTOM for premise.
wc PENDING
BLS SOII, NPPES (medical providers) + carrier-supplied claim records.
MCO · Medicare / Medicaid LIVE
Full CMS stack (Inpatient, Outpatient, DMEPOS, Part D, POS, NPPES, OIG LEIE, MAC, CERT) + customer claim records under BAA.
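The per-LOB lists above can also be read in the other direction: given a source, which LOBs depend on it. A minimal sketch, using a subset of the lists on this page and a hypothetical helper name, inverts the mapping.

```python
from collections import defaultdict

# Subset of the LOB lists on this page, copied as written.
LOB_SOURCES = {
    "capmarkets": ["SEC FTD", "EDGAR", "FFIEC", "FINRA", "CFTC", "GLEIF"],
    "grants": ["USAspending.gov", "SAM.gov exclusions", "OFAC SDN", "GLEIF", "OpenCorporates"],
    "trade-finance": ["UN Comtrade", "OFAC", "GLEIF"],
    "pc": ["NHTSA FARS"],
}


def lobs_by_source(lob_sources: dict) -> dict:
    """Invert LOB -> sources into source -> sorted list of LOBs."""
    index = defaultdict(set)
    for lob, sources in lob_sources.items():
        for source in sources:
            index[source].add(lob)
    return {source: sorted(lobs) for source, lobs in index.items()}


index = lobs_by_source(LOB_SOURCES)
print(index["GLEIF"])
```

An index like this answers the operational question behind the table: if one feed is late or down, which verticals are affected.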
Every CREB carries the source manifest.
Each CREB-anchored finding embeds a reproducibility manifest that lists the exact source-file hash, ingest timestamp, code version, and signal threshold used. A regulator, auditor, or counterparty can replay the analysis bit-identically using the same federal source file plus the manifest. The data-source pages above are indexed by the same manifest fields.
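A manifest with the four fields named above might be modeled as below, along with the precondition check a replayer would run before re-executing the analysis. This is a sketch under assumptions: the class name, field names, SHA-256, and the example values are all hypothetical, and the actual CREB manifest format is not specified on this page.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class ReplayManifest:
    source_sha256: str       # hash of the exact source file that was ingested
    ingest_timestamp: str    # ISO 8601 time of ingestion
    code_version: str        # version of the check code that produced the finding
    signal_threshold: float  # threshold the signal was evaluated against


def can_replay(manifest: ReplayManifest, source_bytes: bytes, running_code_version: str) -> bool:
    """A replay is only meaningful if the file and the code both match the manifest."""
    same_file = hashlib.sha256(source_bytes).hexdigest() == manifest.source_sha256
    same_code = running_code_version == manifest.code_version
    return same_file and same_code


# Made-up source bytes and manifest values.
source = b"DATE|SYMBOL|QUANTITY\n20240102|ABC|1500\n"
manifest = ReplayManifest(
    source_sha256=hashlib.sha256(source).hexdigest(),
    ingest_timestamp="2024-01-16T03:00:00+00:00",
    code_version="1.4.2",
    signal_threshold=0.95,
)
print(can_replay(manifest, source, "1.4.2"))
print(can_replay(manifest, source + b"extra row\n", "1.4.2"))
```

Checking the hash and code version first keeps a mismatch from surfacing later as an unexplained difference in findings.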