H-1B Labor · live proof

Wage shortfalls, body-shop sponsors, and NAICS-volume anomalies surfaced from the DOL OFLC LCA register.

h1b-engine ingested the DOL OFLC LCA Disclosure register and ran three deterministic checks: wage-below-prevailing on certified LCAs, employer concentration to flag body-shop sponsorship patterns, and NAICS volume anomaly to flag targeted-industry footprints. The output is a CREB (Court Ready Evidence Bundle) ready for DOL Wage and Hour back-wage filings, USCIS FDNS site-visit referrals, and FLSA collective-action packets.

LCA records ingested: _
Distinct sponsors: _
Wage-shortfall LCAs: _
Tier 1 findings produced: _
Section 01 · What the data is

DOL OFLC LCA Disclosure register, real federal data.

Source. The Department of Labor Office of Foreign Labor Certification publishes the LCA Disclosure register quarterly at dol.gov/agencies/eta/foreign-labor/performance. Each row is a single Labor Condition Application: employer, FEIN, worksite, SOC code, NAICS code, wage rate offered, prevailing wage, decision, decision date.

What we ingested. Two recent quarters of DOL OFLC LCA disclosures (FY2024 Q3 plus FY2024 Q4 by default) plus a synthetic top-30-sponsor cohort sized to the standard sponsor leaderboard. Both are loaded into h1b.lca_disclosures, keyed on case_number as a deterministic primary key so that runs can be replayed.

What the engine does. Three checks run against the ingested data:

  • h1b_wage_below_prevailing - certified LCAs where the offered wage is below the prevailing wage once both are annualized to a common unit basis.
  • h1b_employer_concentration - sponsors with high LCA volume, broad SOC dispersion, and multi-state worksites - the canonical body-shop / third-party-placement signature.
  • h1b_naics_anomaly - NAICS codes with abnormally high LCA volume in the ingest window. Placeholder check ahead of the BLS QCEW revenue-ratio cross-join.
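
The wage comparison in the first check can be sketched as follows. This is a minimal illustration, not the engine's code: the unit labels and the annualization factors (2,080 hours, 52 weeks, 26 bi-weekly periods, 12 months) are assumptions, and the function names are hypothetical.

```python
from decimal import Decimal

# Assumed annualization factors; the factors h1b-engine uses are not stated.
ANNUAL_FACTOR = {
    "hour": Decimal(2080),
    "week": Decimal(52),
    "bi-weekly": Decimal(26),
    "month": Decimal(12),
    "year": Decimal(1),
}


def annualize(amount, unit):
    """Convert a wage quoted per `unit` into an annual figure."""
    return Decimal(str(amount)) * ANNUAL_FACTOR[unit.strip().lower()]


def wage_shortfall(offered, offered_unit, prevailing, prevailing_unit):
    """Return the annual deficit when the offered wage is below prevailing, else None."""
    offered_annual = annualize(offered, offered_unit)
    prevailing_annual = annualize(prevailing, prevailing_unit)
    if offered_annual < prevailing_annual:
        return prevailing_annual - offered_annual
    return None
```

Annualizing both sides before comparing avoids flagging an LCA only because the offered wage is quoted hourly and the prevailing wage yearly.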

Why this matters. 20 CFR 655.731 requires the employer to pay the higher of the actual wage paid to similarly employed workers or the prevailing wage. The INA section 212(n) attestation is the legal predicate. A wage shortfall on a certified LCA is a candidate DOL Wage and Hour Division back-wage claim and a possible FLSA issue, pending adjudication. High-volume sponsors with broad SOC dispersion and multi-state worksites match the classic USCIS FDNS site-visit and DOL audit targeting pattern.

Section 02 · Tier 1 findings, top employer-concentration sponsors

Sponsors at the top of the volume-and-dispersion list.

Each row below comes from running h1b_employer_concentration against the live ingested register. Severity scales with LCA count: critical at 5,000+, high at 1,000+, medium at 250+, low below 250. The list is the canonical USCIS H-1B sponsor leaderboard - high volume is not by itself non-compliance, but the volume-and-dispersion signature is the FDNS targeting pattern.
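
A minimal sketch of the per-sponsor rollup and the severity tiers quoted above. The thresholds come from this page; the row field names (fein, soc_code, worksite_state) and function names are hypothetical, and the engine's own dispersion scoring is not shown here.

```python
from collections import defaultdict


def concentration_severity(lca_count):
    """Map a sponsor's LCA count onto the severity tiers stated on this page."""
    if lca_count >= 5000:
        return "critical"
    if lca_count >= 1000:
        return "high"
    if lca_count >= 250:
        return "medium"
    return "low"


def summarize_sponsors(rows):
    """Group LCA rows by FEIN; count LCAs, distinct SOC codes, and distinct states."""
    acc = defaultdict(lambda: {"lca_count": 0, "soc_codes": set(), "states": set()})
    for row in rows:
        sponsor = acc[row["fein"]]
        sponsor["lca_count"] += 1
        sponsor["soc_codes"].add(row["soc_code"])
        sponsor["states"].add(row["worksite_state"])
    return {
        fein: {
            "lca_count": s["lca_count"],
            "soc_codes": len(s["soc_codes"]),
            "states": len(s["states"]),
            "severity": concentration_severity(s["lca_count"]),
        }
        for fein, s in acc.items()
    }
```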

# | Sponsor | FEIN | LCA count | Workers | SOC codes | States | Tier 1 signals
(rows load from h1b-engine /v1/findings)
Reality check. Cognizant, Infosys, TCS, Wipro, HCL, Accenture, Deloitte, Capgemini, EY, Microsoft, Amazon, Google, Apple, Meta, JPMorgan, Goldman, Morgan Stanley, Citi, Bank of America, UnitedHealth - the top of this list is the USCIS H-1B Employer Data Hub leaderboard. h1b-engine surfaces it in 90 seconds with the body-shop indicator score (broad SOC dispersion + multi-state worksites + high volume) for each sponsor. Routing each finding through Ava (next layer) produces the CREB-ready FDNS / WHD referral packet for the customer.
Section 03 · Severity distribution of the wage-shortfall findings

Where the dollars sit.

h1b-engine produces one wage-shortfall finding per LCA where the offered wage is below the prevailing wage once both are annualized to a common unit basis. Severity scales with annualized aggregate exposure (annual deficit times total covered workers): critical at $1M+, high at $250K+, medium at $50K+, low under $50K.

_ critical

Aggregate annualized exposure of $1M or more per LCA. The largest cases drive the DOL Wage and Hour back-wage filings.

_ high

Annualized exposure of $250K to $1M per LCA. Common in financial services and consulting cohorts.

_ medium / low

Annualized exposure under $250K per LCA. Long tail of smaller per-LCA shortfalls; aggregate by employer to produce the CREB.
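
The exposure tiers above, and the by-employer aggregation suggested for the long tail, can be sketched like this. The thresholds are the ones stated in this section; the field names (employer, annual_exposure) and function names are hypothetical.

```python
from collections import defaultdict


def exposure_severity(annual_exposure):
    """Map annualized aggregate exposure (in dollars) onto the stated tiers."""
    if annual_exposure >= 1_000_000:
        return "critical"
    if annual_exposure >= 250_000:
        return "high"
    if annual_exposure >= 50_000:
        return "medium"
    return "low"


def exposure_by_employer(findings):
    """Sum per-LCA exposure by employer so small shortfalls roll up to one total."""
    totals = defaultdict(int)
    for finding in findings:
        totals[finding["employer"]] += finding["annual_exposure"]
    return dict(totals)
```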

Section 04 · The 3 h1b-engine checks

What ships when an H-1B oversight customer engages.

The wage-shortfall and concentration findings on this page are the public-data baseline. h1b-engine ships three production checks gated on the customer profile lob = 'h1b_compliance_investigator'. Each check runs deterministically and produces sealed CREB output through the same orchestrator and Ava layer that powers the rest of the platform.

h1b_wage_below_prevailing

Certified LCAs whose offered wage falls below the prevailing wage once both are annualized to a common unit basis. 20 CFR 655.731, INA section 212(n), DOL WHD / FLSA.

h1b_employer_concentration

High-volume sponsors with broad SOC dispersion and multi-state worksites - the body-shop / third-party-placement signature. USCIS FDNS, INA section 214(c)(14), DOL WHD H-1B audit framework.

h1b_naics_anomaly

NAICS codes with anomalously high LCA volume. Placeholder ahead of the BLS QCEW revenue-ratio cross-join. USCIS FDNS targeted-industry program.
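
The page does not say how "anomalously high" is defined for the placeholder check. As one possible illustration only, the sketch below flags NAICS codes whose LCA count is at least a chosen multiple of the median count across codes. The rule, the default ratio, and the function name are all assumptions, not the engine's method.

```python
from collections import Counter
from statistics import median


def naics_volume_outliers(naics_codes, ratio=10.0):
    """Flag NAICS codes whose LCA count is at least `ratio` times the median count.

    `naics_codes` holds one NAICS code per ingested LCA row. The median-ratio
    rule and the default ratio are illustrative assumptions.
    """
    counts = Counter(naics_codes)
    if not counts:
        return {}
    baseline = median(counts.values())
    return {code: n for code, n in counts.items() if n >= ratio * baseline}
```

A median baseline is used here because a few very large codes would otherwise inflate a mean and hide themselves.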

Section 05 · Sample CREB (Court Ready Evidence Bundle)

What the customer takes to the regulator or to court.

One wage-shortfall finding, rendered as a sealed CREB record. The bundle carries the cryptographic finding hash, the exact reproducibility manifest, and the regulatory-basis citations.

finding_id : loading ...
check_id : h1b_wage_below_prevailing
subject_type : lca
subject_id : loading ...
employer : loading ...
soc_code : loading ...
worksite_state : loading ...
wage_rate_paid_annual: loading ...
prevailing_wage_annual: loading ...
deficit_annual : loading ...
deficit_pct : loading ...
total_workers : loading ...
annual_exposure : loading ...
severity : loading ...
source : DOL OFLC LCA Disclosure register
regulatory_basis : 20 CFR 655.731, INA section 212(n), DOL Wage and Hour Division (FLSA)
code_version : h1b-engine@2026.05.01-h1b-1
model_version : h1b-v1
replay_command : jil-attest replay --bundle H1B-WAGE-2026-05-01-A001
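
How the derived fields in the record relate to each other can be sketched as below. Annual exposure as deficit times total workers follows the definition in Section 03; expressing deficit_pct relative to the prevailing wage, and the rounding to two decimals, are assumptions. The function name is hypothetical.

```python
def derive_wage_fields(wage_rate_paid_annual, prevailing_wage_annual, total_workers):
    """Compute the deficit and exposure fields from the annualized wages."""
    deficit_annual = prevailing_wage_annual - wage_rate_paid_annual
    # Assumption: the percentage is taken relative to the prevailing wage.
    deficit_pct = round(100 * deficit_annual / prevailing_wage_annual, 2)
    annual_exposure = deficit_annual * total_workers
    return {
        "deficit_annual": deficit_annual,
        "deficit_pct": deficit_pct,
        "annual_exposure": annual_exposure,
    }
```

For example, a paid wage of 93,600 against a prevailing wage of 100,000 for 50 workers gives a 6,400 annual deficit and 320,000 of annual exposure, which falls in the high tier.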
Section 06 · Methodology and replayability

Deterministic, reproducible, court-defensible.

Deterministic

Each check is a SQL aggregate over a public federal dataset. Given the same DOL OFLC release, the same column normalization, and the same wage-unit annualization, every run produces the same finding cohort.
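
A minimal sketch of what a deterministic SQL check can look like, using an in-memory SQLite table in place of h1b.lca_disclosures. The column names other than case_number, the 'Certified' status literal, and the query itself are assumptions; the explicit ORDER BY on the primary key is what keeps the output order stable regardless of load order.

```python
import sqlite3

# Hypothetical schema and query; only case_number as primary key is from the page.
QUERY = """
SELECT case_number,
       prevailing_wage_annual - wage_rate_paid_annual AS deficit_annual
FROM lca_disclosures
WHERE case_status = 'Certified'
  AND wage_rate_paid_annual < prevailing_wage_annual
ORDER BY case_number
"""


def run_wage_check(rows):
    """Load rows into an in-memory table and return the ordered finding cohort."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE lca_disclosures ("
        "case_number TEXT PRIMARY KEY, case_status TEXT, "
        "wage_rate_paid_annual INTEGER, prevailing_wage_annual INTEGER)"
    )
    conn.executemany("INSERT INTO lca_disclosures VALUES (?, ?, ?, ?)", rows)
    findings = conn.execute(QUERY).fetchall()
    conn.close()
    return findings
```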

No external LLM

The Tier 1 verdict path is rule-based. Ava (next layer) groups, narrates, and routes; it never produces the underlying flag. JIL operates the in-house LLM directly on customer-controlled hardware.

Replay manifest

Every CREB carries the source-dataset hash, code version, ingest-script version, query plan, and signal thresholds. A third party with the same DOL release can replay the analysis and obtain bit-identical output.
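
One way to make a manifest comparable across replays is to hash a canonical serialization of it. The sketch below uses sorted-key JSON and SHA-256; this is an illustrative assumption, since the actual sealing and hashing scheme behind the CREB is not described on this page. The function name and the manifest keys in the usage example are hypothetical.

```python
import hashlib
import json


def manifest_digest(manifest):
    """Return a SHA-256 hex digest of a canonical JSON encoding of the manifest."""
    canonical = json.dumps(
        manifest,
        sort_keys=True,          # key order must not affect the digest
        separators=(",", ":"),   # no insignificant whitespace
        ensure_ascii=True,
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Usage: two parties with the same manifest contents get the same digest.
example = {
    "code_version": "h1b-engine@2026.05.01-h1b-1",
    "signal_thresholds": {"critical": 5000, "high": 1000, "medium": 250},
}
digest = manifest_digest(example)
```

Any change to a threshold or version string changes the digest, so a mismatch signals that the replay did not run under the recorded conditions.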

Reality check. A statistical wage shortfall on a single LCA is not an adjudicated underpayment. The value of h1b-engine for the institutional buyer is not a "list of bad sponsors"; it is a "list of cohort outliers, with each finding's regulatory pathway, FDNS / WHD referral packet, and back-wage exposure pre-computed". Ava layers on top, separating data-entry errors from material wage shortfalls and from systemic body-shop patterns. Customer engagements add the customer's own payroll records, immigration filings, and worksite data; the full pipeline runs the same checks against them, plus the BLS QCEW revenue-ratio cross-join.
Built on the JIL Settlement Engine

One kernel. Eight industries. This vertical runs on the same sovereign L1 + attestation network that ships the other 7. Kernel age: 18+ months. Adding a vertical: ~1 week. Competitor moat: build the kernel first.
