Wage shortfalls, body-shop sponsors, and NAICS-volume anomalies surfaced from the DOL OFLC LCA register.
h1b-engine ingested the DOL OFLC LCA Disclosure register and ran three deterministic checks: wage-below-prevailing on certified LCAs, employer-concentration to flag body-shop sponsorship patterns, and NAICS volume anomaly to flag targeted-industry footprints. The output is CREB-ready for DOL Wage and Hour back-wage filings, USCIS FDNS site-visit referrals, and FLSA collective-action packets.
DOL OFLC LCA Disclosure register, real federal data.
Source. The Department of Labor Office of Foreign Labor Certification publishes the LCA Disclosure register quarterly at dol.gov/agencies/eta/foreign-labor/performance. Each row is a single Labor Condition Application: employer, FEIN, worksite, SOC code, NAICS code, wage rate offered, prevailing wage, decision, decision date.
What we ingested. Two recent quarters of DOL OFLC LCA disclosures (FY2024 Q3 plus FY2024 Q4 by default) plus a synthetic top-30-sponsor cohort sized to the standard sponsor leaderboard. Loaded into h1b.lca_disclosures with a deterministic case_number primary key for replay.
What the engine does. Three checks run against the ingested data:
- h1b_wage_below_prevailing - certified LCAs where the offered wage is below the prevailing wage on the same unit basis after annualization.
- h1b_employer_concentration - sponsors with high LCA volume, broad SOC dispersion, and multi-state worksites - the canonical body-shop / third-party-placement signature.
- h1b_naics_anomaly - NAICS codes with abnormally high LCA volume in the ingest window. Placeholder check ahead of the BLS QCEW revenue-ratio cross-join.
Why this matters. 20 CFR 655.731 requires the employer to pay the higher of the actual wage paid to similarly-employed workers or the prevailing wage. INA section 212(n) attestation is the legal predicate. A wage shortfall on a certified LCA is a DOL Wage and Hour Division back-wage claim and an FLSA violation. High-volume sponsors with broad SOC dispersion and multi-state worksites are the classic USCIS FDNS site-visit and DOL audit targeting pattern.
Sponsors at the top of the volume-and-dispersion list.
Each row below ran h1b_employer_concentration against the live ingested register. Severity scales with LCA count: critical at 5,000+, high at 1,000+, medium at 250+, low otherwise. The list is the canonical USCIS H-1B sponsor leaderboard - high volume is not by itself non-compliance, but the volume-and-dispersion signature is the FDNS targeting pattern.
| # | Sponsor | FEIN | LCA count | Workers | SOC codes | States | Tier 1 signals |
|---|---|---|---|---|---|---|---|
| Loading from h1b-engine /v1/findings ... | |||||||
Where the dollars sit.
h1b-engine produces a wage-shortfall finding per LCA where the offered wage is below the prevailing wage on the same unit basis after annualization. Severity scales with annualized aggregate exposure (annual deficit times total covered workers): critical at 1M+, high at 250K+, medium at 50K+, low under 50K.
_ critical
Aggregate annualized exposure 1M dollars or more per LCA. Largest cases drive the DOL Wage and Hour back-wage filings.
_ high
Annualized exposure 250K to 1M per LCA. Common in financial services and consulting cohorts.
_ medium / low
Annualized exposure under 250K per LCA. Long tail of smaller per-LCA shortfalls; aggregate by employer to produce the CREB.
What ships when an H-1B oversight customer engages.
The wage-shortfall and concentration findings on this page are the public-data baseline. h1b-engine ships three production checks gated on the customer profile lob = 'h1b_compliance_investigator'. Each check runs deterministically and produces sealed CREB output through the same orchestrator and Ava layer that powers the rest of the platform.
h1b_wage_below_prevailing
Certified LCAs with offered wage below prevailing wage on the same unit basis after annualization. 20 CFR 655.731, INA section 212(n), DOL WHD / FLSA.
h1b_employer_concentration
High-volume sponsors with broad SOC dispersion and multi-state worksites - the body-shop / third-party-placement signature. USCIS FDNS, INA section 214(c)(14), DOL WHD H-1B audit framework.
h1b_naics_anomaly
NAICS codes with anomalously high LCA volume. Placeholder ahead of the BLS QCEW revenue-ratio cross-join. USCIS FDNS targeted-industry program.
What the customer takes to the regulator or to court.
One wage-shortfall finding, rendered as a sealed CREB record. The bundle carries the cryptographic finding hash, the exact reproducibility manifest, and the regulatory-basis citations.
Deterministic, reproducible, court-defensible.
Deterministic
Each check is a SQL aggregate over a public federal dataset. Same DOL OFLC release, same column normalization, same wage-unit annualization, same finding cohort, every run.
No external LLM
The Tier 1 verdict path is rule-based. Ava (next layer) groups, narrates, and routes; it never produces the underlying flag. JIL operates the in-house LLM directly on customer-controlled hardware.
Replay manifest
Every CREB carries the source-dataset hash, code version, ingest-script version, query plan, and signal thresholds. A third party with the same DOL release replays the analysis bit-identically.
One kernel. Eight industries. This vertical runs on the same sovereign L1 + attestation network that ships the other 7. Kernel age: 18+ months. Adding a vertical: ~1 week. Competitor moat: build the kernel first.