Two digits on a W-2 survive every downstream check most lenders run today
Mortgage processors, auto-loan underwriters, and tenant-screening agents trust the W-2 the applicant brings as primary income proof. The IRS rarely sees the PDF; the lender does. When the applicant edits Box 1 (Wages, tips, other compensation) before uploading, the wrong figure is the figure the file is built on.
htpbe? analyzes the structural layer of the PDF file — the layer that records every edit, even invisible ones. We don’t inspect holograms, phone photos, or ID biometrics. If your fraud problem is a digitally altered or fabricated W-2, we’re the most specific tool for it.
When htpbe? returns INCONCLUSIVE on a W-2, that’s itself a fraud signal in this context — real W-2 exports always come from a US payroll system, never from a desktop tool.
One REST call, one deterministic verdict
Upload the PDF. The API returns INTACT, MODIFIED, or INCONCLUSIVE with named markers — in about three seconds.
How fake and tampered W-2 PDFs actually look
Three real fraud mechanics we catch at the structural PDF layer.
Real W-2 edited after employer issuance
Authentic W-2 comes from a US payroll system (ADP, Paychex, Gusto, QuickBooks Payroll, Workday, Rippling, BambooHR Payroll, Square Payroll, Justworks). The applicant downloads it, opens it in any PDF editor or spreadsheet, edits Box 1 (Wages) or Box 5 (Medicare wages), exports as PDF. The producer field changes from the payroll engine to whichever editor was used.
W-2 fabricated in Word from a template
A W-2-shaped PDF authored in Word using the IRS form layout copied from screenshots, populated with a desired employer and Box 1, exported. The producer is Microsoft Word; the structured payroll-system metadata authentic W-2s carry is missing entirely.
Box arithmetic broken after edit
When Box 1 (Wages) gets edited up, the dependent boxes — Federal income tax withheld, Social Security wages, Medicare wages, state wages — usually do not get touched. The arithmetic relationship breaks. Combined with structural edit markers, the verdict is unambiguous.
The scale
Why your existing checks miss this
Day 1 Certainty verifies the data with the source. It does not cover every borrower or every doc.
Both layers matter. The PDF the borrower uploaded is what your underwriter opens.
Fannie Mae Day 1 Certainty and Freddie Mac AIM verify income data with payroll providers when the borrower’s employer participates and the borrower consents. Coverage is partial — many employers and borrowers fall outside. VOE services call the employer directly, but that step is downstream and slow. Equifax and credit bureaus verify identity and credit, not document integrity. htpbe? catches the W-2 PDF the borrower uploaded at the moment of intake — standalone, no IRS API, no payroll-provider integration required.
Five forensic layers, one deterministic verdict
Every PDF we receive passes through the same structural pipeline — no model training, no thresholds to tune.
Metadata analysis
Creation and modification timestamps, producer and creator fields, XMP metadata — the first layer exposes basic tampering.
File structure
Xref tables, trailer chain, incremental updates. Any edit after export leaves a structural fingerprint here.
Digital signatures
Signature chain integrity and post-signature modifications produce deterministic markers. Certainty-level signal.
Content integrity
Fonts, objects, embedded content, page assembly. Multi-session edits and inserted objects are visible at this layer.
Verdict with markers
Deterministic output: INTACT / MODIFIED / INCONCLUSIVE, with named markers for every finding — suitable for audit trail.
W-2 and adjacent US income-proof PDFs we check
Every type listed below is analyzed at the structural file layer — not the rendered image.
Detection capabilities
Deterministic structural signals. No probabilistic scores, no model training.
Producer signature mismatch
Authentic W-2s carry the producer signature of US payroll software. When the producer is Microsoft Excel, Microsoft Word, LibreOffice, Chrome Headless, or a generic PDF library, the document was edited or fabricated on a desktop — it did not come fresh from the payroll engine.
Incremental update trail
A clean payroll export has one cross-reference table. Re-saves through any editor append a second xref — visible structural evidence of post-issuance editing.
Box arithmetic verification
The relationship between Box 1, Box 2 (Federal income tax withheld), Box 3 (Social Security wages), Box 5 (Medicare wages), and state wages is verified row by row. Edited boxes break the chain unless every dependent field is also adjusted.
Modification timestamp gap
A real W-2 issued by January 31 has CreationDate ≈ ModDate. A months-later modification on a "freshly issued" W-2 is a high-confidence flag for post-export editing.
Font subset divergence across pages
Multi-session edits leave font subset prefix shifts. Single-session legitimate exports have consistent subsets across all pages.
Image-stream artefacts in fabricated W-2s
Fabricated W-2s often paste the IRS form layout from screenshots. Pasted image streams carry different compression characteristics than authentic embedded forms — a structural fingerprint of fabrication.
Two HTTP calls to verify any W-2
Buyers can skip this section — developers, the integration is two HTTP calls.
Step 1 — submit the PDF
curl -X POST https://api.htpbe.tech/v1/analyze \
-H "Authorization: Bearer $HTPBE_API_KEY" \
-H "Content-Type: application/json" \
-d '{"url": "https://your-storage/borrower-w2-2024.pdf"}'Step 2 — read the verdict
{
"id": "w1u2s3a4-5w6t-7w8o-9p0d-f1g2h3i4j5k6",
"status": "modified",
"modification_confidence": "high",
"modification_markers": [
"Spreadsheet producer detected (Microsoft Excel)",
"Two cross-reference tables — incremental update",
"Modification date 7 months after creation date"
],
"producer": "Microsoft Excel",
"creator": "ADP Workforce Now (original)",
"creation_date": 1706659200,
"modification_date": 1726272000,
"has_digital_signature": false,
"xref_count": 2,
"has_incremental_updates": true
}Original came from ADP Workforce Now in late January (the IRS deadline). Then seven months later it was opened in Microsoft Excel and re-saved — adding a second xref table. Verdict: modified at high confidence. The borrower edited a real W-2 after employer issuance.
Customer Stories
Teams that stopped document fraud
Compliance, finance, and risk teams use htpbe? to catch manipulated PDFs before they become costly mistakes.
Caught an invoice where the total had been changed by less than a thousand dollars. Without this I would have approved it without a second look.
Sarah M.
AP Manager
United States
We had three applicants in the same week with bank statements that looked completely fine. Two of them were flagged as modified. You simply cannot see this by reading the document — it is in the file structure.
Lars V.
Risk Analyst, Online Lending
Netherlands
Salary slips were coming with altered figures. We identified two problematic files before the placement was finalised.
Priya K.
HR Operations Lead
India
Since we started checking documents this way, we stopped two applications early in the process that would have been very difficult to reverse later.
Julien R.
Fraud Analyst, Fintech
France
Some applicants were sending PDFs that looked authentic but had been edited in ways not visible to the eye. We now ask for verified originals when something is flagged. Already saved us from a few bad decisions.
Marta S.
Compliance Coordinator
Spain
One invoice was caught because there was a mismatch between the document dates and structure. That particular case would have cost us significantly.
Tariq A.
Finance Manager
United Arab Emirates
Frequently asked questions
modified or inconclusive with producer-mismatch and missing-metadata flags.Related solutions and guides
Mortgage Underwriting
W-2 + tax return + bank statement forensics for US mortgage origination and broker workflows.
Fake 1099 Detection
Sister page — same forensics for the 1099 PDF (NEC, MISC, INT, DIV, K, R).
Fake Pay Stub Detection
Same forensics for monthly pay stubs submitted alongside W-2 as supporting income proof.
Fake Asset Letter Detection
Asset and gift letter forensics — same cluster of US mortgage income docs.
Secure your workflow
Create your account — API key on signup, free test environment on every plan.
From $15/mo. No sales call. Cancel any time.