logologo
  • How it works
  • Why It Matters
  • Statistics
  • Pricing
  • API
logologo
  • How it works
  • Why It Matters
  • Statistics
  • Pricing
  • API
HTPBE?

Structural PDF tamper detection API. Catches edits your KYC stack misses.

Product

  • How It Works
  • Why It Matters
  • Use Cases
  • Pricing

Developers

  • API Reference
  • GitHub/docs
  • Changelogv2.23.1

Resources

  • FAQ
  • Blog
  • Comparisons
  • Legal & Imprint

© 2024–2026 TMI Iurii Rogulia · VAT ID: FI29845875 · Made in Finland 🇫🇮

Status

Algorithm v2.23.1

Tool profile

wkhtmltopdf

wkhtmltopdf appears on both legitimate first-generation output and downstream re-save flows — context (the other tool on the same document) is what flips the signal.

Back to all statistics
Forensic verdict

Mixed signal

Based on 368 appearances across the HTPBE? corpus.

Modification rate
6%-42pp below baseline
Corpus baseline: 48%
Total appearances
368
1.8% of corpus
Modification rate
6%
-42pp below baseline
Role split
100%C/0%P
Creator vs Producer share of appearances

Corpus profile

How wkhtmltopdf shows up in HTPBE? corpus

wkhtmltopdf is an open-source HTML-to-PDF converter built on the WebKit rendering engine. It is used by many internal tools to generate reports from HTML templates.

wkhtmltopdf is genuinely the original generator for many small-business invoices and reports. The signal arises only when wkhtmltopdf is the latest Producer on a document whose Creator was an institutional source (banks, payroll providers, government forms) — that combination does not occur in legitimate first-generation output.

The signal
wkhtmltopdf is genuinely the original generator for many small-business invoices and reports.

Role in the workflow

How wkhtmltopdf shows up in metadata

Every PDF carries a Creator (the application that produced the original document) and a Producer (the engine that wrote the PDF). The same tool can appear in either slot, with very different modification profiles.

CAs Creator · 100%
As Producer · 0%P
CAs Creator
  • Usage
    368
  • Modification rate
    6%
  • Avg file size
    110 KB
PAs Producer
  • Usage
    0
  • Modification rate
    0%

How to read this

The Creator slot typically reflects where a document started life. The Producer slot reflects whatever wrote the bytes — and is the field that gets overwritten when a PDF is opened, edited, and saved by a downstream tool.

A higher modification rate as Producer than as Creator usually means the tool is acting as a re-saver on documents that originated elsewhere. A higher rate as Creator points to fragile workflows around the original authoring app.

Name fingerprints

Also goes by

Different version strings and spellings observed for wkhtmltopdf in the wild. All are merged into the same canonical profile.

wkhtmltopdf 0.12.673.1%
wkhtmltopdf 0.12.418.8%
wkhtmltopdf 0.12.6.16.5%
wkhtmltopdf 0.12.50.5%
wkhtmltopdf 0.12.2.10.5%
wkhtmltopdf 0.12.1.20.5%

Why variants matter

The same tool publishes itself under 6 different metadata strings — version bumps, locale tags, build IDs. We canonicalize them so the corpus reflects one identity, not noise.

Most common
wkhtmltopdf 0.12.6
73.1% of appearances
Variant spread
6 distinct strings
Long-tail share: 26.9%
Observed range
8 Mar 2023 → 20 May 2026

Distributions

What ships alongside wkhtmltopdf

The PDF versions wkhtmltopdf writes when acting as Producer, and the other tools that appear in the same documents.

Common Producers when wkhtmltopdf is the Creator

Qt writes 98% of these files — that pairing is the Adobe-stack default for many institutional pipelines.

Qt97.6%
GPL Ghostscript0.5%
PyPDF0.5%

Related profiles

Tools you’ll see next to wkhtmltopdf

Other tools that frequently share metadata with wkhtmltopdf in the same documents. Each card links to its own forensic profile.

P98% co-occurrence
Qt
Appearances365
Mod rate6%
P1% co-occurrence
GPL Ghostscript
Appearances35
Mod rate43%
P1% co-occurrence
PyPDF
Appearances84
Mod rate4%

Long tail

Notable observations

Smaller cuts of the wkhtmltopdf corpus — useful context, but treat each row as a single data point rather than a strong signal.

Digitally signed
1

PDFs carrying at least one digital signature

Pages parsed
741
Oldest observed
8 Mar 2023 — about 3 years ago

Secure your workflow

Create your account — API key on signup, free test environment on every plan.
From $15/mo. No sales call. Cancel any time.

Start free — close the structural fraud gapSee pricing
Read API docs →