logo

Changelog

Latest updates and improvements to HTPBE

Latestv2.8.0
  • Introduced Layer 4.8 (strip variant): detects documents where pages were converted into many narrow horizontal image strips — an anti-forensic technique that destroys text extractability while preserving visual appearance
  • Fires automatically when 20+ full-width strips per page are present with no font resources; no corroboration required (no legitimate workflow produces this pattern)
  • Previously analyzed files may benefit from re-analysis
v2.7.0
  • Introduced Layer 4.8: detects documents where a page was physically printed, altered, rescanned, and reinserted alongside programmatically generated pages — a forgery pattern used to manipulate specific pages of bank statements, invoices, and contracts
  • Detects both grayscale and color full-page raster scans mixed with programmatic content; fires only when corroborated by a page-import tool signature (PDFlib+PDI, iText, PDFtk, etc.) or incremental updates, preventing false positives from legitimate scanned annexes
  • Previously analyzed files may benefit from re-analysis
v2.6.1
  • Fixed a false-negative in scan detection: binary image streams (CCITT Group 4, JPEG) occasionally contain byte sequences that match PDF text operators, causing scanned PDFs to be classified as institutional instead of "Cannot Verify". Detection now relies solely on the absence of font resources, which is the correct structural invariant.
v2.6.0
  • Introduced Layer 4.7: detects documents assembled from pages rendered in independent sessions — identified by the same typeface appearing with different font subset prefixes across page groups
  • Previously analyzed files may benefit from re-analysis
v2.5.3
  • Fixed false positive where linearized PDFs (ISO 32000-1 fast web view format) were incorrectly flagged as modified — the two-xref structure and small date gap are spec-required artifacts of the two-pass write process, not evidence of editing
v2.5.2
  • Fixed false positive where Microsoft Word documents were incorrectly flagged as design-tool forgeries due to a known font encoding artifact in Office PDF exports
v2.5.1
  • Fixed two parsing issues in Layer 4.6 that caused assembled multi-page documents to pass as intact
  • Previously analyzed assembled documents may benefit from re-analysis
v2.5.0
  • Fixed a parsing gap that caused certain non-standard PDF files to bypass stream-based analysis
  • Added detection of a known PDF editor that previously went undetected when its metadata was spoofed
  • Introduced Layer 4.6: detects documents assembled from multiple independently imported pages
  • Previously analyzed files may benefit from re-analysis
v2.4.0–2.4.2
  • Fixed false "scanned document" classification for PDF 1.5+ files with compressed object streams — fonts and text operators are now correctly detected in modern PDFs
  • Removed false-positive alpha-channel detection — PDFs with images using transparency are no longer incorrectly flagged
  • Fixed detection gap where a removed digital signature could go undetected in certain full-rebuild document forgeries
  • Added Adobe Photoshop, GIMP, and Inkscape to consumer-software origins — documents created in image editors now show "Cannot Verify" instead of a false modification result
  • Previously analyzed files may benefit from re-analysis
v2.3.0
  • Improved analysis consistency: all detection signals are now evaluated uniformly without special-case exceptions
  • Structural anomalies are now included in the analysis output
  • Reduced false positives for PDFs generated with certain XMP-only workflows
  • Previously analyzed files may benefit from re-analysis
v2.2.1
  • Fixed detection reliability for PDFs using modern compressed object streams (PDF 1.5+)
  • Resolved edge cases in content stream parsing and font subset analysis
  • Previously analyzed files may benefit from re-analysis
v2.2.0
  • Introduced detection of template-assembly document forgeries
  • Expanded coverage to identify composites built from design-tool templates
  • Previously analyzed files may benefit from re-analysis
v2.1.7–2.1.8
  • Improved detection accuracy: fixed rare false negatives where a real modification marker was missed
  • Improved post-signature tampering detection using cryptographic verification
  • Fixed false positives for PDFs with an invalid creation date that was still present in metadata
v2.1.6
  • Added "Cannot Verify" result for PDFs created with consumer software (Microsoft Office, LibreOffice, Apple Pages, etc.)
  • New origin detection: API now returns origin.type and origin.software fields
  • New primary status field in API: "intact", "modified", or "inconclusive"
  • Result page now shows a grey "Cannot Verify" badge and explanation for consumer-software PDFs
v2.1.5
  • Improved detection accuracy for Microsoft Office documents
  • Previously analyzed files may benefit from re-analysis
v2.1.2–2.1.4
  • Improved detection accuracy for Microsoft Word documents
  • Reduced false positives for common PDF creation tools
  • Previously analyzed files may benefit from re-analysis
v2.1.1
  • Improved compatibility with modern PDF formats (PDF 1.5+)
  • Enhanced verification for digitally signed documents; significantly reduced false positives on legitimately signed PDFs
v2.1.0
  • Redesigned the detection engine with a new approach to identifying document modifications
  • Improved accuracy by replacing simple metadata comparison with a more robust analysis method
  • Introduced detection of known PDF editing tools
  • Previously analyzed files may benefit from re-analysis
v2.0.4
  • Fixed timezone bug in date display (dates no longer shown in the future)
  • Improved UTC timestamp handling for accurate relative time display
v2.0.3
  • Fixed false positive bug in PDF modification detection
  • Improved accuracy for metadata analysis
v2.0.2
  • Added user dashboard with API key management
  • Implemented passwordless authentication (Google, GitHub, Magic Links)
  • Created billing and subscription management interface
v2.0.1
  • Enhanced API infrastructure with monthly quota management
  • Improved typography and visual consistency
  • Updated documentation for PDF metadata analysis
v2.0.0
  • Major platform update — rebuilt application infrastructure
  • Migrated to Turso database with Drizzle ORM for improved reliability
  • Comprehensive UX improvements across the application
  • Enhanced error handling and stability
v1.0.0
  • Initial public release
  • Core PDF analysis features