Algorithm Updates

Changelog

Latest updates and improvements to HTPBE?

v2.15.1 (Latest)
  • Reduced false positives in document-identifier analysis for server-rendered enterprise reports
  • Broadened forensic-tool metadata-tampering analysis to cover an additional class of self-written rewriter scripts
  • Reduced false positives in synthetic-scan detection for vector-rendered institutional statements
v2.15.0
  • Added cross-layer metadata-timestamp consistency check inside the embedded metadata stream
  • Improved precision of font-mapping evasion analysis to reduce false positives on legitimate multi-subset font embedding
  • Reduced false positives in template-assembly analysis on documents produced by certain markup-flattening tools
v2.14.0
  • Expanded multi-source assembly detection across additional structural layers
  • Reduced false positives in metadata-completeness analysis on enterprise document-composition platforms
  • Strengthened synthetic-scan detection against an additional class of forgeries
  • Added detection of structurally inconsistent tool-pipeline declarations in document metadata
  • Added detection of additional font-level anti-forensic post-processing
  • Added detection of additional desktop-tool flatten-injection patterns
  • Added detection of further fraud-kit metadata fingerprints
  • Added cross-layer color-model consistency checks
v2.13.25
  • Tightened verdict semantics for additional editor-workflow patterns
  • Reduced false positives in metadata-date comparison for certain native-export authoring tools
  • Improved origin classification for documents touched by a commercial desktop PDF editor
  • Strengthened synthetic-scan detection
  • Tightened identity-array suppression on additional open-source library variants
v2.13.24
  • Improved metadata-date parsing for documents from certain diagram-authoring tools that use a non-standard timezone-suffix notation
  • Improved reliability of metadata extraction for documents from certain image-pipeline tools whose metadata strings contain a non-printable trailing byte
  • Expanded font-forensic analysis with three new structural detectors targeting in-place editor field-replacement workflows
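The two metadata fixes in v2.13.24 can be illustrated with a tolerant parser for PDF date strings (`D:YYYYMMDDHHmmSS` followed by an optional offset). This is a minimal sketch, not HTPBE's implementation: the changelog does not say which timezone-suffix notation or which trailing byte was involved, so the accepted variants (`+02'00'`, `+0200`, `+02:00`, `Z`) and the function name `parse_pdf_date` are assumptions chosen for illustration.

```python
import re
from datetime import datetime, timedelta, timezone

# Date body plus an optional offset in several notations (illustrative set).
_PDF_DATE = re.compile(
    r"^(?:D:)?(\d{4})(\d{2})(\d{2})(\d{2})(\d{2})(\d{2})"
    r"(?:(Z)|([+-])(\d{2})[':]?(\d{2})?'?)?$"
)


def parse_pdf_date(raw: str) -> datetime:
    """Parse a PDF date string, tolerating stray non-printable characters."""
    # Drop non-printable characters such as a trailing NUL before matching.
    cleaned = "".join(ch for ch in raw if ch.isprintable()).strip()
    match = _PDF_DATE.match(cleaned)
    if match is None:
        raise ValueError(f"unrecognised PDF date: {raw!r}")
    year, month, day, hour, minute, second = (int(g) for g in match.groups()[:6])
    _z, sign, tz_hours, tz_minutes = match.groups()[6:]
    if sign is None:
        # "Z" or no offset at all: treated as UTC here (an assumption).
        tz = timezone.utc
    else:
        offset = timedelta(hours=int(tz_hours), minutes=int(tz_minutes or 0))
        tz = timezone(offset if sign == "+" else -offset)
    return datetime(year, month, day, hour, minute, second, tzinfo=tz)
```

With this sketch, `D:20240102030405+02'00'`, `D:20240102030405+02:00`, and `D:20240102030405+0200` followed by a NUL byte all parse to the same timezone-aware value.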
v2.13.23
  • Added new detections targeting additional classes of post-creation content modification
  • Strengthened cross-layer metadata analysis
  • Improved multi-source assembly detection
  • Reduced false positives in cross-layer metadata analysis on documents from enterprise variable-data publishing platforms
  • Reduced false positives in document-identifier-array analysis on outputs from certain enterprise form-rendering pipelines
  • Detection extended to documents that have passed through online PDF editing and conversion services
  • Reduced false positives in incremental-update analysis for documents whose update increments are inherent to the signing workflow
  • Detection extended to programmatic PDF processing libraries
  • Reduced false positives in cross-layer metadata analysis on documents containing branded design assets
  • Added detection of additional multi-source page assembly patterns
v2.13.22
  • Improved accuracy of text-operator counting on documents whose page content lives in compressed payload-bearing streams
  • Reduced false positives in sparse-text-overlay analysis on legitimate small-text documents
  • Reduced false positives in vector-outline text analysis on consumer browser-print outputs
v2.13.21
  • Improved accuracy of content-stream analysis on documents using nested page-content layouts
v2.13.20
  • Added detection of an additional template-field substitution forgery pattern
v2.13.19
  • Added detection of an additional synthetic-raster substitution pattern
  • Reduced false positives in mixed-origin page-assembly analysis on additional rendering pipelines
  • Suppressed redundant signals when a more specific marker already fires
  • Improved accuracy of font-duplication analysis on additional rendering pipelines
  • Reworked per-page font-set comparison; covers a previously-undetected multi-source assembly pattern
  • Extended generator-fingerprint coverage to additional programmatic page-assembly libraries
v2.13.18
  • Added detection of additional metadata-tampering tool markers
  • Added detection of additional post-processing tool artifacts
  • Added detection of script-injection patterns on top of re-emitted documents
  • Reduced false positives in graphics-state analysis on large single-pass exports
  • Reduced false positives in cross-metadata-stream date analysis
  • Reduced false positives in sparse-text-overlay analysis
  • Reduced false positives in text-content analysis for multi-byte font encodings
  • Refined verdict semantics for documents whose state cannot be authenticated structurally
v2.13.17
  • Reduced false positives in graphics-state analysis on documents containing embedded native-format roundtrip data streams
  • Reduced false positives in template-assembly, font-session, and producer-consistency analyses on documents from enterprise variable-data composition platforms
  • Reduced false positives in document-identifier analysis for documents from enterprise document-processing pipelines
  • Reduced false positives on documents carrying only legitimate digital-fill-and-sign overlays from browser-rendered base documents
v2.13.16
  • Reduced false positives in multi-source page assembly detection on documents from certain single-pass rendering pipelines
  • Reduced false positives in redaction analysis for documents containing decorative background fills
  • Reduced false positives in font duplication analysis for server-rendered enterprise reports
  • Improved accuracy of signature integrity analysis across reader-specific incremental save patterns
  • Improved accuracy of graphics-state analysis on documents containing non-page-content data streams
  • Improved accuracy of invisible-character analysis for documents from certain browser-rendered environments
  • Improved origin classification for documents from additional online form-editing platforms
v2.13.15
  • Refined verdict semantics for a class of documents whose structural origin cannot establish institutional authenticity on its own — the API response now includes actionable guidance to verify such documents with the issuing organisation
v2.13.14
  • Reduced false positives on documents processed through multi-party digital signing workflows
  • Reduced false positives in multi-source page assembly detection for certain office productivity output
  • Reduced false positives in incremental-update detection for certain office productivity output
v2.13.13
  • Reduced false positives in incremental-modification analysis for documents produced by standard print-stream-to-PDF pipelines
  • Reduced false positives for documents produced by an additional widely-used office suite whose normal library behaviour was previously misidentified
v2.13.12
  • Expanded recognition of tools used for document re-processing
  • Improved generator-fingerprint mismatch detection across the full set of metadata locations a forger may target
  • Added a metadata-toolkit consistency check that flags vendor-mismatched metadata packets
  • Reduced false positives in scan classification for certain programmatic certificate and report generators
  • Expanded structural fingerprint detection to cover an additional family of PDF editing tools — documents edited by tools in this family are now correctly identified as modified when the declared generator contradicts the structural evidence
v2.13.11
  • Expanded detection of document assembly patterns — additional structural markers are now identified when pages within a single document appear to originate from independent source files
  • Expanded structural analysis to cover additional character encoding patterns — certain manipulation techniques used to alter document meaning without changing its visible appearance are now flagged
  • Added detection for incomplete redaction — documents where content marked for removal remains present and recoverable in the file structure are now identified
  • Expanded structural consistency checks to cover additional page-level properties that can indicate post-creation document assembly
  • Added detection of internal timestamp inconsistencies between document objects and document-level metadata
v2.13.10
  • Improved metadata extraction for documents that contain embedded image resources with their own metadata — preventing spurious metadata inconsistency markers
  • Expanded recognition of hardware scanner devices — documents produced by an additional family of multifunction printer units are now correctly classified as inconclusive rather than intact
  • Improved structural classification of scanner output that has been lightly post-processed — these are now correctly classified as inconclusive
v2.13.9
  • Expanded recognition of hardware scanner and multifunction printer devices — documents produced by an additional family of office scanners are now correctly classified as inconclusive
  • Improved structural scan detection across a broader range of scanner firmware variants
  • Improved handling of web-optimised (linearized) PDFs — reducing false positives for documents optimised for fast web delivery
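For readers unfamiliar with linearized files: a web-optimised PDF carries a linearization parameter dictionary containing a `/Linearized` key near the start of the file. The sketch below is a rough heuristic for spotting such files, assuming the raw bytes are already in memory. The function name `looks_linearized` is hypothetical, and the changelog does not describe how HTPBE itself recognises these documents.

```python
def looks_linearized(data: bytes) -> bool:
    """Heuristic: does the file start like a linearized (fast-web-view) PDF?"""
    # The linearization dictionary is expected within the first 1024 bytes.
    head = data[:1024]
    return head.startswith(b"%PDF-") and b"/Linearized" in head
```

A full check would parse the first indirect object rather than search for the key, so this is only a quick pre-filter.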
v2.13.8
  • Reduced false positives for documents created with certain web-based design tools — a structural identifier pattern that is always produced by their automated export pipeline no longer incorrectly signals post-creation modification when no other evidence is present
  • Reduced false positives for a class of programmatically generated documents where a graphics state asymmetry is a known generator artifact rather than evidence of post-creation stream editing
  • Improved robustness of browser-origin rendering detection by scanning full page content streams, closing a gap where lengthy preludes could mask the signal
  • Added recognition of an additional online document workflow platform — documents exported by its automated HTML-to-PDF pipeline are now correctly classified as inconclusive rather than modified, since no structural integrity guarantees apply to browser-rendered output
  • Improved cross-field metadata consistency check to correctly handle documents whose titles or author names contain non-ASCII characters — eliminates a class of false positives for internationalized documents
v2.13.7
  • Reduced false positives in identifier-pattern analysis on documents generated by enterprise reporting frameworks
  • Extended the above fix to additional members of the same document generation library family across different programming languages and forks
  • Reduced false positives in metadata cross-field consistency checks for known generator artifacts
  • Improved coverage of graphics-state balance analysis on documents with multi-stream page content
  • Extended detection of residual document structures inherited from prior templates in rebuilt single-revision files
v2.13.6
  • Significantly expanded recognition of design, publishing, and editing tools — documents created with a broader range of non-institutional applications now correctly return inconclusive instead of intact
  • Added detection for additional online document editing tools
  • Improved self-check sampling to exclude documents already classified as inconclusive, reducing false discrepancy reports
v2.13.5
  • Improved detection of documents assembled from pages of different origins — more cases are now correctly identified as modified
  • Improved detection of documents where encoding characteristics of embedded images are inconsistent across pages — a structural indicator of post-creation assembly
  • Improved detection of documents rebuilt from scratch by editing tools — a structural identifier inconsistency now correctly signals post-creation modification
v2.13.4
  • Improved classification of documents that appear to be scanned images of physical pages — these now consistently return inconclusive regardless of the declared software origin
  • Improved recognition of additional scanner device types
  • Improved detection of documents with evidence of post-creation text editing
  • Improved detection of documents where identifying metadata has been replaced
v2.13.3
  • Refined verdict semantics for documents that lack a sufficient temporal baseline for authenticity analysis
  • Fixed false positives for documents generated by server-side browser automation — these were incorrectly classified as consumer software
  • Improved detection of additional metadata-tampering patterns
  • Improved detection of selectively-edited tool-identity fields
v2.13.2
  • Reduced false positives in graphics-state analysis on documents containing payload-bearing binary streams
  • Reduced false positives in graphics-state analysis on text content containing characters that coincide with operator codes
  • Reduced false positives in metadata contradiction detection on documents using non-standard string encoding
  • Reduced false positives for documents generated by enterprise print pipelines with non-standard file framing
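One plausible source of the second false positive above is text that contains the letters `q` or `Q`, which are also the PDF operators that save and restore the graphics state. The sketch below shows the general idea of counting those operators only outside literal strings. It is an illustration under that assumption, not HTPBE's detector; the function names are hypothetical, and it deliberately ignores comments and inline image data.

```python
def strip_literal_strings(content: bytes) -> bytes:
    """Replace PDF literal strings "( ... )" with a space, honouring escapes."""
    out = bytearray()
    depth = 0  # nesting depth of unescaped parentheses
    i = 0
    while i < len(content):
        c = content[i]
        if depth > 0:
            if c == 0x5C:  # backslash: skip it and the escaped byte
                i += 2
                continue
            if c == 0x28:  # "(" nested inside the string
                depth += 1
            elif c == 0x29:  # ")" closes one nesting level
                depth -= 1
            i += 1
            continue
        if c == 0x28:
            depth = 1
            out.append(0x20)  # keep neighbouring tokens separated
        else:
            out.append(c)
        i += 1
    return bytes(out)


def graphics_state_balance(content: bytes) -> int:
    """Return (#q - #Q), counting only standalone operators outside strings."""
    tokens = strip_literal_strings(content).split()
    return tokens.count(b"q") - tokens.count(b"Q")
```

For a stream such as `q BT /F1 12 Tf (Q and q inside text) Tj ET Q`, a naive whitespace split counts the `q` inside the string and reports an imbalance, while the string-aware version reports a balance of zero.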
v2.13.1
  • Reduced false positives in graphics-state analysis on documents with embedded font programs
v2.13.0
  • Added structural detection of scanned documents based on image placement geometry
  • Added detection of invisible text overlay patterns associated with OCR processing
  • Added detection of content edited in a document editor after initial generation
  • Added detection of byte-level structural manipulation in document files
  • Added detection of post-modification inconsistencies in optimized document structure
v2.12.0
  • Added detection of binary image substitution in scanned documents
v2.11.9
  • Improved detection of documents rendered by a browser print pipeline
  • Improved parsing of non-standard date formats in document metadata
v2.11.8
  • Improved tool identity checks
v2.11.7
  • Expanded online converter recognition
v2.11.6
  • Improved detection of inconsistencies between metadata layers
v2.11.5
  • Reduced false positives for documents with incomplete metadata
  • API is now available at the dedicated subdomain api.htpbe.tech/v1
  • The previous base URL (htpbe.tech/api/v1) continues to work — no migration required
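A client can keep both base URLs from the v2.11.5 entry in one place and switch between them without touching call sites. The sketch below assumes HTTPS, which the changelog does not state, and uses a hypothetical endpoint path (`analyze`) purely for illustration; only the two host/path prefixes come from the entry above.

```python
from urllib.parse import urljoin

# Base URLs from the changelog entry; the https scheme is an assumption.
API_BASE = "https://api.htpbe.tech/v1/"
LEGACY_API_BASE = "https://htpbe.tech/api/v1/"


def endpoint(path: str, base: str = API_BASE) -> str:
    """Join a relative endpoint path onto a base URL."""
    # Strip a leading slash so urljoin keeps the "/v1/" prefix of the base.
    return urljoin(base, path.lstrip("/"))
```

Calling `endpoint("analyze")` targets the dedicated subdomain, and passing `base=LEGACY_API_BASE` builds the equivalent URL on the previous base.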
v2.11.4
  • Expanded consumer software recognition
v2.11.3
  • Print-to-PDF documents are now correctly classified as consumer software origin
v2.11.2
  • Reduced false positives in metadata-date comparison for generators that emit a small intra-session timestamp gap
v2.11.1
  • Expanded the list of office software recognized as consumer origin — previously unrecognized editors now correctly return inconclusive instead of intact
v2.11.0
  • Added detection of structurally impossible metadata dates
  • Added detection of minimal incremental updates consistent with metadata-only tampering
  • Improved detection reliability for modern PDF formats (PDF 1.5+)
  • Improved detection of digitally signed documents with long-term validation data
  • Reduced false positives on legitimately signed documents
  • Encrypted PDFs now return a clear error instead of an unreliable result
  • Previously analyzed files may benefit from re-analysis
v2.10.0
  • Fixed misclassification of several server-side PDF generation tools as consumer software — documents generated by institutional automation pipelines were incorrectly returned as inconclusive
  • Improved distinction between browser-based consumer printing and programmatic server-side rendering pipelines that share underlying rendering technology
  • Previously analyzed files from affected institutional pipelines may benefit from re-analysis
v2.9.0
  • Introduced detection of mixed-origin page assembly based on structural rendering pipeline characteristics detectable at the content stream level
  • Documents with pages of confirmed mixed rendering origin are now flagged as modified when corroborated by additional structural evidence
  • Previously analyzed files may benefit from re-analysis
v2.8.0
  • Added detection of an additional anti-forensic rasterization pattern used to destroy text extractability while preserving visual appearance
  • Previously analyzed files may benefit from re-analysis
v2.7.0
  • Added detection of an additional mixed-origin page-assembly forgery pattern
  • False-positive guards prevent flagging of legitimate scanned annexes
  • Previously analyzed files may benefit from re-analysis
v2.6.1
  • Improved scan classification reliability: fixed a false negative where certain compressed image streams could cause scanned PDFs to be misclassified as institutional instead of inconclusive
v2.6.0
  • Introduced detection of documents assembled from pages rendered in independent sessions
  • Previously analyzed files may benefit from re-analysis
v2.5.3
  • Reduced false positives for documents using the ISO 32000-1 fast-web-view format
v2.5.2
  • Reduced false positives in design-tool forgery detection for documents from certain office productivity software
v2.5.1
  • Fixed two parsing issues in assembled-document detection that caused certain multi-page documents to pass as intact
  • Previously analyzed assembled documents may benefit from re-analysis
v2.5.0
  • Fixed a parsing gap that caused certain non-standard PDF files to bypass stream-based analysis
  • Added detection of an additional PDF editing tool that previously evaded fingerprinting
  • Introduced detection of documents assembled from multiple independently imported pages
  • Previously analyzed files may benefit from re-analysis
v2.4.0–2.4.2
  • Fixed false "scanned document" classification for modern PDF formats (PDF 1.5+)
  • Removed false-positive alpha-channel detection — PDFs with images using transparency are no longer incorrectly flagged
  • Closed a detection gap where signature-removal could go undetected in certain re-emitted documents
  • Expanded consumer-software origin recognition to include common image and design editors
  • Previously analyzed files may benefit from re-analysis
v2.3.0
  • Improved analysis consistency: all detection signals are now evaluated uniformly without special-case exceptions
  • Structural anomalies are now included in the analysis output
  • Reduced false positives for PDFs generated with certain XMP-only workflows
  • Previously analyzed files may benefit from re-analysis
v2.2.1
  • Fixed detection reliability for PDFs using modern compressed object streams (PDF 1.5+)
  • Resolved edge cases in content stream parsing and font subset analysis
  • Previously analyzed files may benefit from re-analysis
v2.2.0
  • Introduced detection of template-assembly document forgeries
  • Expanded coverage to identify composites built from design-tool templates
  • Previously analyzed files may benefit from re-analysis
v2.1.7–2.1.8
  • Improved detection accuracy: fixed rare false negatives where a real modification marker was missed
  • Improved post-signature tampering detection using cryptographic verification
  • Fixed false positives for PDFs with an invalid creation date that was still present in metadata
v2.1.6
  • Added "Cannot Determine" result for PDFs created with consumer office and word-processing software
  • New origin detection: API now returns origin.type and origin.software fields
  • New primary status field in API: "intact", "modified", or "inconclusive"
  • Result page now shows a grey "Cannot Determine" badge and explanation for consumer-software PDFs
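A consumer of the API might handle the fields introduced in v2.1.6 along these lines. This is a sketch under assumptions: it takes an already-decoded JSON object, assumes `origin` is a nested object with `type` and `software` keys (inferred from the dotted field names), and uses made-up sample values. Only the field names and the three status values come from the entries above.

```python
from typing import Any

# The three status values listed in the changelog entry.
KNOWN_STATUSES = {"intact", "modified", "inconclusive"}


def summarize(result: dict[str, Any]) -> str:
    """Build a one-line summary from a decoded API response."""
    status = result.get("status")
    if status not in KNOWN_STATUSES:
        raise ValueError(f"unexpected status: {status!r}")
    # Assumed shape: {"origin": {"type": ..., "software": ...}}
    origin = result.get("origin") or {}
    origin_type = origin.get("type", "unknown")
    software = origin.get("software", "unknown")
    return f"{status} (origin: {origin_type}, software: {software})"
```

Rejecting unknown status values makes a future change to the field surface as an explicit error instead of being treated as one of the three known verdicts.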
v2.1.5
  • Improved detection accuracy for documents from certain office productivity software
  • Previously analyzed files may benefit from re-analysis
v2.1.2–2.1.4
  • Improved detection accuracy for documents from certain word-processing software
  • Reduced false positives for common PDF creation tools
  • Previously analyzed files may benefit from re-analysis
v2.1.1
  • Improved compatibility with modern PDF formats (PDF 1.5+)
  • Enhanced verification for digitally signed documents; significantly reduced false positives on legitimately signed PDFs
v2.1.0
  • Redesigned the detection engine with a new approach to identifying document modifications
  • Improved accuracy by replacing simple metadata comparison with a more robust analysis method
  • Introduced detection of known PDF editing tools
  • Previously analyzed files may benefit from re-analysis
v2.0.4
  • Fixed timezone bug in date display (dates no longer shown in the future)
  • Improved UTC timestamp handling for accurate relative time display
v2.0.3
  • Fixed a false-positive bug in PDF modification detection
  • Improved accuracy for metadata analysis
v2.0.2
  • Added user dashboard with API key management
  • Implemented passwordless authentication (Google, GitHub, Magic Links)
  • Created billing and subscription management interface
v2.0.1
  • Enhanced API infrastructure with monthly quota management
  • Improved typography and visual consistency
  • Updated documentation for PDF metadata analysis
v2.0.0
  • Major platform update — rebuilt application infrastructure
  • Migrated to Turso database with Drizzle ORM for improved reliability
  • Comprehensive UX improvements across the application
  • Enhanced error handling and stability
v1.0.0
  • Initial public release
  • Core PDF analysis features