Algorithm Updates
Changelog
Latest updates and improvements to HTPBE?
Latestv2.15.1
- Reduced false positives in document-identifier analysis for server-rendered enterprise reports
- Broadened forensic-tool metadata-tampering analysis to cover an additional class of self-written rewriter scripts
- Reduced false positives in synthetic-scan detection for vector-rendered institutional statements
v2.15.0
- Added cross-layer metadata-timestamp consistency check inside the embedded metadata stream
- Improved precision of font-mapping evasion analysis to reduce false positives on legitimate multi-subset font embedding
- Reduced false positives in template-assembly analysis on documents produced by certain markup-flattening tools
v2.14.0
- Expanded multi-source assembly detection across additional structural layers
- Reduced false positives in metadata-completeness analysis on enterprise document-composition platforms
- Strengthened synthetic-scan detection against an additional class of forgeries
- Added detection of structurally inconsistent tool-pipeline declarations in document metadata
- Added detection of additional font-level anti-forensic post-processing
- Added detection of additional desktop-tool flatten-injection patterns
- Added detection of further fraud-kit metadata fingerprints
- Added cross-layer color-model consistency checks
v2.13.25
- Tightened verdict semantics for additional editor-workflow patterns
- Reduced false positives in metadata-date comparison for certain native-export authoring tools
- Improved origin classification for documents touched by a commercial desktop PDF editor
- Strengthened synthetic-scan detection
- Tightened identity-array suppression on additional open-source library variants
v2.13.24
- Improved metadata-date parsing for documents from certain diagram-authoring tools that use a non-standard timezone-suffix notation
- Improved reliability of metadata extraction for documents from certain image-pipeline tools whose metadata strings contain a non-printable trailing byte
- Expanded font-forensic analysis with three new structural detectors targeting in-place editor field-replacement workflows
v2.13.23
- Added new detections targeting additional classes of post-creation content modification
- Strengthened cross-layer metadata analysis
- Improved multi-source assembly detection
- Reduced false positives in cross-layer metadata analysis on documents from enterprise variable-data publishing platforms
- Reduced false positives in document-identifier-array analysis on outputs from certain enterprise form-rendering pipelines
- Detection extended to documents that have passed through online PDF editing and conversion services
- Reduced false positives in incremental-update analysis for documents whose update increments are inherent to the signing workflow
- Detection extended to programmatic PDF processing libraries
- Reduced false positives in cross-layer metadata analysis on documents containing branded design assets
- Added detection of additional multi-source page assembly patterns
v2.13.22
- Improved accuracy of text-operator counting on documents whose page content lives in compressed payload-bearing streams
- Reduced false positives in sparse-text-overlay analysis on legitimate small-text documents
- Reduced false positives in vector-outline text analysis on consumer browser-print outputs
v2.13.21
- Improved accuracy of content-stream analysis on documents using nested page-content layouts
v2.13.20
- Added detection of an additional template-field substitution forgery pattern
v2.13.19
- Added detection of an additional synthetic-raster substitution pattern
- Reduced false positives in mixed-origin page-assembly analysis on additional rendering pipelines
- Suppressed redundant signals when a more specific marker already fires
- Improved accuracy of font-duplication analysis on additional rendering pipelines
- Reworked per-page font-set comparison; covers a previously-undetected multi-source assembly pattern
- Extended generator-fingerprint coverage to additional programmatic page-assembly libraries
v2.13.18
- Added detection of additional metadata-tampering tool markers
- Added detection of additional post-processing tool artifacts
- Added detection of script-injection patterns on top of re-emitted documents
- Reduced false positives in graphics-state analysis on large single-pass exports
- Reduced false positives in cross-metadata-stream date analysis
- Reduced false positives in sparse-text-overlay analysis
- Reduced false positives in text-content analysis for multi-byte font encodings
- Refined verdict semantics for documents whose state cannot be authenticated structurally
v2.13.17
- Reduced false positives in graphics-state analysis on documents containing embedded native-format roundtrip data streams
- Reduced false positives in template-assembly, font-session, and producer-consistency analyses on documents from enterprise variable-data composition platforms
- Reduced false positives in document-identifier analysis for documents from enterprise document-processing pipelines
- Reduced false positives on documents carrying only legitimate digital-fill-and-sign overlays from browser-rendered base documents
v2.13.16
- Reduced false positives in multi-source page assembly detection on documents from certain single-pass rendering pipelines
- Reduced false positives in redaction analysis for documents containing decorative background fills
- Reduced false positives in font duplication analysis for server-rendered enterprise reports
- Improved accuracy of signature integrity analysis across reader-specific incremental save patterns
- Improved accuracy of graphics-state analysis on documents containing non-page-content data streams
- Improved accuracy of invisible-character analysis for documents from certain browser-rendered environments
- Improved origin classification for documents from additional online form-editing platforms
v2.13.15
- Refined verdict semantics for a class of documents whose structural origin cannot establish institutional authenticity on its own — the API response now includes actionable guidance to verify such documents with the issuing organisation
v2.13.14
- Reduced false positives on documents processed through multi-party digital signing workflows
- Reduced false positives in multi-source page assembly detection for certain office productivity output
- Reduced false positives in incremental-update detection for certain office productivity output
v2.13.13
- Reduced false positives in incremental-modification analysis for documents produced by standard print-stream-to-PDF pipelines
- Reduced false positives for documents produced by an additional widely-used office suite whose normal library behaviour was previously misidentified
v2.13.12
- Expanded recognition of tools used for document re-processing
- Improved generator-fingerprint mismatch detection across the full set of metadata locations a forger may target
- Added a metadata-toolkit consistency check that flags vendor-mismatched metadata packets
- Reduced false positives in scan classification for certain programmatic certificate and report generators
- Expanded structural fingerprint detection to cover an additional family of PDF editing tools — documents edited by tools in this family are now correctly identified as modified when the declared generator contradicts the structural evidence
v2.13.11
- Expanded detection of document assembly patterns — additional structural markers are now identified when pages within a single document appear to originate from independent source files
- Expanded structural analysis to cover additional character encoding patterns — certain manipulation techniques used to alter document meaning without changing its visible appearance are now flagged
- Added detection for incomplete redaction — documents where content marked for removal remains present and recoverable in the file structure are now identified
- Expanded structural consistency checks to cover additional page-level properties that can indicate post-creation document assembly
- Added detection of internal timestamp inconsistencies between document objects and document-level metadata
v2.13.10
- Improved metadata extraction for documents that contain embedded image resources with their own metadata — preventing spurious metadata inconsistency markers
- Expanded recognition of hardware scanner devices — documents produced by an additional family of multifunction printer units are now correctly classified as inconclusive rather than intact
- Improved structural classification of scanner output that has been lightly post-processed — these are now correctly classified as inconclusive
v2.13.9
- Expanded recognition of hardware scanner and multifunction printer devices — documents produced by an additional family of office scanners are now correctly classified as inconclusive
- Improved structural scan detection across a broader range of scanner firmware variants
- Improved handling of web-optimised (linearized) PDFs — reducing false positives for documents optimised for fast web delivery
v2.13.8
- Reduced false positives for documents created with certain web-based design tools — a structural identifier pattern that is always produced by their automated export pipeline no longer incorrectly signals post-creation modification when no other evidence is present
- Reduced false positives for a class of programmatically generated documents where a graphics state asymmetry is a known generator artifact rather than evidence of post-creation stream editing
- Improved robustness of browser-origin rendering detection by scanning full page content streams, closing a gap where lengthy preludes could mask the signal
- Added recognition of an additional online document workflow platform — documents exported by its automated HTML-to-PDF pipeline are now correctly classified as inconclusive rather than modified, since no structural integrity guarantees apply to browser-rendered output
- Improved cross-field metadata consistency check to correctly handle documents whose titles or author names contain non-ASCII characters — eliminates a class of false positives for internationalized documents
v2.13.7
- Reduced false positives in identifier-pattern analysis on documents generated by enterprise reporting frameworks
- Extended the above fix to additional members of the same document generation library family across different programming languages and forks
- Reduced false positives in metadata cross-field consistency checks for known generator artifacts
- Improved coverage of graphics-state balance analysis on documents with multi-stream page content
- Extended detection of residual document structures inherited from prior templates in rebuilt single-revision files
v2.13.6
- Significantly expanded recognition of design, publishing, and editing tools — documents created with a broader range of non-institutional applications now correctly return inconclusive instead of intact
- Added detection for additional online document editing tools
- Improved self-check sampling to exclude documents already classified as inconclusive, reducing false discrepancy reports
v2.13.5
- Improved detection of documents assembled from pages of different origins — more cases are now correctly identified as modified
- Improved detection of documents where encoding characteristics of embedded images are inconsistent across pages — a structural indicator of post-creation assembly
- Improved detection of documents rebuilt from scratch by editing tools — a structural identifier inconsistency now correctly signals post-creation modification
v2.13.4
- Improved classification of documents that appear to be scanned images of physical pages — these now consistently return inconclusive regardless of the declared software origin
- Improved recognition of additional scanner device types
- Improved detection of documents with evidence of post-creation text editing
- Improved detection of documents where identifying metadata has been replaced
v2.13.3
- Refined verdict semantics for documents that lack a sufficient temporal baseline for authenticity analysis
- Fixed false positives for documents generated by server-side browser automation — these were incorrectly classified as consumer software
- Improved detection of additional metadata-tampering patterns
- Improved detection of selectively-edited tool-identity fields
v2.13.2
- Reduced false positives in graphics-state analysis on documents containing payload-bearing binary streams
- Reduced false positives in graphics-state analysis on text content containing characters that coincide with operator codes
- Reduced false positives in metadata contradiction detection on documents using non-standard string encoding
- Reduced false positives for documents generated by enterprise print pipelines with non-standard file framing
v2.13.1
- Reduced false positives in graphics-state analysis on documents with embedded font programs
v2.13.0
- Added structural detection of scanned documents based on image placement geometry
- Added detection of invisible text overlay patterns associated with OCR processing
- Added detection of content edited in a document editor after initial generation
- Added detection of byte-level structural manipulation in document files
- Added detection of post-modification inconsistencies in optimized document structure
v2.12.0
- Added detection of binary image substitution in scanned documents
v2.11.9
- Improved detection of documents rendered by a browser print pipeline
- Improved parsing of non-standard date formats in document metadata
v2.11.8
- Improved tool identity checks
v2.11.7
- Expanded online converter recognition
v2.11.6
- Improved detection of inconsistencies between metadata layers
v2.11.5
- Reduced false positives for documents with incomplete metadata
- API is now available at the dedicated subdomain api.htpbe.tech/v1
- The previous base URL (htpbe.tech/api/v1) continues to work — no migration required
v2.11.4
- Expanded consumer software recognition
v2.11.3
- Print-to-PDF documents are now correctly classified as consumer software origin
v2.11.2
- Reduced false positives in metadata-date comparison for generators that emit a small intra-session timestamp gap
v2.11.1
- Expanded the list of office software recognized as consumer origin — previously unrecognized editors now correctly return inconclusive instead of intact
v2.11.0
- Added detection of structurally impossible metadata dates
- Added detection of minimal incremental updates consistent with metadata-only tampering
- Improved detection reliability for modern PDF formats (PDF 1.5+)
- Improved detection of digitally signed documents with long-term validation data
- Reduced false positives on legitimately signed documents
- Encrypted PDFs now return a clear error instead of an unreliable result
- Previously analyzed files may benefit from re-analysis
v2.10.0
- Fixed misclassification of several server-side PDF generation tools as consumer software — documents generated by institutional automation pipelines were incorrectly returned as inconclusive
- Improved distinction between browser-based consumer printing and programmatic server-side rendering pipelines that share underlying rendering technology
- Previously analyzed files from affected institutional pipelines may benefit from re-analysis
v2.9.0
- Introduced detection of mixed-origin page assembly based on structural rendering pipeline characteristics detectable at the content stream level
- Documents with pages of confirmed mixed rendering origin are now flagged as modified when corroborated by additional structural evidence
- Previously analyzed files may benefit from re-analysis
v2.8.0
- Added detection of an additional anti-forensic rasterization pattern used to destroy text extractability while preserving visual appearance
- Previously analyzed files may benefit from re-analysis
v2.7.0
- Added detection of an additional mixed-origin page-assembly forgery pattern
- False-positive guards prevent flagging of legitimate scanned annexes
- Previously analyzed files may benefit from re-analysis
v2.6.1
- Improved scan classification reliability — fixed a false-negative where certain compressed image streams could cause scanned PDFs to be misclassified as institutional instead of inconclusive.
v2.6.0
- Introduced detection of documents assembled from pages rendered in independent sessions
- Previously analyzed files may benefit from re-analysis
v2.5.3
- Reduced false positives for documents using the ISO 32000-1 fast-web-view format
v2.5.2
- Reduced false positives in design-tool forgery detection for documents from certain office productivity software
v2.5.1
- Fixed two parsing issues in assembled-document detection that caused certain multi-page documents to pass as intact
- Previously analyzed assembled documents may benefit from re-analysis
v2.5.0
- Fixed a parsing gap that caused certain non-standard PDF files to bypass stream-based analysis
- Added detection of an additional PDF editing tool that previously evaded fingerprinting
- Introduced detection of documents assembled from multiple independently imported pages
- Previously analyzed files may benefit from re-analysis
v2.4.0–2.4.2
- Fixed false "scanned document" classification for modern PDF formats (PDF 1.5+)
- Removed false-positive alpha-channel detection — PDFs with images using transparency are no longer incorrectly flagged
- Closed a detection gap where signature-removal could go undetected in certain re-emitted documents
- Expanded consumer-software origin recognition to include common image and design editors
- Previously analyzed files may benefit from re-analysis
v2.3.0
- Improved analysis consistency: all detection signals are now evaluated uniformly without special-case exceptions
- Structural anomalies are now included in the analysis output
- Reduced false positives for PDFs generated with certain XMP-only workflows
- Previously analyzed files may benefit from re-analysis
v2.2.1
- Fixed detection reliability for PDFs using modern compressed object streams (PDF 1.5+)
- Resolved edge cases in content stream parsing and font subset analysis
- Previously analyzed files may benefit from re-analysis
v2.2.0
- Introduced detection of template-assembly document forgeries
- Expanded coverage to identify composites built from design-tool templates
- Previously analyzed files may benefit from re-analysis
v2.1.7–2.1.8
- Improved detection accuracy: fixed rare false negatives where a real modification marker was missed
- Improved post-signature tampering detection using cryptographic verification
- Fixed false positives for PDFs with an invalid creation date that was still present in metadata
v2.1.6
- Added "Cannot Determine" result for PDFs created with consumer office and word-processing software
- New origin detection: API now returns origin.type and origin.software fields
- New primary status field in API: "intact", "modified", or "inconclusive"
- Result page now shows a grey "Cannot Determine" badge and explanation for consumer-software PDFs
v2.1.5
- Improved detection accuracy for documents from certain office productivity software
- Previously analyzed files may benefit from re-analysis
v2.1.2–2.1.4
- Improved detection accuracy for documents from certain word-processing software
- Reduced false positives for common PDF creation tools
- Previously analyzed files may benefit from re-analysis
v2.1.1
- Improved compatibility with modern PDF formats (PDF 1.5+)
- Enhanced verification for digitally signed documents; significantly reduced false positives on legitimately signed PDFs
v2.1.0
- Redesigned the detection engine with a new approach to identifying document modifications
- Improved accuracy by replacing simple metadata comparison with a more robust analysis method
- Introduced detection of known PDF editing tools
- Previously analyzed files may benefit from re-analysis
v2.0.4
- Fixed timezone bug in date display (dates no longer shown in the future)
- Improved UTC timestamp handling for accurate relative time display
v2.0.3
- Fixed false positive bug in PDF modification detection
- Improved accuracy for metadata analysis
v2.0.2
- Added user dashboard with API key management
- Implemented passwordless authentication (Google, GitHub, Magic Links)
- Created billing and subscription management interface
v2.0.1
- Enhanced API infrastructure with monthly quota management
- Improved typography and visual consistency
- Updated documentation for PDF metadata analysis
v2.0.0
- Major platform update — rebuilt application infrastructure
- Migrated to Turso database with Drizzle ORM for improved reliability
- Comprehensive UX improvements across the application
- Enhanced error handling and stability
v1.0.0
- Initial public release
- Core PDF analysis features