Algorithm Updates
Changelog
Latest updates and improvements to HTPBE?
Latestv2.29.0
- New detection class for documents that were reworked in an interactive PDF editor after their original creation — a common way a forged statement or letter is built, where an authentic document is opened in an editing application, a value is altered, and the file is saved back. Files reworked this way previously passed as untouched; they are now recognised as edited after the fact, surfaced under the same edited-in-an-editor outcome as our other content-editing checks.
- Reduced false positives in our scanned-page detection on a class of genuine programmatically-generated enterprise bills and statements that carry lightweight postal-mailing marks — the sorting and franking elements a print-and-mail run stamps onto an otherwise digital document. These thin printed marks were previously mistaken for a full-page scanned image, so authentic machine-issued documents were wrongly flagged as modified. The check now tells a sparse mailing mark apart from a genuine page-filling scan before it fires, so these legitimate bills pass — while a real full-page scan dropped into a digital document is still caught.
- Broadened our edited-in-an-editor detection to recover a further class of documents reworked after their original creation — the kind of post-creation editing behind a forged statement or report. The recovery works independently of which editing application was used, so this class is caught no matter the tool that produced it, and is surfaced under the existing edited-in-an-editor outcome.
- Reduced false positives on a class of genuine institutional financial and retirement statements that an enterprise pipeline composes automatically by assembling pre-rendered components, leaving the document’s descriptive metadata empty by design. That machine-composed shape previously tripped both our layered-revision check and our blanked-metadata check, so authentic composed statements were wrongly flagged as modified. We now recognise this composition pipeline by a validated fingerprint before standing those two flags down — and the allowance is deliberately narrow: it holds only while the document’s own internal structure proves nothing in it was rewritten after assembly, so a forger cannot fabricate the fingerprint to smuggle an edited file through, because the edit itself breaks the proof. These genuine statements no longer read as modified, while any real post-assembly change is still caught.
v2.28.0
- Added a new proprietary integrity signal to the detection suite. We are publicly acknowledging that this signal exists, but — unlike our other checks — we are deliberately not disclosing how it works. The detection method is held back on purpose so that forgers cannot read a description of it and engineer their way around it. When it fires, it is strong, corroborated evidence that the file is a modified derivative rather than an untouched original.
- This is the first check in the catalogue whose mechanism we keep undisclosed by design; every other check continues to be described in plain outcome terms.
v2.27.1
- Reduced false positives in our hidden-duplicate-text check on a class of genuine documents laid out as tables, where each visible cell is placed independently on the page. These distinct, fully-visible values were previously being misread as concealed text stacked in the same spot, and the document was wrongly marked modified. The check now judges where each value actually sits on the finished page before comparing, so separately-placed visible cells are no longer mistaken for one another — while a substitute value genuinely hidden underneath a visible one is still caught.
v2.27.0
- Reduced false positives in our creation-versus-modification timeline check on a class of genuine government and corporate forms generated by a server-side form-rendering pipeline. Documents of this kind carry an authoring-stage origin date that is naturally years apart from the date their final copy was produced — a normal fingerprint of how these forms are issued, not evidence of a later edit. This pipeline is now recognised alongside the other native-export rendering engines for which that gap is expected, so authentic forms are no longer mistaken for modified ones. The allowance stays deliberately narrow: it applies only to clean single-render documents whose independent timeline layers agree, so a fabricated file cannot borrow it to slip past detection, and genuine post-creation edits still fire.
- New detection class for documents rebuilt inside a graphics-design tool from a source file held locally on an operator’s own machine — a construction pattern that no institution uses to issue its statements, certificates, or forms, and strong evidence the file was reassembled by a person rather than produced by the named issuer.
- New detection class for documents whose metadata layer was regenerated by a separate tool in a different locale after the original authoring — a second-pass rewrite that a genuine single-pass generator never produces.
v2.26.1
- Broadened detection of documents assembled by merging pages from more than one source file — recovering a class of plainly-built merges that previously slipped through certified as a single original, including documents with extra pages glued on after the fact.
v2.26.0
- Retired the standalone document-identifier consistency check. Reviewed against a large corpus of genuine documents, the internal identifier records were found to differ legitimately on a single clean render across many established document generators — so on their own they could not reliably separate an authentic file from an edited one, and the check produced false positives on genuine documents while adding nothing the structural modification checks did not already provide. The retirement narrows nothing real: every modification this check could ever evidence is already covered independently by the structural layer.
- New detection class for an institutional document that was opened and re-saved by a second tool after its original creation, with no digital signature to account for the change — a post-creation re-stamp pattern that is now caught on its own structural evidence.
- Restored detection of documents assembled by merging pages from more than one source file — now recovered on its own structural evidence, including on merges that were subsequently rebuilt to look like a single original.
- New detection class for a stamp-overlay edit — substitute values layered on top of an otherwise-untouched original page to alter what it appears to say.
- New detection class for a cover-and-replace edit — original content concealed and replaced with substitute values to change what the page reads.
v2.25.3
- Reduced false positives on a class of genuine documents produced by physical scanning hardware whose output was previously being misread as a synthetically-rendered imitation of a scan — authentic scans of this kind now correctly settle at the not-certifiable scan-origin ceiling instead of being flagged as modified, while truly fabricated pseudo-scans are unaffected.
v2.25.2
- Widened the set of single-pass re-rendered documents we decline to certify as unmodified — files rebuilt in one pass discard the modification-history evidence an intact verdict depends on, so this class is now consistently treated as not-certifiable rather than passed as clean. Genuine institutional documents are unaffected, and structural tamper detectors still run independently.
- Recovered scanned-document recognition on a class of scanned files that were previously slipping through certified as clean — a scanned document can never be confirmed unmodified, so this class now correctly returns the not-certifiable ceiling rather than an intact verdict.
v2.25.1
- Reduced false positives in our tool-versus-structure consistency check on a class of legitimate institutional documents produced by a newer generation of an established generator — genuine fabricated-origin tampering is unaffected.
v2.25.0
- New detection class for documents that misrepresent which tool produced them — flagging origin that was rewritten to disguise where the document actually came from.
- Broadened generator-identity-forgery detection to a further class of documents whose stated origin contradicts the rest of the document.
- Reduced false positives in our generator-identity check on a class of legitimate institutional documents carrying an embedded company logo — identity belonging to a bundled graphic is no longer mistaken for the document’s own origin, while genuine document-level identity disagreement still fires.
v2.24.2
- Reduced false positives in our document-identifier consistency check on a class of legitimate single-pass institutional documents whose identifier records differ purely as a generation-time artifact — real identifier-mismatch tampering on edited documents is unaffected.
- Reduced false positives in our template-assembly detection on a class of legitimately print-rendered documents whose per-page font handling was being misread as the reuse signature of a pieced-together template — detection of genuine template-assembly forgeries still fires through its other independent signals.
v2.24.1
- Reduced false positives in our design-tool / template-assembly detection on a class of legitimate modern documents that were previously being mistaken for pieced-together templates — detection of genuine template-assembly forgeries is unaffected.
- Refined how the page-assembly analysis reads the document so a legitimate modern layout is no longer misread as reused template scaffolding, with real multi-source assembly still flagged through our other independent signals.
v2.24.0
- New detection class for documents whose internal structure references a component that was removed during a rebuild — a tell-tale of an embedded component quietly stripped out while the document was reassembled.
- Hardened structural-integrity checks to flag dangling internal references left behind when a document is rebuilt from an original, without relying on any single tool fingerprint.
v2.23.3
- Recovered tamper detection on a class of edited institutional documents — including documents altered well after their original issue date — that previously slipped through the timestamp-consistency check.
- Reduced false positives on legitimate enterprise report outputs, whose normal compile-to-render timing is no longer mistaken for post-export tampering.
v2.23.2
- Reduced false positives in our content-stream consistency analysis on a class of legitimate documents that were previously being mistaken for tampered files.
- Hardened how the consistency analysis interprets document content, so a class of legitimate files is no longer misread as inconsistent — detection of real inconsistencies is unaffected.
- Expanded the set of desktop PDF editors recognized as re-rendering tools, so documents passed through them are no longer treated as guaranteed originals — the integrity ceiling now matches our existing rule for other re-rendered document classes.
v2.23.1
- Withheld the integrity guarantee on a class of documents produced by operator-runnable rendering pipelines whose output is fully reproducible from a local template — aligns the verdict ceiling with the same rule already applied to other re-rendered document classes.
- Brought the algorithm into line with our previously-documented stance that this family of rendering pipelines cannot ground an integrity guarantee on its own.
- Withheld the integrity guarantee on a further class of programmatically-rendered outputs whose metadata profile reads as bare-defaults — characteristic of unattended document-generation scripts rather than first-party institutional pipelines.
v2.23.0
- New detection class for documents that impersonate a trusted institutional authoring pipeline — catches third-party rebuilds that overwrite the generator field to disguise the document as an institutional export
- Hardened identity-layer corroboration so a document claiming a high-trust origin must also carry the structural fingerprints that the genuine pipeline always leaves behind
v2.22.1
- Reduced false positives on a class of single-pass institutional documents that were previously being mistaken for incrementally re-edited files
- Tightened a re-write-detection rule so a known-legitimate internal layout pattern is no longer treated as evidence of post-creation edits, while the guard against genuine appended edits remains in place
v2.22.0
- New detection class for a font-layer inconsistency characteristic of non-genuine institutional documents
- Hardened the integrity ceiling for a class of re-rendered outputs whose edit history cannot be reconstructed
- Bypass-hardened a previously-trusted producer category against impersonation by reassembled documents
- Extended scanner-origin recognition to a producer-string variant we were missing — recovers correct origin classification for a class of smartphone-scanner outputs
- New detection class for synthetic edit-history patterns where a document claims to have been touched by an older authoring runtime than the one that originally produced it — a configuration that cannot arise from a normal single-device workflow
v2.21.1
- Withheld the integrity guarantee on a class of documents whose authenticity cannot be independently corroborated by our other detection layers.
- Recovered correct origin classification on a class of legitimately-generated documents that the previous heuristic was over-flagging.
v2.21.0
- Added detection for institutional documents re-printed after editing to flatten authoring history — recovered evidence of the underlying edit despite the re-print
- Added a toolchain-residue detector that catches documents whose declared generator contradicts the authoring residue carried elsewhere in the file — characteristic of re-rendering a document that originated in a different pipeline
- Extended typeface fingerprinting to catch fonts whose identity was hand-altered before being embedded in an overlay edit
- Recovered detection of declared file-size inconsistencies that were previously masked by an over-restrictive internal gate
- Removed two detector branches that did not produce unique findings — their coverage is preserved by adjacent markers
- Tightened a signature-workflow carve-out so it can no longer mask overlay-edit evidence
- Recovered standalone detection on metadata-layer timestamp contradictions that were previously suppressed
- Strengthened deliberate-blanking detection against a parsing edge case that previously hid the evidence
- Extended font-embedding expectations to an additional class of legitimate authoring pipelines
- Reduced false positives on enterprise reports generated by institutional reporting pipelines.
v2.19.3
- Reduced false positives in generator-fingerprint analysis on documents from browser-rendered print pipelines
- Reduced false positives in font-duplication analysis on enterprise reports whose typeface variants share a common family
- Reduced false positives in multi-source page assembly and template-assembly analysis on enterprise statement composition pipelines that embed pre-rendered design components
v2.19.2
- Bypass-hardened a previously-trusted generator category against impersonation
- Bypass-hardened the carve-out for legitimately old documents against trivial timestamp manipulation
- Refreshed the registry of online editing and conversion services consulted during origin classification
- Tightened the conditions under which the embedded-metadata timestamp signal can fire — reduces false positives on multi-pass enterprise document-assembly pipelines
- Recovered detection on a class of full-rebuild assembly patterns that previously escaped corroboration
v2.19.1
- Recovered tamper detection on documents where original authoring records were deliberately erased post-process
- Broadened cross-layer metadata-laundering analysis to cover a wider family of browser-rendered print pipelines
- Recovered multi-source assembly detection on shorter documents
- Hardened a previously-trusted raster-origin category against impersonation by editing tools
- Recovered detection on shorter documents flattened from multi-session edits
v2.19.0
- Retired four detection paths whose underlying signals could not be reliably distinguished from legitimate authoring behaviour — keeps the verdict surface honest and removes false-positive risk on bona-fide enterprise documents
- Simplified incomplete-redaction analysis to the single structurally-precise annotation path
v2.18.2
- Improved origin classification of documents produced by online HTML-rendering services and post-creation PDF reprocessing tools — both now resolve to software-origin and cannot certify an unmodified verdict
v2.18.1
- Improved origin classification of documents produced by server-side HTML-layout rendering pipelines — these now fall under software-origin and cannot resolve to an unmodified verdict
v2.18.0
- Added detection of editing history collapsed into a single revision to mask prior modification rounds
- Added detection of programmatically generated pages disguised as a captured scan
- Added detection of markup annotations attached to a base file that carries no creation history
- Improved origin classification accuracy for documents derived from screen-resolution raster sources
v2.17.1
- Improved coverage of editing-tool fingerprint analysis on additional document layouts
- Reduced false positives in embedded image analysis for certain enterprise customer-communications rendering pipelines
v2.17.0
- Added detection of documents presented as a captured image whose pixel content shows machine-rendered uniformity inconsistent with any optical capture device
- Reduced false positives in synthetic-scan analysis on consumer-origin single-page raster documents
- Added detection of documents rebuilt by consumer print-driver utilities to strip authoring history
- Improved origin classification of consumer virtual print drivers
- Improved scan classification on documents wrapped by programmatic generators in ambiguous cases
v2.16.0
- Improved embedded-metadata selection to correctly skip media-asset metadata packets written in element form
- Reduced false positives in synthetic-scan detection for low-information machine-printed labels
- Added detection of form field values that disagree with the displayed widget value
- Added detection of flattened-after-fill interactive forms
- Added detection of extra bytes appended past the document’s normal end marker
- Reduced false positives in multi-source page-assembly analysis for browser print-to-PDF pipelines
- Reduced false positives in trailing-byte analysis for institutional libraries that pad output to a block boundary
- Further reduced false positives in document-identifier analysis for server-rendered enterprise reports
v2.15.1
- Reduced false positives in document-identifier analysis for server-rendered enterprise reports
- Broadened forensic-tool metadata-tampering analysis to cover an additional class of self-written rewriter scripts
- Reduced false positives in synthetic-scan detection for vector-rendered institutional statements
v2.15.0
- Added a cross-layer timestamp consistency check within document metadata
- Improved precision of font-mapping evasion analysis to reduce false positives on legitimate multi-subset font embedding
- Reduced false positives in template-assembly analysis on documents produced by certain markup-flattening tools
v2.14.0
- Expanded multi-source assembly detection across additional structural layers
- Reduced false positives in metadata-completeness analysis on enterprise document-composition platforms
- Strengthened synthetic-scan detection against an additional class of forgeries
- Added detection of structurally inconsistent tool-pipeline declarations in document metadata
- Added detection of additional font-level anti-forensic post-processing
- Added detection of additional desktop-tool flatten-injection patterns
- Added detection of further fraud-kit metadata fingerprints
- Added cross-layer color-model consistency checks
v2.13.25
- Tightened verdict semantics for additional editor-workflow patterns
- Reduced false positives in metadata-date comparison for certain native-export authoring tools
- Improved origin classification for documents touched by a commercial desktop PDF editor
- Strengthened synthetic-scan detection
- Tightened identity-array suppression on additional open-source library variants
v2.13.24
- Improved metadata-date parsing for documents from certain diagram-authoring tools that use a non-standard timezone-suffix notation
- Improved reliability of metadata extraction for documents from certain image-pipeline tools whose metadata strings contain a non-printable trailing byte
- Expanded font-forensic analysis to catch additional editor field-replacement patterns
v2.13.23
- Added new detections targeting additional classes of post-creation content modification
- Strengthened cross-layer metadata analysis
- Improved multi-source assembly detection
- Reduced false positives in cross-layer metadata analysis on documents from enterprise variable-data publishing platforms
- Reduced false positives in document-identifier-array analysis on outputs from certain enterprise form-rendering pipelines
- Detection extended to documents that have passed through online PDF editing and conversion services
- Reduced false positives in incremental-update analysis for documents whose update increments are inherent to the signing workflow
- Detection extended to programmatic PDF processing libraries
- Reduced false positives in cross-layer metadata analysis on documents containing branded design assets
- Added detection of additional multi-source page assembly patterns
v2.13.22
- Improved accuracy of text-operator counting on documents whose page content lives in compressed payload-bearing streams
- Reduced false positives in sparse-text-overlay analysis on legitimate small-text documents
- Reduced false positives in vector-outline text analysis on consumer browser-print outputs
v2.13.21
- Improved accuracy of content-stream analysis on documents using nested page-content layouts
v2.13.20
- Added detection of an additional template-field substitution forgery pattern
v2.13.19
- Added detection of an additional synthetic-raster substitution pattern
- Reduced false positives in mixed-origin page-assembly analysis on additional rendering pipelines
- Suppressed redundant signals when a more specific marker already fires
- Improved accuracy of font-duplication analysis on additional rendering pipelines
- Reworked per-page font-set comparison; covers a previously-undetected multi-source assembly pattern
- Extended generator-fingerprint coverage to additional programmatic page-assembly libraries
v2.13.18
- Added detection of additional metadata-tampering tool markers
- Added detection of additional post-processing tool artifacts
- Added detection of script-injection patterns on top of re-emitted documents
- Reduced false positives in graphics-state analysis on large single-pass exports
- Reduced false positives in cross-metadata-stream date analysis
- Reduced false positives in sparse-text-overlay analysis
- Reduced false positives in text-content analysis for multi-byte font encodings
- Refined verdict semantics for documents whose state cannot be authenticated structurally
v2.13.17
- Reduced false positives in graphics-state analysis on documents containing embedded native-format roundtrip data streams
- Reduced false positives in template-assembly, font-session, and producer-consistency analyses on documents from enterprise variable-data composition platforms
- Reduced false positives in document-identifier analysis for documents from enterprise document-processing pipelines
- Reduced false positives on documents carrying only legitimate digital-fill-and-sign overlays from browser-rendered base documents
v2.13.16
- Reduced false positives in multi-source page assembly detection on documents from certain single-pass rendering pipelines
- Reduced false positives in redaction analysis for documents containing decorative background fills
- Reduced false positives in font duplication analysis for server-rendered enterprise reports
- Improved accuracy of signature integrity analysis across reader-specific incremental save patterns
- Improved accuracy of graphics-state analysis on documents containing non-page-content data streams
- Improved accuracy of invisible-character analysis for documents from certain browser-rendered environments
- Improved origin classification for documents from additional online form-editing platforms
v2.13.15
- Refined verdict semantics for a class of documents whose structural origin cannot establish institutional authenticity on its own — the API response now includes actionable guidance to verify such documents with the issuing organisation
v2.13.14
- Reduced false positives on documents processed through multi-party digital signing workflows
- Reduced false positives in multi-source page assembly detection for certain office productivity output
- Reduced false positives in incremental-update detection for certain office productivity output
v2.13.13
- Reduced false positives in incremental-modification analysis for documents produced by standard print-stream-to-PDF pipelines
- Reduced false positives for documents produced by an additional widely-used office suite whose normal library behaviour was previously misidentified
v2.13.12
- Expanded recognition of tools used for document re-processing
- Improved generator-fingerprint mismatch detection across the full set of metadata locations a forger may target
- Added a metadata-toolkit consistency check that flags vendor-mismatched metadata packets
- Reduced false positives in scan classification for certain programmatic certificate and report generators
- Expanded structural fingerprint detection to cover an additional family of PDF editing tools — documents edited by tools in this family are now correctly identified as modified when the declared generator contradicts the structural evidence
v2.13.11
- Expanded detection of document assembly patterns — additional structural markers are now identified when pages within a single document appear to originate from independent source files
- Expanded structural analysis to cover additional character encoding patterns — certain manipulation techniques used to alter document meaning without changing its visible appearance are now flagged
- Added detection for incomplete redaction — documents where content marked for removal remains present and recoverable in the file structure are now identified
- Expanded structural consistency checks to cover additional page-level properties that can indicate post-creation document assembly
- Added detection of internal timestamp inconsistencies between document objects and document-level metadata
v2.13.10
- Improved metadata extraction for documents that contain embedded image resources with their own metadata — preventing spurious metadata inconsistency markers
- Expanded recognition of hardware scanner devices — documents produced by an additional family of multifunction printer units are now correctly classified as inconclusive rather than intact
- Improved structural classification of scanner output that has been lightly post-processed — these are now correctly classified as inconclusive
v2.13.9
- Expanded recognition of hardware scanner and multifunction printer devices — documents produced by an additional family of office scanners are now correctly classified as inconclusive
- Improved structural scan detection across a broader range of scanner firmware variants
- Improved handling of web-optimised (linearized) PDFs — reducing false positives for documents optimised for fast web delivery
v2.13.8
- Reduced false positives for documents created with certain web-based design tools — a structural identifier pattern that is always produced by their automated export pipeline no longer incorrectly signals post-creation modification when no other evidence is present
- Reduced false positives for a class of programmatically generated documents where a graphics state asymmetry is a known generator artifact rather than evidence of post-creation stream editing
- Improved robustness of browser-origin rendering detection, closing a gap where the signal could be masked on certain page layouts
- Added recognition of an additional online document workflow platform — documents exported by its automated HTML-to-PDF pipeline are now correctly classified as inconclusive rather than modified, since no structural integrity guarantees apply to browser-rendered output
- Improved cross-field metadata consistency check to correctly handle documents whose titles or author names contain non-ASCII characters — eliminates a class of false positives for internationalized documents
v2.13.7
- Reduced false positives in identifier-pattern analysis on documents generated by enterprise reporting frameworks
- Extended the above fix to additional members of the same document generation library family across different programming languages and forks
- Reduced false positives in metadata cross-field consistency checks for known generator artifacts
- Improved coverage of graphics-state balance analysis on documents with multi-stream page content
- Extended detection of residual document structures inherited from prior templates in rebuilt single-revision files
v2.13.6
- Significantly expanded recognition of design, publishing, and editing tools — documents created with a broader range of non-institutional applications now correctly return inconclusive instead of intact
- Added detection for additional online document editing tools
- Improved self-check sampling to exclude documents already classified as inconclusive, reducing false discrepancy reports
v2.13.5
- Improved detection of documents assembled from pages of different origins — more cases are now correctly identified as modified
- Improved detection of documents where encoding characteristics of embedded images are inconsistent across pages — a structural indicator of post-creation assembly
- Improved detection of documents rebuilt from scratch by editing tools — a structural identifier inconsistency now correctly signals post-creation modification
v2.13.4
- Improved classification of documents that appear to be scanned images of physical pages — these now consistently return inconclusive regardless of the declared software origin
- Improved recognition of additional scanner device types
- Improved detection of documents with evidence of post-creation text editing
- Improved detection of documents where identifying metadata has been replaced
v2.13.3
- Refined verdict semantics for documents that lack a sufficient temporal baseline for authenticity analysis
- Fixed false positives for documents generated by server-side browser automation — these were incorrectly classified as consumer software
- Improved detection of additional metadata-tampering patterns
- Improved detection of selectively-edited tool-identity fields
v2.13.2
- Reduced false positives in graphics-state analysis on documents containing payload-bearing binary streams
- Reduced false positives in graphics-state analysis on text content containing characters that coincide with operator codes
- Reduced false positives in metadata contradiction detection on documents using non-standard string encoding
- Reduced false positives for documents generated by enterprise print pipelines with non-standard file framing
v2.13.1
- Reduced false positives in graphics-state analysis on documents with embedded font programs
v2.13.0
- Added structural detection of scanned documents based on image placement geometry
- Added detection of invisible text overlay patterns associated with OCR processing
- Added detection of content edited in a document editor after initial generation
- Added detection of byte-level structural manipulation in document files
- Added detection of post-modification inconsistencies in optimized document structure
v2.12.0
- Added detection of binary image substitution in scanned documents
v2.11.9
- Improved detection of documents rendered by a browser print pipeline
- Improved parsing of non-standard date formats in document metadata
v2.11.8
- Improved tool identity checks
v2.11.7
- Expanded online converter recognition
v2.11.6
- Improved detection of inconsistencies between metadata layers
v2.11.5
- Reduced false positives for documents with incomplete metadata
- API is now available at the dedicated subdomain api.htpbe.tech/v1
- The previous base URL (htpbe.tech/api/v1) continues to work — no migration required
v2.11.4
- Expanded consumer software recognition
v2.11.3
- Print-to-PDF documents are now correctly classified as consumer software origin
v2.11.2
- Reduced false positives in metadata-date comparison for generators that emit a small intra-session timestamp gap
v2.11.1
- Expanded the list of office software recognized as consumer origin — previously unrecognized editors now correctly return inconclusive instead of intact
v2.11.0
- Added detection of structurally impossible metadata dates
- Added detection of minimal incremental updates consistent with metadata-only tampering
- Improved detection reliability for modern PDF formats (PDF 1.5+)
- Improved detection of digitally signed documents with long-term validation data
- Reduced false positives on legitimately signed documents
- Encrypted PDFs now return a clear error instead of an unreliable result
- Previously analyzed files may benefit from re-analysis
v2.10.0
- Fixed misclassification of several server-side PDF generation tools as consumer software — documents generated by institutional automation pipelines were incorrectly returned as inconclusive
- Improved distinction between browser-based consumer printing and programmatic server-side rendering pipelines that share underlying rendering technology
- Previously analyzed files from affected institutional pipelines may benefit from re-analysis
v2.9.0
- Introduced detection of mixed-origin page assembly based on structural rendering pipeline characteristics detectable at the content stream level
- Documents with pages of confirmed mixed rendering origin are now flagged as modified when corroborated by additional structural evidence
- Previously analyzed files may benefit from re-analysis
v2.8.0
- Added detection of an additional anti-forensic rasterization pattern used to destroy text extractability while preserving visual appearance
- Previously analyzed files may benefit from re-analysis
v2.7.0
- Added detection of an additional mixed-origin page-assembly forgery pattern
- False-positive guards prevent flagging of legitimate scanned annexes
- Previously analyzed files may benefit from re-analysis
v2.6.1
- Improved scan classification reliability — fixed a false-negative where certain compressed image streams could cause scanned PDFs to be misclassified as institutional instead of inconclusive.
v2.6.0
- Introduced detection of documents assembled from pages rendered in independent sessions
- Previously analyzed files may benefit from re-analysis
v2.5.3
- Reduced false positives for documents using the ISO 32000-1 fast-web-view format
v2.5.2
- Reduced false positives in design-tool forgery detection for documents from certain office productivity software
v2.5.1
- Fixed two parsing issues in assembled-document detection that caused certain multi-page documents to pass as intact
- Previously analyzed assembled documents may benefit from re-analysis
v2.5.0
- Fixed a parsing gap that caused certain non-standard PDF files to bypass stream-based analysis
- Added detection of an additional PDF editing tool that previously evaded fingerprinting
- Introduced detection of documents assembled from multiple independently imported pages
- Previously analyzed files may benefit from re-analysis
v2.4.0–2.4.2
- Fixed false "scanned document" classification for modern PDF formats (PDF 1.5+)
- Removed false-positive alpha-channel detection — PDFs with images using transparency are no longer incorrectly flagged
- Closed a detection gap where signature-removal could go undetected in certain re-emitted documents
- Expanded consumer-software origin recognition to include common image and design editors
- Previously analyzed files may benefit from re-analysis
v2.3.0
- Improved analysis consistency: all detection signals are now evaluated uniformly without special-case exceptions
- Structural anomalies are now included in the analysis output
- Reduced false positives for PDFs generated with certain XMP-only workflows
- Previously analyzed files may benefit from re-analysis
v2.2.1
- Fixed detection reliability for PDFs using modern compressed object streams (PDF 1.5+)
- Resolved edge cases in content stream parsing and font subset analysis
- Previously analyzed files may benefit from re-analysis
v2.2.0
- Introduced detection of template-assembly document forgeries
- Expanded coverage to identify composites built from design-tool templates
- Previously analyzed files may benefit from re-analysis
v2.1.7–2.1.8
- Improved detection accuracy: fixed rare false negatives where a real modification marker was missed
- Improved post-signature tampering detection using cryptographic verification
- Fixed false positives for PDFs with an invalid creation date that was still present in metadata
v2.1.6
- Added "Cannot Determine" result for PDFs created with consumer office and word-processing software
- New origin detection: API now returns origin.type and origin.software fields
- New primary status field in API: "intact", "modified", or "inconclusive"
- Result page now shows a grey "Cannot Determine" badge and explanation for consumer-software PDFs
v2.1.5
- Improved detection accuracy for documents from certain office productivity software
- Previously analyzed files may benefit from re-analysis
v2.1.2–2.1.4
- Improved detection accuracy for documents from certain word-processing software
- Reduced false positives for common PDF creation tools
- Previously analyzed files may benefit from re-analysis
v2.1.1
- Improved compatibility with modern PDF formats (PDF 1.5+)
- Enhanced verification for digitally signed documents; significantly reduced false positives on legitimately signed PDFs
v2.1.0
- Redesigned the detection engine with a new approach to identifying document modifications
- Improved accuracy by replacing simple metadata comparison with a more robust analysis method
- Introduced detection of known PDF editing tools
- Previously analyzed files may benefit from re-analysis
v2.0.4
- Fixed timezone bug in date display (dates no longer shown in the future)
- Improved UTC timestamp handling for accurate relative time display
v2.0.3
- Fixed false positive bug in PDF modification detection
- Improved accuracy for metadata analysis
v2.0.2
- Added user dashboard with API key management
- Implemented passwordless authentication (Google, GitHub, Magic Links)
- Created billing and subscription management interface
v2.0.1
- Enhanced API infrastructure with monthly quota management
- Improved typography and visual consistency
- Updated documentation for PDF metadata analysis
v2.0.0
- Major platform update — rebuilt application infrastructure
- Migrated to Turso database with Drizzle ORM for improved reliability
- Comprehensive UX improvements across the application
- Enhanced error handling and stability
v1.0.0
- Initial public release
- Core PDF analysis features