Algorithm Updates

Changelog

Latest updates and improvements to HTPBE?

24.06.2026Latestv2.29.0

New detection class for documents that were reworked in an interactive PDF editor after their original creation — a common way a forged statement or letter is built, where an authentic document is opened in an editing application, a value is altered, and the file is saved back. Files reworked this way previously passed as untouched; they are now recognised as edited after the fact, surfaced under the same edited-in-an-editor outcome as our other content-editing checks.
Reduced false positives in our scanned-page detection on a class of genuine programmatically-generated enterprise bills and statements that carry lightweight postal-mailing marks — the sorting and franking elements a print-and-mail run stamps onto an otherwise digital document. These thin printed marks were previously mistaken for a full-page scanned image, so authentic machine-issued documents were wrongly flagged as modified. The check now tells a sparse mailing mark apart from a genuine page-filling scan before it fires, so these legitimate bills pass — while a real full-page scan dropped into a digital document is still caught.
Broadened our edited-in-an-editor detection to recover a further class of documents reworked after their original creation — the kind of post-creation editing behind a forged statement or report. The recovery works independently of which editing application was used, so this class is caught no matter the tool that produced it, and is surfaced under the existing edited-in-an-editor outcome.
Reduced false positives on a class of genuine institutional financial and retirement statements that an enterprise pipeline composes automatically by assembling pre-rendered components, leaving the document’s descriptive metadata empty by design. That machine-composed shape previously tripped both our layered-revision check and our blanked-metadata check, so authentic composed statements were wrongly flagged as modified. We now recognise this composition pipeline by a validated fingerprint before standing those two flags down — and the allowance is deliberately narrow: it holds only while the document’s own internal structure proves nothing in it was rewritten after assembly, so a forger cannot fabricate the fingerprint to smuggle an edited file through, because the edit itself breaks the proof. These genuine statements no longer read as modified, while any real post-assembly change is still caught.

23.06.2026v2.28.0

Added a new proprietary integrity signal to the detection suite. We are publicly acknowledging that this signal exists, but — unlike our other checks — we are deliberately not disclosing how it works. The detection method is held back on purpose so that forgers cannot read a description of it and engineer their way around it. When it fires, it is strong, corroborated evidence that the file is a modified derivative rather than an untouched original.
This is the first check in the catalogue whose mechanism we keep undisclosed by design; every other check continues to be described in plain outcome terms.

22.06.2026v2.27.1

Reduced false positives in our hidden-duplicate-text check on a class of genuine documents laid out as tables, where each visible cell is placed independently on the page. These distinct, fully-visible values were previously being misread as concealed text stacked in the same spot, and the document was wrongly marked modified. The check now judges where each value actually sits on the finished page before comparing, so separately-placed visible cells are no longer mistaken for one another — while a substitute value genuinely hidden underneath a visible one is still caught.

19.06.2026v2.27.0

Reduced false positives in our creation-versus-modification timeline check on a class of genuine government and corporate forms generated by a server-side form-rendering pipeline. Documents of this kind carry an authoring-stage origin date that is naturally years apart from the date their final copy was produced — a normal fingerprint of how these forms are issued, not evidence of a later edit. This pipeline is now recognised alongside the other native-export rendering engines for which that gap is expected, so authentic forms are no longer mistaken for modified ones. The allowance stays deliberately narrow: it applies only to clean single-render documents whose independent timeline layers agree, so a fabricated file cannot borrow it to slip past detection, and genuine post-creation edits still fire.
New detection class for documents rebuilt inside a graphics-design tool from a source file held locally on an operator’s own machine — a construction pattern that no institution uses to issue its statements, certificates, or forms, and strong evidence the file was reassembled by a person rather than produced by the named issuer.
New detection class for documents whose metadata layer was regenerated by a separate tool in a different locale after the original authoring — a second-pass rewrite that a genuine single-pass generator never produces.

18.06.2026v2.26.1

Broadened detection of documents assembled by merging pages from more than one source file — recovering a class of plainly-built merges that previously slipped through certified as a single original, including documents with extra pages glued on after the fact.

17.06.2026v2.26.0

Retired the standalone document-identifier consistency check. Reviewed against a large corpus of genuine documents, the internal identifier records were found to differ legitimately on a single clean render across many established document generators — so on their own they could not reliably separate an authentic file from an edited one, and the check produced false positives on genuine documents while adding nothing the structural modification checks did not already provide. The retirement narrows nothing real: every modification this check could ever evidence is already covered independently by the structural layer.
New detection class for an institutional document that was opened and re-saved by a second tool after its original creation, with no digital signature to account for the change — a post-creation re-stamp pattern that is now caught on its own structural evidence.
Restored detection of documents assembled by merging pages from more than one source file — now recovered on its own structural evidence, including on merges that were subsequently rebuilt to look like a single original.
New detection class for a stamp-overlay edit — substitute values layered on top of an otherwise-untouched original page to alter what it appears to say.
New detection class for a cover-and-replace edit — original content concealed and replaced with substitute values to change what the page reads.

16.06.2026v2.25.3

Reduced false positives on a class of genuine documents produced by physical scanning hardware whose output was previously being misread as a synthetically-rendered imitation of a scan — authentic scans of this kind now correctly settle at the not-certifiable scan-origin ceiling instead of being flagged as modified, while truly fabricated pseudo-scans are unaffected.

15.06.2026v2.25.2

Widened the set of single-pass re-rendered documents we decline to certify as unmodified — files rebuilt in one pass discard the modification-history evidence an intact verdict depends on, so this class is now consistently treated as not-certifiable rather than passed as clean. Genuine institutional documents are unaffected, and structural tamper detectors still run independently.
Recovered scanned-document recognition on a class of scanned files that were previously slipping through certified as clean — a scanned document can never be confirmed unmodified, so this class now correctly returns the not-certifiable ceiling rather than an intact verdict.

13.06.2026v2.25.1

Reduced false positives in our tool-versus-structure consistency check on a class of legitimate institutional documents produced by a newer generation of an established generator — genuine fabricated-origin tampering is unaffected.

10.06.2026v2.25.0

New detection class for documents that misrepresent which tool produced them — flagging origin that was rewritten to disguise where the document actually came from.
Broadened generator-identity-forgery detection to a further class of documents whose stated origin contradicts the rest of the document.
Reduced false positives in our generator-identity check on a class of legitimate institutional documents carrying an embedded company logo — identity belonging to a bundled graphic is no longer mistaken for the document’s own origin, while genuine document-level identity disagreement still fires.

09.06.2026v2.24.2

Reduced false positives in our document-identifier consistency check on a class of legitimate single-pass institutional documents whose identifier records differ purely as a generation-time artifact — real identifier-mismatch tampering on edited documents is unaffected.
Reduced false positives in our template-assembly detection on a class of legitimately print-rendered documents whose per-page font handling was being misread as the reuse signature of a pieced-together template — detection of genuine template-assembly forgeries still fires through its other independent signals.

05.06.2026v2.24.1

Reduced false positives in our design-tool / template-assembly detection on a class of legitimate modern documents that were previously being mistaken for pieced-together templates — detection of genuine template-assembly forgeries is unaffected.
Refined how the page-assembly analysis reads the document so a legitimate modern layout is no longer misread as reused template scaffolding, with real multi-source assembly still flagged through our other independent signals.

04.06.2026v2.24.0

New detection class for documents whose internal structure references a component that was removed during a rebuild — a tell-tale of an embedded component quietly stripped out while the document was reassembled.
Hardened structural-integrity checks to flag dangling internal references left behind when a document is rebuilt from an original, without relying on any single tool fingerprint.

27.05.2026v2.23.3

Recovered tamper detection on a class of edited institutional documents — including documents altered well after their original issue date — that previously slipped through the timestamp-consistency check.
Reduced false positives on legitimate enterprise report outputs, whose normal compile-to-render timing is no longer mistaken for post-export tampering.

26.05.2026v2.23.2

Reduced false positives in our content-stream consistency analysis on a class of legitimate documents that were previously being mistaken for tampered files.
Hardened how the consistency analysis interprets document content, so a class of legitimate files is no longer misread as inconsistent — detection of real inconsistencies is unaffected.
Expanded the set of desktop PDF editors recognized as re-rendering tools, so documents passed through them are no longer treated as guaranteed originals — the integrity ceiling now matches our existing rule for other re-rendered document classes.

23.05.2026v2.23.1

Withheld the integrity guarantee on a class of documents produced by operator-runnable rendering pipelines whose output is fully reproducible from a local template — aligns the verdict ceiling with the same rule already applied to other re-rendered document classes.
Brought the algorithm into line with our previously-documented stance that this family of rendering pipelines cannot ground an integrity guarantee on its own.
Withheld the integrity guarantee on a further class of programmatically-rendered outputs whose metadata profile reads as bare-defaults — characteristic of unattended document-generation scripts rather than first-party institutional pipelines.

22.05.2026v2.23.0

New detection class for documents that impersonate a trusted institutional authoring pipeline — catches third-party rebuilds that overwrite the generator field to disguise the document as an institutional export
Hardened identity-layer corroboration so a document claiming a high-trust origin must also carry the structural fingerprints that the genuine pipeline always leaves behind

21.05.2026v2.22.1

Reduced false positives on a class of single-pass institutional documents that were previously being mistaken for incrementally re-edited files
Tightened a re-write-detection rule so a known-legitimate internal layout pattern is no longer treated as evidence of post-creation edits, while the guard against genuine appended edits remains in place

20.05.2026v2.22.0

New detection class for a font-layer inconsistency characteristic of non-genuine institutional documents
Hardened the integrity ceiling for a class of re-rendered outputs whose edit history cannot be reconstructed
Bypass-hardened a previously-trusted producer category against impersonation by reassembled documents
Extended scanner-origin recognition to a producer-string variant we were missing — recovers correct origin classification for a class of smartphone-scanner outputs
New detection class for synthetic edit-history patterns where a document claims to have been touched by an older authoring runtime than the one that originally produced it — a configuration that cannot arise from a normal single-device workflow

19.05.2026v2.21.1

Withheld the integrity guarantee on a class of documents whose authenticity cannot be independently corroborated by our other detection layers.
Recovered correct origin classification on a class of legitimately-generated documents that the previous heuristic was over-flagging.

18.05.2026v2.21.0

Added detection for institutional documents re-printed after editing to flatten authoring history — recovered evidence of the underlying edit despite the re-print
Added a toolchain-residue detector that catches documents whose declared generator contradicts the authoring residue carried elsewhere in the file — characteristic of re-rendering a document that originated in a different pipeline
Extended typeface fingerprinting to catch fonts whose identity was hand-altered before being embedded in an overlay edit
Recovered detection of declared file-size inconsistencies that were previously masked by an over-restrictive internal gate
Removed two detector branches that did not produce unique findings — their coverage is preserved by adjacent markers
Tightened a signature-workflow carve-out so it can no longer mask overlay-edit evidence
Recovered standalone detection on metadata-layer timestamp contradictions that were previously suppressed
Strengthened deliberate-blanking detection against a parsing edge case that previously hid the evidence
Extended font-embedding expectations to an additional class of legitimate authoring pipelines
Reduced false positives on enterprise reports generated by institutional reporting pipelines.

17.05.2026v2.19.3

Reduced false positives in generator-fingerprint analysis on documents from browser-rendered print pipelines
Reduced false positives in font-duplication analysis on enterprise reports whose typeface variants share a common family
Reduced false positives in multi-source page assembly and template-assembly analysis on enterprise statement composition pipelines that embed pre-rendered design components

16.05.2026v2.19.2

Bypass-hardened a previously-trusted generator category against impersonation
Bypass-hardened the carve-out for legitimately old documents against trivial timestamp manipulation
Refreshed the registry of online editing and conversion services consulted during origin classification
Tightened the conditions under which the embedded-metadata timestamp signal can fire — reduces false positives on multi-pass enterprise document-assembly pipelines
Recovered detection on a class of full-rebuild assembly patterns that previously escaped corroboration

16.05.2026v2.19.1

Recovered tamper detection on documents where original authoring records were deliberately erased post-process
Broadened cross-layer metadata-laundering analysis to cover a wider family of browser-rendered print pipelines
Recovered multi-source assembly detection on shorter documents
Hardened a previously-trusted raster-origin category against impersonation by editing tools
Recovered detection on shorter documents flattened from multi-session edits

16.05.2026v2.19.0

Retired four detection paths whose underlying signals could not be reliably distinguished from legitimate authoring behaviour — keeps the verdict surface honest and removes false-positive risk on bona-fide enterprise documents
Simplified incomplete-redaction analysis to the single structurally-precise annotation path

16.05.2026v2.18.2

Improved origin classification of documents produced by online HTML-rendering services and post-creation PDF reprocessing tools — both now resolve to software-origin and cannot certify an unmodified verdict

16.05.2026v2.18.1

Improved origin classification of documents produced by server-side HTML-layout rendering pipelines — these now fall under software-origin and cannot resolve to an unmodified verdict

15.05.2026v2.18.0

Added detection of editing history collapsed into a single revision to mask prior modification rounds
Added detection of programmatically generated pages disguised as a captured scan
Added detection of markup annotations attached to a base file that carries no creation history
Improved origin classification accuracy for documents derived from screen-resolution raster sources

14.05.2026v2.17.1

Improved coverage of editing-tool fingerprint analysis on additional document layouts
Reduced false positives in embedded image analysis for certain enterprise customer-communications rendering pipelines

13.05.2026v2.17.0

Added detection of documents presented as a captured image whose pixel content shows machine-rendered uniformity inconsistent with any optical capture device
Reduced false positives in synthetic-scan analysis on consumer-origin single-page raster documents
Added detection of documents rebuilt by consumer print-driver utilities to strip authoring history
Improved origin classification of consumer virtual print drivers
Improved scan classification on documents wrapped by programmatic generators in ambiguous cases

12.05.2026v2.16.0

Improved embedded-metadata selection to correctly skip media-asset metadata packets written in element form
Reduced false positives in synthetic-scan detection for low-information machine-printed labels
Added detection of form field values that disagree with the displayed widget value
Added detection of flattened-after-fill interactive forms
Added detection of extra bytes appended past the document’s normal end marker
Reduced false positives in multi-source page-assembly analysis for browser print-to-PDF pipelines
Reduced false positives in trailing-byte analysis for institutional libraries that pad output to a block boundary
Further reduced false positives in document-identifier analysis for server-rendered enterprise reports

11.05.2026v2.15.1

Reduced false positives in document-identifier analysis for server-rendered enterprise reports
Broadened forensic-tool metadata-tampering analysis to cover an additional class of self-written rewriter scripts
Reduced false positives in synthetic-scan detection for vector-rendered institutional statements

09.05.2026v2.15.0

Added a cross-layer timestamp consistency check within document metadata
Improved precision of font-mapping evasion analysis to reduce false positives on legitimate multi-subset font embedding
Reduced false positives in template-assembly analysis on documents produced by certain markup-flattening tools

08.05.2026v2.14.0

Expanded multi-source assembly detection across additional structural layers
Reduced false positives in metadata-completeness analysis on enterprise document-composition platforms
Strengthened synthetic-scan detection against an additional class of forgeries
Added detection of structurally inconsistent tool-pipeline declarations in document metadata
Added detection of additional font-level anti-forensic post-processing
Added detection of additional desktop-tool flatten-injection patterns
Added detection of further fraud-kit metadata fingerprints
Added cross-layer color-model consistency checks

07.05.2026v2.13.25

Tightened verdict semantics for additional editor-workflow patterns
Reduced false positives in metadata-date comparison for certain native-export authoring tools
Improved origin classification for documents touched by a commercial desktop PDF editor
Strengthened synthetic-scan detection
Tightened identity-array suppression on additional open-source library variants

06.05.2026v2.13.24

Improved metadata-date parsing for documents from certain diagram-authoring tools that use a non-standard timezone-suffix notation
Improved reliability of metadata extraction for documents from certain image-pipeline tools whose metadata strings contain a non-printable trailing byte
Expanded font-forensic analysis to catch additional editor field-replacement patterns

05.05.2026v2.13.23

Added new detections targeting additional classes of post-creation content modification
Strengthened cross-layer metadata analysis
Improved multi-source assembly detection
Reduced false positives in cross-layer metadata analysis on documents from enterprise variable-data publishing platforms
Reduced false positives in document-identifier-array analysis on outputs from certain enterprise form-rendering pipelines
Detection extended to documents that have passed through online PDF editing and conversion services
Reduced false positives in incremental-update analysis for documents whose update increments are inherent to the signing workflow
Detection extended to programmatic PDF processing libraries
Reduced false positives in cross-layer metadata analysis on documents containing branded design assets
Added detection of additional multi-source page assembly patterns

04.05.2026v2.13.22

Improved accuracy of text-operator counting on documents whose page content lives in compressed payload-bearing streams
Reduced false positives in sparse-text-overlay analysis on legitimate small-text documents
Reduced false positives in vector-outline text analysis on consumer browser-print outputs

04.05.2026v2.13.21

Improved accuracy of content-stream analysis on documents using nested page-content layouts

04.05.2026v2.13.20

Added detection of an additional template-field substitution forgery pattern

03.05.2026v2.13.19

Added detection of an additional synthetic-raster substitution pattern
Reduced false positives in mixed-origin page-assembly analysis on additional rendering pipelines
Suppressed redundant signals when a more specific marker already fires
Improved accuracy of font-duplication analysis on additional rendering pipelines
Reworked per-page font-set comparison; covers a previously-undetected multi-source assembly pattern
Extended generator-fingerprint coverage to additional programmatic page-assembly libraries

01.05.2026v2.13.18

Added detection of additional metadata-tampering tool markers
Added detection of additional post-processing tool artifacts
Added detection of script-injection patterns on top of re-emitted documents
Reduced false positives in graphics-state analysis on large single-pass exports
Reduced false positives in cross-metadata-stream date analysis
Reduced false positives in sparse-text-overlay analysis
Reduced false positives in text-content analysis for multi-byte font encodings
Refined verdict semantics for documents whose state cannot be authenticated structurally

30.04.2026v2.13.17

Reduced false positives in graphics-state analysis on documents containing embedded native-format roundtrip data streams
Reduced false positives in template-assembly, font-session, and producer-consistency analyses on documents from enterprise variable-data composition platforms
Reduced false positives in document-identifier analysis for documents from enterprise document-processing pipelines
Reduced false positives on documents carrying only legitimate digital-fill-and-sign overlays from browser-rendered base documents

29.04.2026v2.13.16

Reduced false positives in multi-source page assembly detection on documents from certain single-pass rendering pipelines
Reduced false positives in redaction analysis for documents containing decorative background fills
Reduced false positives in font duplication analysis for server-rendered enterprise reports
Improved accuracy of signature integrity analysis across reader-specific incremental save patterns
Improved accuracy of graphics-state analysis on documents containing non-page-content data streams
Improved accuracy of invisible-character analysis for documents from certain browser-rendered environments
Improved origin classification for documents from additional online form-editing platforms

28.04.2026v2.13.15

Refined verdict semantics for a class of documents whose structural origin cannot establish institutional authenticity on its own — the API response now includes actionable guidance to verify such documents with the issuing organisation

27.04.2026v2.13.14

Reduced false positives on documents processed through multi-party digital signing workflows
Reduced false positives in multi-source page assembly detection for certain office productivity output
Reduced false positives in incremental-update detection for certain office productivity output

26.04.2026v2.13.13

Reduced false positives in incremental-modification analysis for documents produced by standard print-stream-to-PDF pipelines
Reduced false positives for documents produced by an additional widely-used office suite whose normal library behaviour was previously misidentified

25.04.2026v2.13.12

Expanded recognition of tools used for document re-processing
Improved generator-fingerprint mismatch detection across the full set of metadata locations a forger may target
Added a metadata-toolkit consistency check that flags vendor-mismatched metadata packets
Reduced false positives in scan classification for certain programmatic certificate and report generators
Expanded structural fingerprint detection to cover an additional family of PDF editing tools — documents edited by tools in this family are now correctly identified as modified when the declared generator contradicts the structural evidence

24.04.2026v2.13.11

Expanded detection of document assembly patterns — additional structural markers are now identified when pages within a single document appear to originate from independent source files
Expanded structural analysis to cover additional character encoding patterns — certain manipulation techniques used to alter document meaning without changing its visible appearance are now flagged
Added detection for incomplete redaction — documents where content marked for removal remains present and recoverable in the file structure are now identified
Expanded structural consistency checks to cover additional page-level properties that can indicate post-creation document assembly
Added detection of internal timestamp inconsistencies between document objects and document-level metadata

24.04.2026v2.13.10

Improved metadata extraction for documents that contain embedded image resources with their own metadata — preventing spurious metadata inconsistency markers
Expanded recognition of hardware scanner devices — documents produced by an additional family of multifunction printer units are now correctly classified as inconclusive rather than intact
Improved structural classification of scanner output that has been lightly post-processed — these are now correctly classified as inconclusive

23.04.2026v2.13.9

Expanded recognition of hardware scanner and multifunction printer devices — documents produced by an additional family of office scanners are now correctly classified as inconclusive
Improved structural scan detection across a broader range of scanner firmware variants
Improved handling of web-optimised (linearized) PDFs — reducing false positives for documents optimised for fast web delivery

22.04.2026v2.13.8

Reduced false positives for documents created with certain web-based design tools — a structural identifier pattern that is always produced by their automated export pipeline no longer incorrectly signals post-creation modification when no other evidence is present
Reduced false positives for a class of programmatically generated documents where a graphics state asymmetry is a known generator artifact rather than evidence of post-creation stream editing
Improved robustness of browser-origin rendering detection, closing a gap where the signal could be masked on certain page layouts
Added recognition of an additional online document workflow platform — documents exported by its automated HTML-to-PDF pipeline are now correctly classified as inconclusive rather than modified, since no structural integrity guarantees apply to browser-rendered output
Improved cross-field metadata consistency check to correctly handle documents whose titles or author names contain non-ASCII characters — eliminates a class of false positives for internationalized documents

21.04.2026v2.13.7

Reduced false positives in identifier-pattern analysis on documents generated by enterprise reporting frameworks
Extended the above fix to additional members of the same document generation library family across different programming languages and forks
Reduced false positives in metadata cross-field consistency checks for known generator artifacts
Improved coverage of graphics-state balance analysis on documents with multi-stream page content
Extended detection of residual document structures inherited from prior templates in rebuilt single-revision files

20.04.2026v2.13.6

Significantly expanded recognition of design, publishing, and editing tools — documents created with a broader range of non-institutional applications now correctly return inconclusive instead of intact
Added detection for additional online document editing tools
Improved self-check sampling to exclude documents already classified as inconclusive, reducing false discrepancy reports

19.04.2026v2.13.5

Improved detection of documents assembled from pages of different origins — more cases are now correctly identified as modified
Improved detection of documents where encoding characteristics of embedded images are inconsistent across pages — a structural indicator of post-creation assembly
Improved detection of documents rebuilt from scratch by editing tools — a structural identifier inconsistency now correctly signals post-creation modification

17.04.2026v2.13.4

Improved classification of documents that appear to be scanned images of physical pages — these now consistently return inconclusive regardless of the declared software origin
Improved recognition of additional scanner device types
Improved detection of documents with evidence of post-creation text editing
Improved detection of documents where identifying metadata has been replaced

16.04.2026v2.13.3

Refined verdict semantics for documents that lack a sufficient temporal baseline for authenticity analysis
Fixed false positives for documents generated by server-side browser automation — these were incorrectly classified as consumer software
Improved detection of additional metadata-tampering patterns
Improved detection of selectively-edited tool-identity fields

15.04.2026v2.13.2

Reduced false positives in graphics-state analysis on documents containing payload-bearing binary streams
Reduced false positives in graphics-state analysis on text content containing characters that coincide with operator codes
Reduced false positives in metadata contradiction detection on documents using non-standard string encoding
Reduced false positives for documents generated by enterprise print pipelines with non-standard file framing

14.04.2026v2.13.1

Reduced false positives in graphics-state analysis on documents with embedded font programs

12.04.2026v2.13.0

Added structural detection of scanned documents based on image placement geometry
Added detection of invisible text overlay patterns associated with OCR processing
Added detection of content edited in a document editor after initial generation
Added detection of byte-level structural manipulation in document files
Added detection of post-modification inconsistencies in optimized document structure

11.04.2026v2.12.0

Added detection of binary image substitution in scanned documents

10.04.2026v2.11.9

Improved detection of documents rendered by a browser print pipeline
Improved parsing of non-standard date formats in document metadata

08.04.2026v2.11.8

Improved tool identity checks

08.04.2026v2.11.7

Expanded online converter recognition

08.04.2026v2.11.6

Improved detection of inconsistencies between metadata layers

06.04.2026v2.11.5

Reduced false positives for documents with incomplete metadata
API is now available at the dedicated subdomain api.htpbe.tech/v1
The previous base URL (htpbe.tech/api/v1) continues to work — no migration required

03.04.2026v2.11.4

Expanded consumer software recognition

02.04.2026v2.11.3

Print-to-PDF documents are now correctly classified as consumer software origin

01.04.2026v2.11.2

Reduced false positives in metadata-date comparison for generators that emit a small intra-session timestamp gap

01.04.2026v2.11.1

Expanded the list of office software recognized as consumer origin — previously unrecognized editors now correctly return inconclusive instead of intact

31.03.2026v2.11.0

Added detection of structurally impossible metadata dates
Added detection of minimal incremental updates consistent with metadata-only tampering
Improved detection reliability for modern PDF formats (PDF 1.5+)
Improved detection of digitally signed documents with long-term validation data
Reduced false positives on legitimately signed documents
Encrypted PDFs now return a clear error instead of an unreliable result
Previously analyzed files may benefit from re-analysis

26.03.2026v2.10.0

Fixed misclassification of several server-side PDF generation tools as consumer software — documents generated by institutional automation pipelines were incorrectly returned as inconclusive
Improved distinction between browser-based consumer printing and programmatic server-side rendering pipelines that share underlying rendering technology
Previously analyzed files from affected institutional pipelines may benefit from re-analysis

26.03.2026v2.9.0

Introduced detection of mixed-origin page assembly based on structural rendering pipeline characteristics detectable at the content stream level
Documents with pages of confirmed mixed rendering origin are now flagged as modified when corroborated by additional structural evidence
Previously analyzed files may benefit from re-analysis

25.03.2026v2.8.0

Added detection of an additional anti-forensic rasterization pattern used to destroy text extractability while preserving visual appearance
Previously analyzed files may benefit from re-analysis

25.03.2026v2.7.0

Added detection of an additional mixed-origin page-assembly forgery pattern
False-positive guards prevent flagging of legitimate scanned annexes
Previously analyzed files may benefit from re-analysis

24.03.2026v2.6.1

Improved scan classification reliability — fixed a false-negative where certain compressed image streams could cause scanned PDFs to be misclassified as institutional instead of inconclusive.

24.03.2026v2.6.0

Introduced detection of documents assembled from pages rendered in independent sessions
Previously analyzed files may benefit from re-analysis

22.03.2026v2.5.3

Reduced false positives for documents using the ISO 32000-1 fast-web-view format

17.03.2026v2.5.2

Reduced false positives in design-tool forgery detection for documents from certain office productivity software

17.03.2026v2.5.1

Fixed two parsing issues in assembled-document detection that caused certain multi-page documents to pass as intact
Previously analyzed assembled documents may benefit from re-analysis

16.03.2026v2.5.0

Fixed a parsing gap that caused certain non-standard PDF files to bypass stream-based analysis
Added detection of an additional PDF editing tool that previously evaded fingerprinting
Introduced detection of documents assembled from multiple independently imported pages
Previously analyzed files may benefit from re-analysis

15.03.2026v2.4.0–2.4.2

Fixed false "scanned document" classification for modern PDF formats (PDF 1.5+)
Removed false-positive alpha-channel detection — PDFs with images using transparency are no longer incorrectly flagged
Closed a detection gap where signature-removal could go undetected in certain re-emitted documents
Expanded consumer-software origin recognition to include common image and design editors
Previously analyzed files may benefit from re-analysis

15.03.2026v2.3.0

Improved analysis consistency: all detection signals are now evaluated uniformly without special-case exceptions
Structural anomalies are now included in the analysis output
Reduced false positives for PDFs generated with certain XMP-only workflows
Previously analyzed files may benefit from re-analysis

12.03.2026v2.2.1

Fixed detection reliability for PDFs using modern compressed object streams (PDF 1.5+)
Resolved edge cases in content stream parsing and font subset analysis
Previously analyzed files may benefit from re-analysis

11.03.2026v2.2.0

Introduced detection of template-assembly document forgeries
Expanded coverage to identify composites built from design-tool templates
Previously analyzed files may benefit from re-analysis

10.03.2026v2.1.7–2.1.8

Improved detection accuracy: fixed rare false negatives where a real modification marker was missed
Improved post-signature tampering detection using cryptographic verification
Fixed false positives for PDFs with an invalid creation date that was still present in metadata

08.03.2026v2.1.6

Added "Cannot Determine" result for PDFs created with consumer office and word-processing software
New origin detection: API now returns origin.type and origin.software fields
New primary status field in API: "intact", "modified", or "inconclusive"
Result page now shows a grey "Cannot Determine" badge and explanation for consumer-software PDFs

25.02.2026v2.1.5

Improved detection accuracy for documents from certain office productivity software
Previously analyzed files may benefit from re-analysis

17.02.2026v2.1.2–2.1.4

Improved detection accuracy for documents from certain word-processing software
Reduced false positives for common PDF creation tools
Previously analyzed files may benefit from re-analysis

16.02.2026v2.1.1

Improved compatibility with modern PDF formats (PDF 1.5+)
Enhanced verification for digitally signed documents; significantly reduced false positives on legitimately signed PDFs

12.02.2026v2.1.0

Redesigned the detection engine with a new approach to identifying document modifications
Improved accuracy by replacing simple metadata comparison with a more robust analysis method
Introduced detection of known PDF editing tools
Previously analyzed files may benefit from re-analysis

11.02.2026v2.0.4

Fixed timezone bug in date display (dates no longer shown in the future)
Improved UTC timestamp handling for accurate relative time display

09.02.2026v2.0.3

Fixed false positive bug in PDF modification detection
Improved accuracy for metadata analysis

08.02.2026v2.0.2

Added user dashboard with API key management
Implemented passwordless authentication (Google, GitHub, Magic Links)
Created billing and subscription management interface

07.02.2026v2.0.1

Enhanced API infrastructure with monthly quota management
Improved typography and visual consistency
Updated documentation for PDF metadata analysis

06.02.2026v2.0.0

Major platform update — rebuilt application infrastructure
Migrated to Turso database with Drizzle ORM for improved reliability
Comprehensive UX improvements across the application
Enhanced error handling and stability

11.10.2024v1.0.0

Initial public release
Core PDF analysis features