TLDR
Episode 14, the series finale, confronts the 42% gap: the DOJ identified over 6 million potentially responsive pages and released approximately 3.5 million. The pipeline that processed 2.1 million documents — extracting 2,383,898 entities, mapping 29.5 million relationships, detecting 125,620 communities — is built, tested, and ready to process whatever comes next (PAPER TRAIL Project, 2026a).
The 42% Gap
The number defines the series' conclusion. The DOJ's own figures document the disparity: six million pages identified as potentially responsive, three and a half million released. The Epstein Files Transparency Act (P.L. 119-38) passed 427 to 1 — the most bipartisan vote of the 118th Congress — mandating release. Compliance is the question (Epstein Files Transparency Act, Pub. L. No. 119-38, 2025).
The known gaps are specific. NPR reported 50+ pages of FBI interviews related to Trump accusations withheld or removed. Three of four FBI 302 summaries are missing. Representatives Khanna and Massie reviewed unredacted files at the DOJ and identified six men whose names remain redacted — described as "likely incriminated." Senator Wyden's Treasury investigation documented $1.08 billion in 4,725 wire transfers through Epstein accounts, but those Treasury records are not in the public corpus (PAPER TRAIL Project, 2026a).
At the same time, 31+ child victims' identities were exposed through redaction failures — the exact inverse of the statute's intent. Victims exposed. Perpetrators protected.
The Series in Numbers
EP14 recaps what 14 episodes and 6 evidence domains produced. 2,383,898 entities extracted at 97.4% NER coverage. 519,000 entity clusters resolved via Splink probabilistic linkage. 125,620 Leiden communities detected. 535,318 structural brokers identified via Burt's constraint. 224 wire transfers totaling $24.1 million parsed. $47.3 million in suspicious activity documented by a single TD Bank SAR. 2,894 FedEx shipments tracked. 863,000 emails indexed. 53 corporate entities mapped. 8 aircraft traced through FAA records (PAPER TRAIL Project, 2026a).
The corrections: 2 observations retracted (OBS-5 and OBS-6, OCR hallucinations). 5 calibration dates corrected. 1 hypothesis refuted (OBS-1, the Robert Crumb FedEx finding). Every error disclosed in the episode where it was discovered.
Ready to Process
The episode's title — "Ready To Process The Unreleased Epstein Files When Released" — is a statement of capability. The pipeline is not theoretical. It has already run across every released document. The same entity resolution, the same cross-domain synthesis, the same ACH scoring, the same error disclosure can process new documents the moment they are released — whether by DOJ compliance, congressional subpoena, or court order. No new infrastructure is needed. The unreleased files are the input. The pipeline is the machine (PAPER TRAIL Project, 2026b).
The Chao1 estimator projects 468,000 entities in the unseen portion of the corpus. The German Tank Problem flags sequential document gaps consistent with spoliation. These are not speculations — they are statistical projections from the observed data, with known confidence intervals.
Why This Episode Matters
EP14 closes the series by refusing to close the investigation. The pipeline identified patterns. It tested hypotheses. It disclosed errors. It declared zero findings because the confidence threshold was not met. And it states, explicitly, that it is ready to run again. The question the series leaves is not whether the pipeline works — 14 episodes demonstrated that it does. The question is whether the remaining 42% of documents will ever be released for it to process. That question belongs to Congress, the courts, and the Department of Justice.
References
Epstein Files Transparency Act, Pub. L. No. 119-38 (2025).
PAPER TRAIL Project. (2026a). EP14 slide content: Series summary, known gaps, pipeline capabilities [Presentation]. communications/ep14_slides/
PAPER TRAIL Project. (2026b). Processing pipeline: 33 scripts, complete processing state [Computer software]. app/scripts/
This research is sponsored by Subthesis.