610 Breakpoints: How PELT Recalibration Tightened Temporal Analysis
6 investigations
TLDR: A penalty sweep across 20 logarithmic steps reduced the corpus change-point count from 889 to 610, with the optimal penalty of 0.2069 selected by elbow...
TLDR: After a vision language model extracted 4,286 flights and 392 passenger names from handwritten logs, name deduplication and cross-domain matching...
TLDR: Traditional text extraction failed completely on Epstein's handwritten flight logs. A vision language model -- a type of AI that interprets entire page...
TLDR: A grouping algorithm that finds clusters of closely connected entities -- called Leiden community detection -- applied to 29.5 million entity...
TLDR: An algorithm that finds sudden shifts in document activity patterns -- called PELT -- detected 889 verified breakpoints in document activity time series,...
TLDR: A scoring method that flags documents containing unusual combinations of names -- combining two established information-theory measures called IDF and PMI...