278 Guard Gaps: 202 Minutes Without Oversight
9 investigations
TLDR VLM frame-by-frame annotation of 4,178 key frames across all 368 surveillance videos with parseable timestamps reveals guards present in 91.0% of frames...
TLDR The 106 previously unanalyzed control room videos have now been fully processed, yielding 869 annotated frames across 55 videos with parseable...
TLDR Data Set 8 of the DOJ Epstein document release contains 419 MP4 surveillance videos totaling 412.5 hours of footage from the Metropolitan Correctional...
TLDR After a vision-language model extracted 4,286 flights and 392 passenger names from handwritten logs, name deduplication and cross-domain matching...
TLDR Vision-language model processing of unredacted flight logs extracted 4,286 flights and 392 unique passenger names with zero errors. The same model on...
TLDR The corpus contains 224 parsed wire transfers totaling $24,059,535.45, spanning April 2004 to August 2019 across five banking institutions....
TLDR OCR engines (software that converts images of text into searchable characters) produce phantom entities from blank form labels and repeated document...
TLDR Qwen2.5-VL-7B (a vision-language model — an AI system that reads images the way a human would) running on a single NVIDIA RTX 4070 with 8 GB of video...
TLDR Traditional text extraction failed completely on Epstein's handwritten flight logs. A vision-language model, a type of AI that interprets entire page...