Documentary Production
EP03: Ghosts in the Machine — When Machines Hallucinate
TLDR Episode 3 of PAPER TRAIL confronts the pipeline's most dangerous failure mode: phantom entities generated by OCR hallucination. Of the top 25...
3 investigations
TLDR A 190-page Ghislaine Maxwell UBS account statement (EFTA01275697.pdf) generated 190 false "Krakow" entity mentions when automated text scanning...
TLDR OCR engines (software that converts images of text into searchable characters) produce phantom entities from blank form labels and repeated document...