Documentary Production
EP03: Ghosts in the Machine — When Machines Hallucinate
TLDR: Episode 3 of PAPER TRAIL confronts the pipeline's most dangerous failure mode: phantom entities generated by OCR hallucination. Of the top 25...
3 investigations
TLDR: Two observations were retracted after visual inspection revealed that scanning software (OCR, or optical character recognition: software that reads text...
TLDR: OCR engines (software that converts images of text into searchable characters) produce phantom entities from blank form labels and repeated document...