Corpus & Data
The 16-Script Pipeline
TLDR A pipeline of 27+ Python scripts transforms 2.1 million raw government documents into a searchable PostgreSQL database with 2.38 million extracted...
3 investigations
TLDR A pipeline of 27+ Python scripts transforms 2.1 million raw government documents into a searchable PostgreSQL database with 2.38 million extracted...
TLDR The entire 2.1 million document Epstein corpus was processed on a single Windows PC: an Intel i9-13950HX with 24 cores, an NVIDIA RTX 4070 with 8 GB of...
TLDR Seven production Scalable Vector Graphics (SVG) images — a type of image that stays sharp at any size — were created at 1920x1080 resolution with a...