A linkable presentation of Aaron Swartz Freedom Of Information Act response from the Secret Service and FBI found here. Instead of having to download the pdfs you can view them as a webpage.
- add cloudfront or fastly cdn for faster global delivery
- build client side ajax system for intelligently loading ~20 jpgs at a time
- provide .tar.gz link to entire OCR and/or pdf dataset
- use poppler/OCR/mechanical turk to build plain-text dataset
- provide full-text search of the database and navigate to specfic page upon click
npm install
MIT