Single file: python contacts-remover.py input.pdf output.pdf
Entire directory (be certain that directory contents are CRS PDFs): python contacts-remover.py /input_directory/ /output_directory/
This script removes author contacts from publicly distributed CRS documents as suggested in the proposed legislation: H.R.4702 - Equal Access to Congressional Research Service Reports Act of 2016 https://www.congress.gov/bill/114th-congress/house-bill/4702?q=%7B%22search%22%3A%5B%22public+access%22%5D%7D&resultIndex=16
The method: The input documents are expanded. CRS/LOC/GOV email addresses and telephone numbers are detected within specific PDF objects. The objects are mapped, then the characters that make up each contact are replaced with blank spaces. The output is then compressed. Please report any contact removal failures.
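The replacement step can be illustrated with a minimal sketch. It is not the script's actual code: the regular expressions, the sample bytes, and the function name blank_contacts are assumptions made for illustration, and it ignores the object mapping needed when a contact is split across several PDF text operators. It only shows the idea of overwriting each match with the same number of blank spaces, so the surrounding bytes stay where they were.

```python
import re

# Hypothetical patterns: the expressions used by contacts-remover.py are not
# shown in this document, so these are stand-ins for illustration only.
EMAIL_RE = re.compile(rb"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.gov\b", re.IGNORECASE)
PHONE_RE = re.compile(rb"\b\d{3}[-.]\d{3}[-.]\d{4}\b")


def blank_contacts(data: bytes) -> bytes:
    """Replace every email/phone match with spaces of the same length."""

    def to_spaces(match):
        # Same length as the match, so no other byte in the stream moves.
        return b" " * len(match.group(0))

    for pattern in (EMAIL_RE, PHONE_RE):
        data = pattern.sub(to_spaces, data)
    return data


# Made-up fragment of an expanded PDF text object.
sample = b"(Jane Analyst, jdoe@crs.loc.gov, 202-555-0100) Tj"
cleaned = blank_contacts(sample)
```

In this sketch the cleaned bytes have the same length as the input, and the text before and after the contacts is untouched.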
This Python script is built on the mapping version by billmarczak (https://github.com/billmarczak): https://github.com/antoinemcgrath/contact_removal/commit/ea9635c61c793859aee0d888e10e95da2c85af52 and on qpdf**.
**QPDF is used to expand and condense CRS report files. It is released under the terms of Version 2.0 of the Artistic License. Visit http://qpdf.sourceforge.net/ for additional information.
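As a rough sketch of the expand and condense steps, the qpdf command lines might be built as below. The options shown (--qdf, --object-streams, --stream-data) are real qpdf options, but whether contacts-remover.py uses these exact ones is an assumption; the helper names expand_cmd, compress_cmd, and run are hypothetical.

```python
import subprocess


def expand_cmd(src: str, dst: str) -> list:
    # --qdf writes an uncompressed, editable form of the PDF.
    return ["qpdf", "--qdf", "--object-streams=disable", src, dst]


def compress_cmd(src: str, dst: str) -> list:
    # Recompress streams and regenerate object streams for the final output.
    return ["qpdf", "--stream-data=compress", "--object-streams=generate", src, dst]


def run(cmd: list) -> None:
    # Raises CalledProcessError if qpdf exits with a non-zero status.
    subprocess.run(cmd, check=True)
```

With qpdf installed, run(expand_cmd("input.pdf", "expanded.pdf")) would produce the expanded file, and run(compress_cmd("cleaned.pdf", "output.pdf")) would condense the edited result. The intermediate file names here are placeholders.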