Coder Social home page Coder Social logo

pdfshaver's Introduction

This is the repository for the legacy DocumentCloud site, please see the current repository here:

https://github.com/muckrock/documentcloud

______                                      _   _____ _                 _
|  _  \                                    | | /  __ \ |               | |
| | | |___   ___ _   _ _ __ ___   ___ _ __ | |_| /  \/ | ___  _   _  __| |
| | | / _ \ / __| | | | '_ ` _ \ / _ \ '_ \| __| |   | |/ _ \| | | |/ _` |
| |/ / (_) | (__| |_| | | | | | |  __/ | | | |_| \__/\ | (_) | |_| | (_| |
|___/ \___/ \___|\__,_|_| |_| |_|\___|_| |_|\__|\____/_|\___/ \__,_|\__,_|

DocumentCloud is a catalog of primary source documents and a tool for annotating, organizing and publishing them on the web. Documents are contributed by journalists, researchers and archivists.

This codebase contains the entirety of DocumentCloud.org, and pulls together the rest of our open-source projects: Docsplit is used to extract data from incoming documents; that work is parallelized across CloudCrowd; data on the client-side is modeled by Backbone.js, which depends on Underscore.js for all of its abilities; Jammit concatenates and compresses the dozens of CSS and JS files into a single asset package; the NYTimes' Document Viewer displays the documents, while Pixel Ping records the traffic.

If you find a security issue while browsing the source, please email [email protected] to inform us of the problem.

Code contributed to this project is provided under the MIT license (see the LICENSE file). Some components of the project are subject to their own licenses as indicated (see /vendor and /public/javascripts/vendor directories).

pdfshaver's People

Contributors

davidlemayian avatar freedmand avatar knowtheory avatar nathanstitt avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

pdfshaver's Issues

Homebrew formula: tools/clang/scripts/update.py is missing

I'm seeing the following error:

build/gyp_pdfium

Updating projects from gyp files...
/usr/local/Cellar/python/2.7.11/Frameworks/Python.framework/Versions/2.7/Resources/Python.app/Contents/MacOS/Python: can't open file '../tools/clang/scripts/update.py': [Errno 2] No such file or directory
gyp: Call to 'python ../tools/clang/scripts/update.py --print-revision' returned exit status 2 while in /private/tmp/pdfium20151228-46857-htvxjl/build/all.gyp.

I'm not sure how to get tools/clang/scripts/update.py. Googling didn't turn up any clear fix.

License?

I am certain I am just missing it -- but I can't seem to find the license for this?

Benchmarks vs docsplit?

Thanks for putting this gem together. Been testing PDFium and wondering if you guys already came to a conclusion on the performance tradeoff between it and the methods in Docsplit (poppler/etc) for image extraction?

Just wondering before diving in and repeating work.

pdfium_ruby not working under ruby 2.3.1

Just tried to install pdfshaver in the latest xenial distro. Used the default system ruby (now 2.3.1), which worked when 2.3.0 was the ubuntu default. The make seems to work, but here's what I get when I require 'pdfshaver':

pry -e "require 'pdfshaver'" LoadError: /var/lib/gems/2.3.0/gems/pdfshaver-0.0.3/lib/pdfium_ruby.so: undefined symbol: FPDFAPIJPEG_jpeg_destroy_decompress - /var/lib/gems/2.3.0/gems/pdfshaver-0.0.3/lib/pdfium_ruby.so from /var/lib/gems/2.3.0/gems/pdfshaver-0.0.3/lib/pdfshaver.rb:13:inrequire_relative'
`

If feels like the fpdf api library is not linking properly.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.