Coder Social home page Coder Social logo

paper-crawler's Introduction

Paper crawler::bug:

The main function: Grab the doi of the paper in Baidu Academic according to the keywords provided by the user, and download the pdf in sci-hub.:eyes:

Operating conditions of the program

Requires python3 or higher version, request library, and a suitable compiler that can run python programs.

Guide:

The code is in the temp.py file. The program is currently set to crawl all the papers related to keywords in a page. Attention:The picture information of the start button of the program may need to be modified to run normally, and may need to be run before Replacement, the subsequent optimized version is under development... :triangular_flag_on_post:

Operation guide

The GUI of this program is very concise and the operation is also very simple. Users only need to enter the keywords of the papers they want to crawl and the save address of the PDF files.

Follow-up goals:

Continue to optimize the GUI of the program, increase the crawl function of the program, increase the filtering and analysis functions of the crawled content, change the setting of only crawling one page, increase the crawl function of custom pages, etc.:bulb:

Author's Voice:

There are many shortcomings and problems in this program, and there are still many areas that need to be optimized and expanded. If you are interested, join us and follow me and give me a little star. Thank you for your encouragement.:blush::blush::blush:

paper-crawler's People

Contributors

code-aifarmer avatar bladehiker avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.