Coder Social home page Coder Social logo

leogoncha / getmethatpage Goto Github PK

View Code? Open in Web Editor NEW

This project forked from pramodkhare/getmethatpage

0.0 2.0 0.0 9 KB

Works just like webgrabber, downloads all the resources i.e. css, js, images from given webpage and stores all the files in single folder

Java 100.00%

getmethatpage's Introduction

GetMeThatPage

Works just like webgrabber, downloads all the resources i.e. css, js, images from given webpage and stores all the files in single folder.

@author - Pramod Khare

@Purpose - This program will grab the whole web page including all its images, css and js files and stores them in a single output directory with all urls in html page modified to point to this directory itself.

@date: 12-Dec-2013

Important Note - Right now, all the files gets stored inside single destination directory (given directory) with no directory structure as of website is maintained, this is because we are not grabbing all pages or multiple pages of a website, we grab only one webpage. So all external links (same domain links) will be modified in downloaded HTML to refer (look into) current directory for all the resources.

So when you open downloaded HTML in browser it will open with proper css and images applied, even though in original webpage all css and js files were from different folder hierarchy.

Note - make sure you include jsoup.jar in you classpath when you run this program. You can download the jars from jsoup.org - at this time of writing it was - http://jsoup.org/packages/jsoup-1.7.3.jar or any other location.

getmethatpage's People

Contributors

pramodkhare avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.