Topic: internet-archiving
Something interesting about internet-archiving
internet-archiving,Wayback Machine API interface & a command-line tool
User: akamhy
Home Page: https://pypi.org/project/waybackpy/
internet-archiving,🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Organization: archivebox
Home Page: https://archivebox.io
internet-archiving,Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
Organization: archivebox
Home Page: https://chromewebstore.google.com/detail/archivebox-exporter/habonpimjphpdnmcfkaockjnffodikoj
internet-archiving,Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Organization: archivebox
Home Page: https://github.com/ArchiveBox/archivebox-proxy
internet-archiving,Home of the official apt/deb package for Ubuntu/Debian-based systems.
Organization: archivebox
Home Page: https://launchpad.net/~archivebox/+archive/ubuntu/archivebox
internet-archiving,DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
Organization: archivebox
Home Page: https://DigestBox.io
internet-archiving,Home of the official docker image for ArchiveBox
Organization: archivebox
internet-archiving,Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.
Organization: archivebox
Home Page: https://docs.archivebox.io
internet-archiving,Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
Organization: archivebox
Home Page: https://archivebox.io
internet-archiving,😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Organization: archivebox
Home Page: https://archivebox.github.io/good-karma-kit/
internet-archiving,Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
Organization: archivebox
Home Page: https://archivebox.io
internet-archiving,Official Python package for ArchiveBox, the self-hosted internet archiving solution.
Organization: archivebox
Home Page: https://pypi.org/project/archivebox/
internet-archiving,JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.
Organization: archivebox
internet-archiving,Download and archive RSS feeds to the Wayback Machine. Saves a list of archived feeds in a local db.
User: fooftilly
internet-archiving,upload stuff to the Internet Archive using a shell script
User: gabldotink
internet-archiving,Repository for collecting scripts to help capture MyConvento newsroom press-releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.
Organization: httpreserve
internet-archiving,Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻
User: itsliamdowd
internet-archiving,Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
User: mikwielgus
internet-archiving,A suite of tools for mirroring and hoarding web pages you visit for later offline viewing, i.e. your own personal Wayback Machine. It can also archive HTTP POST requests and responses, as well as most other HTTP-level data, and follows an "archive everything now, figure out what to do with it later" philosophy.
Organization: own-data-privateer
Home Page: https://oxij.org/software/pwebarc/
internet-archiving,🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
User: pirate
Home Page: https://pirate.github.io/internet-archiving-talk/
internet-archiving,🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
User: pirate
Home Page: https://docs.sweeting.me/s/self-host-a-wikipedia-mirror
internet-archiving,Submit URLs listed inside a file to website archival services
User: quoorex
internet-archiving,FeedVault is an open-source web application that allows users to archive and search their favorite web feeds.
User: thelovinator1
internet-archiving,Navigator for Web Archive
User: vegetableman
Home Page: https://vegetableman.github.io/vandal/