uga-libraries / web-download Goto Github PK
View Code? Open in Web Editor NEWDownload individual files from saved Archive-It crawls.
License: Creative Commons Attribution Share Alike 4.0 International
Download individual files from saved Archive-It crawls.
License: Creative Commons Attribution Share Alike 4.0 International
When wget returns an error code of 8, it means that a PDF file has been created with the indicated name but that the file was not in fact downloaded. This makes it look like the download was a success, but if the file is clicked on, it cannot be opened. Therefore, the PDF should be deleted if wget has an error code of 8.
PySimpleGUI has added an annual cost. MAGIL confirmed that they can operate the script from the command line instead.
Currently, make_seed_folder() changes the current directory to the seed folder, so PDFs can be saved to the current directory. This has caused problems with other scripts during testing where it is unclear what the directory should be for a valid test.
Include the seed folder as part of the path for saving the PDF so that the current directory is not important.
For now, while we aren't changing the script, having manual testing instructions is sufficient. If we begin more active development, start by making unit tests for all of the functions.
Challenge: the script is accessing the most recent crawl of the websites, so we need to update the test input each time for ones that are using the Archive-It API. The websites would be the same, but the files would not be.
Installing wget on MAGIL workstations was complicated due to them not having administrator access to their machines. Using the python requests or urllib libraries might be simpler.
Script tests if each of the arguments is present and an expected value. If it is possible with threading, doing this in a function would make for simpler code and make it easier to test.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.