Coder Social home page Coder Social logo

wikitools's Introduction

wikitools -- Package for working with MediaWiki wikis

Requirements

  • Python 2.5+. Not compatible with Python 3; not tested on older versions
  • Bob Ippolito's simplejson module, if using Python < 2.6 http://pypi.python.org/pypi/simplejson
  • To upload files or import XML, you need Chris AtLee's poster package http://pypi.python.org/pypi/poster
  • The wiki this is used for should be running at least MediaWiki version 1.13 and have the API enabled.

Installation

  • Run "python setup.py install" or copy the wikitools directory to an appropriate Python module directory.
  • An exe installer for Windows is also available (should be run as an administrator to avoid errors)
  • An RPM for Linux is also available.
  • Arch Linux users may use AUR package for their convenience: https://aur.archlinux.org/packages/python2-wikitools/

Available modules

  • api.py - Contains the APIRequest class, for doing queries directly, see API examples below
  • wiki.py - Contains the Wiki class, used for logging in to the site, storing cookies, and storing basic site information
  • page.py - Contains the Page class for dealing with individual pages on the wiki. Can be used to get page info and text, as well as edit and other actions if enabled on the wiki
  • category.py - Category is a subclass of Page with extra functions for working with categories
  • wikifile.py - File is a subclass of Page with extra functions for working with files - note that there may be some issues with shared repositories, as the pages for files on shared repos technically don't exist on the local wiki.
  • user.py - Contains the User class for getting information about and blocking/unblocking users
  • pagelist.py - Contains several functions for getting a list of Page objects from lists of titles, pageids, or API query results

Further documentation

Current limitations

  • Can only do what the API can do. On a site without the edit-API enabled (disabled by default prior to MediaWiki 1.14), you cannot edit/delete/ protect pages, only retrieve information about them.
  • May have issues with some non-ASCII characters. Most of these bugs should be resolved, though full UTF-8 support is still a little flaky
  • Usage on restricted-access (logged-out users can't read) wikis is mostly untested

Quick start

To make a simple query:

#!/usr/bin/python

from wikitools import wiki
from wikitools import api

# create a Wiki object
site = wiki.Wiki("http://my.wikisite.org/w/api.php") 
# login - required for read-restricted wikis
site.login("username", "password")
# define the params for the query
params = {'action':'query', 'titles':'Main Page'}
# create the request object
request = api.APIRequest(site, params)
# query the API
result = request.query()

The result will look something like:

{u'query':
	{u'pages':
		{u'15580374':
			{u'ns': 0, u'pageid': 15580374, u'title': u'Main Page'}
		}
	}
}

If the API module you need requires a token, you first do something like:

params = { 'action':'query', 'meta':'tokens' }
token = api.APIRequest(site, params).query()['query']['tokens']['csrftoken']
# define the params for the query
params = { 'action':'thank', 'rev':diff, 'token':token }

For most normal usage, you may not have to do API queries yourself and can just use the various classes. For example, to add a template to the top of all the pages in namespace 0 in a category:

#!/usr/bin/python

from wikitools import wiki
from wikitools import category

site = wiki.Wiki("http://my.wikisite.org/w/api.php") 
site.login("username", "password")
# Create object for "Category:Foo"
cat = category.Category(site, "Foo")
# iterate through all the pages in ns 0
for article in cat.getAllMembersGen(namespaces=[0]):
	# edit each page
	article.edit(prependtext="{{template}}\n")

See the MediaWiki API documentation at http://www.mediawiki.org/wiki/API for more information about using the MediaWiki API. You can get an example of what query results will look like by doing the queries in your web browser using the "jsonfm" format option

Licensed under the GNU General Public License, version 3. A copy of the license is included with this release.

Authors

  • Original source code Alex Z. (User:Mr.Z-man @ en.wikipedia) [email protected]
  • Some code/assistance (User:Bjweeks @ en.wikipedia)

wikitools's People

Contributors

alexz-enwp avatar fhocutt avatar hexmode avatar jbzdarkid avatar mzmcbride avatar nemobis avatar ser avatar tjoneslo avatar vit1251 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

wikitools's Issues

add library maintenance status to README

Indicating wikitools' maintenance status would help potential contributors decide where makes the most sense to contribute and would help set accurate expectations for responses.

How to fix "The supplied MD5 hash was incorrect" error?

Getting the following error while uploading a PDF file to https://commons.wikimedia.org

Logged in.
Uploading the file পদ্মাপুরাণ - নারায়ণ দেব.pdf
Traceback (most recent call last):
File "local.py", line 140, in
upload_pdf_file(pdf_file)
File "local.py", line 132, in upload_pdf_file
page.edit(text=wikidata)
File "/usr/local/lib/python2.7/dist-packages/wikitools/page.py", line 623, in edit
result = req.query()
File "/usr/local/lib/python2.7/dist-packages/wikitools/api.py", line 165, in query
raise APIError(data['error']['code'], data['error']['info'])
wikitools.api.APIError: (u'badmd5', u'The supplied MD5 hash was incorrect')

It is reported here
tshrinivasan/tools-for-wiki#17

What may the reason for this?
How can we solve this?

TypeError on certain queries

Greetings! I think I found a bug--

Try this:

import wikitools
file_query = {'action': 'query',
'titles': u'Image:MLK and Malcolm X USNWR cropped.jpg',
'prop': 'imageinfo',
'iiprop': 'extmetadata'}

file_request = wikitools.api.APIRequest(site, file_query)
query_result = file_request.query() # throws TypeError

It has something to do with api.py around line 289-- you use sets to quickly de-dupe some list(s), but this query tries to dedupe a list of dicts in that block of code, throwing a TypeError because dicts are unhashable.

using wikitools with MediaWiki in localhost

Hi,
I don't really get what it means by "my.wikisite.org" when I am using wikitools.wiki, for example,

site = wiki.Wiki("http://my.wikisite.org/w/api.php") 

I am using ubuntu and mediawiki-1.28.0.
I have MediaWiki installed in my local server, and there is api.php in mediawiki-1.28.0 directory. What I want is using this api.php instead of api.php in en.wikipedia.org.
However it seems that url parameter on wiki.Wiki initializer only accept http url.
How can I use local MediaWiki?

Thanks

Please tag release 1.1.1 in the git tree

I think it would be useful to be able to retrieve the content of version 1.1.1 which is available on pypi through a git tag. I think it was the rev. 330 in SVN, but I'm not sure.

Thanks in advance.

Package for Debian/Ubuntu

At Wikimedia Tools in the past we have packaged wikitools basically with the motive to deploy it in "some" way :-). As part of Reduce amount of Tools-local packages, I'll be spending some time on proper Debian packaging for wikitools and some other Python modules.

For that, I have filed an Intent to Package. This issue here mirrors that bug report and serves as a pointer until the packaging is complete.

Also, if someone is using Debian or Ubuntu outside of Wikimedia Tools or knows which code paths are especially important to test, please comment here. Thanks!

Fall back from queryGen() to query() on versions with querycontinue only

Case in point: Wikia runs MediaWiki 1.19. I want to use queryGen() for WikiTeam/wikiteam#311 to be future-proof, but I can't because it fails to pick up query-continue in a query like http://clubpenguin.wikia.com/api.php?action=query&prop=revisions&rvlimit=max&rvprop=ids&titles=Club%20Penguin%20Wiki&format=json:

{"query":{"pages":{"322415":{"pageid":322415,"ns":0,"title":"Club Penguin Wiki","revisions":[{"revid":1848307,"parentid":1839553},{"revid":1839553,"parentid":1820781},{"revid":1820781,"parentid":1819988},{"revid":1819988,"parentid":1819987},{"revid":1819987,"parentid":1818891},{"revid":1818891,"parentid":1818888},{"revid":1818888,"parentid":1775902},{"revid":1775902,"parentid":1775886},{"revid":1775886,"parentid":1775382},{"revid":1775382,"parentid":1775111},{"revid":1775111,"parentid":1775095},{"revid":1775095,"parentid":1774857},{"revid":1774857,"parentid":1774854},{"revid":1774854,"parentid":1760752},{"revid":1760752,"parentid":1759091},{"revid":1759091,"parentid":1759053},{"revid":1759053,"parentid":1759000},{"revid":1759000,"parentid":1758999},{"revid":1758999,"parentid":1758503},{"revid":1758503,"parentid":1758502},{"revid":1758502,"parentid":1746055},{"revid":1746055,"parentid":1699642},{"revid":1699642,"parentid":1699435},{"revid":1699435,"parentid":1693668},{"revid":1693668,"parentid":1661219},{"revid":1661219,"parentid":1658749},{"revid":1658749,"parentid":1646415},{"revid":1646415,"parentid":1623527},{"revid":1623527,"parentid":1611003},{"revid":1611003,"parentid":1583283},{"revid":1583283,"parentid":1546174},{"revid":1546174,"parentid":1379985},{"revid":1379985,"parentid":1287788},{"revid":1287788,"parentid":1287785},{"revid":1287785,"parentid":1256558},{"revid":1256558,"parentid":1243633},{"revid":1243633,"parentid":1164619},{"revid":1164619,"parentid":1128886},{"revid":1128886,"parentid":1033935},{"revid":1033935,"parentid":1026689},{"revid":1026689,"parentid":1017437},{"revid":1017437,"parentid":954007},{"revid":954007,"parentid":953176},{"revid":953176,"parentid":906139},{"revid":906139,"parentid":877864},{"revid":877864,"parentid":864038},{"revid":864038,"parentid":845901},{"revid":845901,"parentid":845898},{"revid":845898,"parentid":792557},{"revid":792557,"parentid":789571},{"revid":789571,"parentid":789566},{"revid":789566,"parentid":781192},{"revid":781192,"parentid":780170},{"revid":780170,"parentid":780167},{"revid":780167,"parentid":780155},{"revid":780155,"parentid":769644},{"revid":769644,"parentid":764426},{"revid":764426,"parentid":761996},{"revid":761996,"parentid":761875},{"revid":761875,"parentid":760284},{"revid":760284,"parentid":757484},{"revid":757484,"parentid":757480},{"revid":757480,"parentid":757479},{"revid":757479,"parentid":738107},{"revid":738107,"parentid":732325},{"revid":732325,"parentid":732290},{"revid":732290,"parentid":724684},{"revid":724684,"parentid":723477},{"revid":723477,"parentid":723363},{"revid":723363,"parentid":721204},{"revid":721204,"parentid":682376},{"revid":682376,"parentid":682373},{"revid":682373,"parentid":678775},{"revid":678775,"parentid":664775},{"revid":664775,"parentid":653761},{"revid":653761,"parentid":620484},{"revid":620484,"parentid":620475},{"revid":620475,"parentid":620473},{"revid":620473,"parentid":620472},{"revid":620472,"parentid":611551},{"revid":611551,"parentid":586355},{"revid":586355,"parentid":576017},{"revid":576017,"parentid":575848},{"revid":575848,"parentid":575708},{"revid":575708,"parentid":575541},{"revid":575541,"parentid":574997},{"revid":574997,"parentid":574202},{"revid":574202,"parentid":574201},{"revid":574201,"parentid":574200},{"revid":574200,"parentid":574196},{"revid":574196,"parentid":574194},{"revid":574194,"parentid":574192},{"revid":574192,"parentid":574184},{"revid":574184,"parentid":574181},{"revid":574181,"parentid":574180},{"revid":574180,"parentid":574177},{"revid":574177,"parentid":574174},{"revid":574174,"parentid":574173},{"revid":574173,"parentid":574172},{"revid":574172,"parentid":574171},{"revid":574171,"parentid":574170},{"revid":574170,"parentid":574168},{"revid":574168,"parentid":574166},{"revid":574166,"parentid":574165},{"revid":574165,"parentid":574164},{"revid":574164,"parentid":574163},{"revid":574163,"parentid":574162},{"revid":574162,"parentid":574161},{"revid":574161,"parentid":574160},{"revid":574160,"parentid":574159},{"revid":574159,"parentid":574158},{"revid":574158,"parentid":574157},{"revid":574157,"parentid":574156},{"revid":574156,"parentid":574155},{"revid":574155,"parentid":574153},{"revid":574153,"parentid":574152},{"revid":574152,"parentid":574150},{"revid":574150,"parentid":574148},{"revid":574148,"parentid":574147},{"revid":574147,"parentid":574146},{"revid":574146,"parentid":574145},{"revid":574145,"parentid":574144},{"revid":574144,"parentid":574143},{"revid":574143,"parentid":574142},{"revid":574142,"parentid":574141},{"revid":574141,"parentid":574140},{"revid":574140,"parentid":574139},{"revid":574139,"parentid":574133},{"revid":574133,"parentid":574132},{"revid":574132,"parentid":574128},{"revid":574128,"parentid":574121},{"revid":574121,"parentid":574035},{"revid":574035,"parentid":574034},{"revid":574034,"parentid":574033},{"revid":574033,"parentid":573811},{"revid":573811,"parentid":573810},{"revid":573810,"parentid":573806},{"revid":573806,"parentid":573802},{"revid":573802,"parentid":573793},{"revid":573793,"parentid":573791},{"revid":573791,"parentid":571555},{"revid":571555,"parentid":571551},{"revid":571551,"parentid":571180},{"revid":571180,"parentid":570619},{"revid":570619,"parentid":570618},{"revid":570618,"parentid":570550},{"revid":570550,"parentid":570480},{"revid":570480,"parentid":569282},{"revid":569282,"parentid":565470},{"revid":565470,"parentid":563969},{"revid":563969,"parentid":562772},{"revid":562772,"parentid":562455},{"revid":562455,"parentid":557875},{"revid":557875,"parentid":557822},{"revid":557822,"parentid":539247},{"revid":539247,"parentid":539240},{"revid":539240,"parentid":539237},{"revid":539237,"parentid":515559},{"revid":515559,"parentid":510764},{"revid":510764,"parentid":508277},{"revid":508277,"parentid":508266},{"revid":508266,"parentid":507361},{"revid":507361,"parentid":507109},{"revid":507109,"parentid":502625},{"revid":502625,"parentid":501895},{"revid":501895,"parentid":501892},{"revid":501892,"parentid":501882},{"revid":501882,"parentid":501881},{"revid":501881,"parentid":501879},{"revid":501879,"parentid":501823},{"revid":501823,"parentid":501822},{"revid":501822,"parentid":498076},{"revid":498076,"parentid":498063},{"revid":498063,"parentid":490417},{"revid":490417,"parentid":486863},{"revid":486863,"parentid":485332},{"revid":485332,"parentid":485331},{"revid":485331,"parentid":484004},{"revid":484004,"parentid":483947},{"revid":483947,"parentid":483946},{"revid":483946,"parentid":483945},{"revid":483945,"parentid":483940},{"revid":483940,"parentid":483934},{"revid":483934,"parentid":483933},{"revid":483933,"parentid":483932},{"revid":483932,"parentid":482742},{"revid":482742,"parentid":481876},{"revid":481876,"parentid":480224},{"revid":480224,"parentid":480223},{"revid":480223,"parentid":478770},{"revid":478770,"parentid":478289},{"revid":478289,"parentid":478288},{"revid":478288,"parentid":477971},{"revid":477971,"parentid":475826},{"revid":475826,"parentid":473220},{"revid":473220,"parentid":473058},{"revid":473058,"parentid":473056},{"revid":473056,"parentid":473041},{"revid":473041,"parentid":437724},{"revid":437724,"parentid":437309},{"revid":437309,"parentid":429923},{"revid":429923,"parentid":420771},{"revid":420771,"parentid":417103},{"revid":417103,"parentid":403727},{"revid":403727,"parentid":403715},{"revid":403715,"parentid":368926},{"revid":368926,"parentid":368845},{"revid":368845,"parentid":368619},{"revid":368619,"parentid":368613},{"revid":368613,"parentid":368567},{"revid":368567,"parentid":368550},{"revid":368550,"parentid":368164},{"revid":368164,"parentid":367637},{"revid":367637,"parentid":350635},{"revid":350635,"parentid":350524},{"revid":350524,"parentid":350520},{"revid":350520,"parentid":350519},{"revid":350519,"parentid":350517},{"revid":350517,"parentid":350516},{"revid":350516,"parentid":350222},{"revid":350222,"parentid":350217},{"revid":350217,"parentid":348061},{"revid":348061,"parentid":339095},{"revid":339095,"parentid":338837},{"revid":338837,"parentid":336813},{"revid":336813,"parentid":336655},{"revid":336655,"parentid":336055},{"revid":336055,"parentid":336035},{"revid":336035,"parentid":336020},{"revid":336020,"parentid":334001},{"revid":334001,"parentid":333951},{"revid":333951,"parentid":333502},{"revid":333502,"parentid":333501},{"revid":333501,"parentid":333500},{"revid":333500,"parentid":333499},{"revid":333499,"parentid":327675},{"revid":327675,"parentid":324410},{"revid":324410,"parentid":324407},{"revid":324407,"parentid":324406},{"revid":324406,"parentid":324402},{"revid":324402,"parentid":324398},{"revid":324401,"parentid":324398},{"revid":324398,"parentid":324393},{"revid":324393,"parentid":324352},{"revid":324352,"parentid":324103},{"revid":324103,"parentid":320868},{"revid":320868,"parentid":314109},{"revid":314109,"parentid":314108},{"revid":314108,"parentid":314072},{"revid":314072,"parentid":313106},{"revid":313106,"parentid":313105},{"revid":313105,"parentid":309757},{"revid":309757,"parentid":301273},{"revid":301273,"parentid":298365},{"revid":298365,"parentid":298203},{"revid":298203,"parentid":298129},{"revid":298129,"parentid":298024},{"revid":298024,"parentid":298020},{"revid":298020,"parentid":298018},{"revid":298018,"parentid":298015},{"revid":298015,"parentid":297855},{"revid":297855,"parentid":297735},{"revid":297735,"parentid":297709},{"revid":297709,"parentid":297064},{"revid":297064,"parentid":296276},{"revid":296276,"parentid":296067},{"revid":296067,"parentid":296065},{"revid":296065,"parentid":295867},{"revid":295867,"parentid":295861},{"revid":295861,"parentid":295858},{"revid":295858,"parentid":295783},{"revid":295783,"parentid":295782},{"revid":295782,"parentid":295780},{"revid":295780,"parentid":295778},{"revid":295778,"parentid":295777},{"revid":295777,"parentid":295774},{"revid":295774,"parentid":295770},{"revid":295770,"parentid":295767},{"revid":295767,"parentid":295739},{"revid":295739,"parentid":295737},{"revid":295737,"parentid":295736},{"revid":295736,"parentid":295732},{"revid":295732,"parentid":295729},{"revid":295729,"parentid":295728},{"revid":295728,"parentid":295721},{"revid":295721,"parentid":295717},{"revid":295717,"parentid":294237},{"revid":294237,"parentid":293717},{"revid":293717,"parentid":293068},{"revid":293068,"parentid":293066},{"revid":293066,"parentid":291766},{"revid":291766,"parentid":291691},{"revid":291691,"parentid":291466},{"revid":291466,"parentid":291380},{"revid":291380,"parentid":291324},{"revid":291324,"parentid":291321},{"revid":291321,"parentid":291319},{"revid":291319,"parentid":291318},{"revid":291318,"parentid":291303},{"revid":291303,"parentid":291292},{"revid":291292,"parentid":291143},{"revid":291143,"parentid":291139},{"revid":291139,"parentid":291136},{"revid":291136,"parentid":291135},{"revid":291135,"parentid":290431},{"revid":290431,"parentid":290430},{"revid":290430,"parentid":290428},{"revid":290428,"parentid":290417},{"revid":290417,"parentid":288896},{"revid":288896,"parentid":288858},{"revid":288858,"parentid":288822},{"revid":288822,"parentid":288821},{"revid":288821,"parentid":287531},{"revid":287531,"parentid":287011},{"revid":287011,"parentid":287010},{"revid":287010,"parentid":287004},{"revid":287004,"parentid":287003},{"revid":287003,"parentid":286941},{"revid":286941,"parentid":286937},{"revid":286937,"parentid":286936},{"revid":286936,"parentid":286387},{"revid":286387,"parentid":286381},{"revid":286381,"parentid":286372},{"revid":286372,"parentid":281874},{"revid":281874,"parentid":281869},{"revid":281869,"parentid":281849},{"revid":281849,"parentid":281393},{"revid":281393,"parentid":281379},{"revid":281379,"parentid":281378},{"revid":281378,"parentid":281368},{"revid":281368,"parentid":273457},{"revid":273457,"parentid":273456},{"revid":273456,"parentid":273453},{"revid":273453,"parentid":271598},{"revid":271598,"parentid":271004},{"revid":271004,"parentid":270627},{"revid":270627,"parentid":270626},{"revid":270626,"parentid":270550},{"revid":270550,"parentid":270549},{"revid":270549,"parentid":270546},{"revid":270546,"parentid":270475},{"revid":270475,"parentid":270472},{"revid":270472,"parentid":269544},{"revid":269544,"parentid":268812},{"revid":268812,"parentid":268785},{"revid":268785,"parentid":267665},{"revid":267665,"parentid":267630},{"revid":267630,"parentid":266852},{"revid":266852,"parentid":266805},{"revid":266805,"parentid":266804},{"revid":266804,"parentid":266244},{"revid":266244,"parentid":266241},{"revid":266241,"parentid":265177},{"revid":265177,"parentid":265158},{"revid":265158,"parentid":265147},{"revid":265147,"parentid":265073},{"revid":265073,"parentid":265070},{"revid":265070,"parentid":261992},{"revid":261992,"parentid":261991},{"revid":261991,"parentid":261460},{"revid":261460,"parentid":261459},{"revid":261459,"parentid":261401},{"revid":261401,"parentid":261293},{"revid":261293,"parentid":261282},{"revid":261282,"parentid":261099},{"revid":261099,"parentid":261012},{"revid":261012,"parentid":261011},{"revid":261011,"parentid":261009},{"revid":261009,"parentid":260993},{"revid":260993,"parentid":260989},{"revid":260989,"parentid":260988},{"revid":260988,"parentid":260788},{"revid":260788,"parentid":260716},{"revid":260716,"parentid":259514},{"revid":259514,"parentid":258945},{"revid":258945,"parentid":257630},{"revid":257630,"parentid":256994},{"revid":256994,"parentid":256823},{"revid":256823,"parentid":253257},{"revid":253257,"parentid":251917},{"revid":251917,"parentid":251864},{"revid":251864,"parentid":251259},{"revid":251259,"parentid":249912},{"revid":249912,"parentid":249201},{"revid":249201,"parentid":249185},{"revid":249185,"parentid":249152},{"revid":249152,"parentid":249140},{"revid":249140,"parentid":249130},{"revid":249130,"parentid":249127},{"revid":249127,"parentid":249114},{"revid":249114,"parentid":249113},{"revid":249113,"parentid":249112},{"revid":249112,"parentid":249111},{"revid":249111,"parentid":249108},{"revid":249108,"parentid":248985},{"revid":248985,"parentid":248961},{"revid":248961,"parentid":248750},{"revid":248750,"parentid":247945},{"revid":247945,"parentid":247944},{"revid":247944,"parentid":247776},{"revid":247776,"parentid":247770},{"revid":247770,"parentid":247589},{"revid":247589,"parentid":247385},{"revid":247385,"parentid":247283},{"revid":247283,"parentid":247277},{"revid":247277,"parentid":247117},{"revid":247117,"parentid":247048},{"revid":247048,"parentid":246980},{"revid":246980,"parentid":246975},{"revid":246975,"parentid":246396},{"revid":246396,"parentid":245936},{"revid":245936,"parentid":245867},{"revid":245867,"parentid":245866},{"revid":245866,"parentid":245861},{"revid":245861,"parentid":245576},{"revid":245576,"parentid":245575},{"revid":245575,"parentid":245574},{"revid":245574,"parentid":245428},{"revid":245428,"parentid":245424},{"revid":245424,"parentid":245197},{"revid":245197,"parentid":245193},{"revid":245193,"parentid":245190},{"revid":245190,"parentid":245165},{"revid":245165,"parentid":245043},{"revid":245043,"parentid":245042},{"revid":245042,"parentid":245040},{"revid":245040,"parentid":245039},{"revid":245039,"parentid":245038},{"revid":245038,"parentid":245037},{"revid":245037,"parentid":245036},{"revid":245036,"parentid":244991},{"revid":244991,"parentid":244464},{"revid":244464,"parentid":244463},{"revid":244463,"parentid":244462},{"revid":244462,"parentid":243871},{"revid":243871,"parentid":243868},{"revid":243868,"parentid":243867},{"revid":243867,"parentid":243866},{"revid":243866,"parentid":243863},{"revid":243863,"parentid":243862},{"revid":243862,"parentid":243484},{"revid":243484,"parentid":243476},{"revid":243476,"parentid":243475},{"revid":243475,"parentid":243472},{"revid":243472,"parentid":243267},{"revid":243267,"parentid":243264},{"revid":243264,"parentid":243204},{"revid":243204,"parentid":243202},{"revid":243202,"parentid":243200},{"revid":243200,"parentid":243199},{"revid":243199,"parentid":243197},{"revid":243197,"parentid":243185},{"revid":243185,"parentid":243184},{"revid":243184,"parentid":243180},{"revid":243180,"parentid":243177},{"revid":243177,"parentid":242616},{"revid":242616,"parentid":241618},{"revid":241618,"parentid":240817},{"revid":240817,"parentid":240816},{"revid":240816,"parentid":240714},{"revid":240714,"parentid":240520},{"revid":240520,"parentid":240519},{"revid":240519,"parentid":240481},{"revid":240481,"parentid":240479},{"revid":240479,"parentid":239504},{"revid":239504,"parentid":239311},{"revid":239311,"parentid":239199},{"revid":239199,"parentid":239194},{"revid":239194,"parentid":239193},{"revid":239193,"parentid":239141},{"revid":239141,"parentid":238770},{"revid":238770,"parentid":237355},{"revid":237355,"parentid":236560},{"revid":236560,"parentid":236557},{"revid":236557,"parentid":236219},{"revid":236219,"parentid":235449},{"revid":235449,"parentid":235410},{"revid":235410,"parentid":234860},{"revid":234860,"parentid":234842},{"revid":234842,"parentid":233240},{"revid":233240,"parentid":232578},{"revid":232578,"parentid":232577},{"revid":232577,"parentid":232470},{"revid":232470,"parentid":232360},{"revid":232360,"parentid":232180},{"revid":232180,"parentid":232178},{"revid":232178,"parentid":232176},{"revid":232176,"parentid":232175},{"revid":232175,"parentid":232172},{"revid":232172,"parentid":232169},{"revid":232169,"parentid":232165},{"revid":232165,"parentid":232114},{"revid":232114,"parentid":232113},{"revid":232113,"parentid":232109},{"revid":232109,"parentid":231996},{"revid":231996,"parentid":231537},{"revid":231537,"parentid":231525},{"revid":231525,"parentid":231271},{"revid":231271,"parentid":230656},{"revid":230656,"parentid":230575}]}}},"limits":{"revisions":500},"query-continue":{"revisions":{"rvstartid":230575}}}

I expected wikitools to fall back to the "correct" method depending on what's available.

Publish an updated version on pypi

The code seems to have evolved a bit since the last published release on pypi.

I found a problematic bug with the User-Agent header, for instance, which is fixed now, so an updated release wouldn't mind, I think.

Thanks in advance.

Add suppport for logging HTTP requests

When things go wrong, it helps to be able to debug the HTTP requests taking place.

I think it would be great to have a mean to set a debug flag so as to get a log.

For the moment, I've patched the code to get
self.opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(wiki.cookies), urllib2.HTTPSHandler(debuglevel=1))
in api.py

Hope this helps.

Cant delete pages

When tried to use delete() method on a wiki page,

page.delete(pagename, reason="spam")
getting the following error.

File "delete-pages.py", line 118, in
delete_page(page_name)
File "delete-pages.py", line 101, in delete_page
page.delete(pagename, reason = "spam")
TypeError: delete() got multiple values for keyword argument 'reason'

I tried giving various values for reason including True and False.

Getting same error.

Please guide me on how to delete the pages.

Thanks.

Problem connecting with https-site

from wikitools import *
s = wiki.Wiki('https://aquanautweb.de/testwiki/api.php')

results in

==== RESTART: C:\Users\XXXXXXX\test_wikitools.py ====
URLError: <urlopen error EOF occurred in violation of protocol (_ssl.c:590)> trying request again in 5 seconds
URLError: <urlopen error EOF occurred in violation of protocol (_ssl.c:590)> trying request again in 10 seconds

What can be the problem?

Py3 branch can not be installed with setuptools (dependency_links, install_requires), only through requirements.txt, due to version overload

In a Python 3 project, I used to install wikitools by adding a line in requirements.txt pointing to the py3 branch looking this this:

-e git+https://github.com/alexz-enwp/wikitools.git@py3#egg=wikitools

This works fine for development purposes. However, for releasing the package, it is recommended to define dependencies in setup.py instead, as Donald Stufft (the core developer of PyPI) explains in this blog post. However, in a nutshell, a setup.py in of the following form will install the Python 2 version from PyPI instead of the Python 3 version from GitHub:

from setuptools import setup
setup(
    dependency_links = [
        'https://github.com/alexz-enwp/wikitools/tarball/py3#egg=wikitools_py3-1.2'
        ],
    install_requires = [
        'wikitools==1.2'
        ]
    )

The problem is that wikitools is already available in version 1.2 (and 1.3) on PyPI, so pip will give it priority. The py3 branch contains something labeled as version 1.2, which will therefore be ignored by pip.

The problem can be solved in three ways:

  • The wikitoolsmodule for Python 3 is renamed into something like wikitools_py3. Without starting a new repository, this can be achieved by simply changing the name parameter in wikitools' own setup.py in the existing py3 branch. EDIT: It actually requires a lot of changes, more than I thought, see pull request, so the next option might be a lot easier.
  • Alternatively (temporarily) the version parameter in wikitools' own setup.py in the py3 branch can be pumped to something higher than what is available on PyPI, e.g. 1.4 (as of today's latest version on PyPI being 1.3).
  • The ideal (theoretical) solution would be if wikitools had a common codebase for supporting both Python 2 and Python 3, which wont be trivial ...

For reference, two related discussions on StackOverflow:

is project completely deprecated now ?

Hi Alex,

I noticed for the first time this error:

from wikitools import wiki
from wikitools import api

wiki.Wiki("https://en.wikipedia.org/w/api.php") 

# error:
HTTPError: HTTP Error 400: Browser Connection Security Warning trying request again in 5 seconds

Notice I am just running from a unix console in my local environment, and from a virtual environment with python 2.7.

It is the first time I am having this issue.

Please let me know if I have to completely replace the use of the library with others, like requests, or if it just a temporary issue...

Thank you and Best wishes for new year Alex!!

Add the ability to use Pywikibot credentials

If I'm writing a script that uses both PWB and wikitools, I shouldn't have to type in my credentials for wikitools - I should be able to just pass my PWB wiki into wikitools.

(This might not be technically feasible, but boy would it be awesome.)

Support: Is it possible to log in (getting an edit key), using wikitools?

Is it possible to log in (getting an edit key), so that I can edit pages on a restricted-access site?

When I try to run the second example in the README file, I get:

Traceback (most recent call last):
  File "create_gene_pages.py", line 5, in <module>
    cat = category.Category(site, "Foo")
  File "/home/samuel/.pyenv/versions/2.7.6/lib/python2.7/site-packages/wikitools/category.py", line 34, in __init__
    page.Page.__init__(self, site=site, title=title, check=check, followRedir=followRedir, section=section, sectionnumber=sectionnumber, pageid=pageid)
  File "/home/samuel/.pyenv/versions/2.7.6/lib/python2.7/site-packages/wikitools/page.py", line 108, in __init__
    self.setPageInfo()
  File "/home/samuel/.pyenv/versions/2.7.6/lib/python2.7/site-packages/wikitools/page.py", line 141, in setPageInfo
    response = req.query()
  File "/home/samuel/.pyenv/versions/2.7.6/lib/python2.7/site-packages/wikitools/api.py", line 154, in query
    raise APIError(data['error']['code'], data['error']['info'])
wikitools.api.APIError: (u'readapidenied', u'You need read permission to use this module')

import wikitools ImportError: No module named wikitools

wikisource@wikisource-Inspiron-3542:$ cd Desktop
wikisource@wikisource-Inspiron-3542:
/Desktop$ cd /OCR4wikisource-master
wikisource@wikisource-Inspiron-3542:
/OCR4wikisource-master$ python do_ocr.py
INFO:main:Running do_ocr.py 1.54
INFO:root:Operating System = "Ubuntu 16.04.4 LTS"

INFO:main:URL = https://upload.wikimedia.org/wikipedia/commons/d/dd/Alochana_Magazine_October_1958.pdf
INFO:main:Columns = 1
INFO:main:Wiki Username = Gurlal (Bot)
INFO:main:Wiki Password = Not logging the password
INFO:main:Wiki Source Language Code = pa
INFO:main:Keep Temp folder in Google Drive = no
INFO:main:Original URL = https://upload.wikimedia.org/wikipedia/commons/d/dd/Alochana_Magazine_October_1958.pdf
INFO:main:File Name = Alochana_Magazine_October_1958.pdf
INFO:main:File Type = pdf
INFO:main:Created Temp folder OCR-Alochana_Magazine_October_1958.pdf-temp-2018-04-27-22-46-52
INFO:root:Alochana_Magazine_October_1958.pdf Already Exists. Skipping the download.
INFO:main:Aligining the Pages of PDF file.

INFO:main:Running mutool poster -x 1 "Alochana_Magazine_October_1958.pdf" currentfile.pdf
INFO:main:Spliting the PDF into single pages.

INFO:main:Running pdftk currentfile.pdf burst
INFO:main:Joining the PDF files ...

INFO:main:Running Command cp pg_0001.pdf page_00001.pdf
INFO:main:Running Command cp pg_0002.pdf page_00002.pdf
INFO:main:Running Command cp pg_0003.pdf page_00003.pdf
INFO:main:Running Command cp pg_0004.pdf page_00004.pdf
INFO:main:Running Command cp pg_0005.pdf page_00005.pdf
INFO:main:Running Command cp pg_0006.pdf page_00006.pdf
INFO:main:Running Command cp pg_0007.pdf page_00007.pdf
INFO:main:Running Command cp pg_0008.pdf page_00008.pdf
INFO:main:Running Command cp pg_0009.pdf page_00009.pdf
INFO:main:Running Command cp pg_0010.pdf page_00010.pdf
INFO:main:Running Command cp pg_0011.pdf page_00011.pdf
INFO:main:Running Command cp pg_0012.pdf page_00012.pdf
INFO:main:Running Command cp pg_0013.pdf page_00013.pdf
INFO:main:Running Command cp pg_0014.pdf page_00014.pdf
INFO:main:Running Command cp pg_0015.pdf page_00015.pdf
INFO:main:Running Command cp pg_0016.pdf page_00016.pdf
INFO:main:Running Command cp pg_0017.pdf page_00017.pdf
INFO:main:Running Command cp pg_0018.pdf page_00018.pdf
INFO:main:Running Command cp pg_0019.pdf page_00019.pdf
INFO:main:Running Command cp pg_0020.pdf page_00020.pdf
INFO:main:Running Command cp pg_0021.pdf page_00021.pdf
INFO:main:Running Command cp pg_0022.pdf page_00022.pdf
INFO:main:Running Command cp pg_0023.pdf page_00023.pdf
INFO:main:Running Command cp pg_0024.pdf page_00024.pdf
INFO:main:Running Command cp pg_0025.pdf page_00025.pdf
INFO:main:Running Command cp pg_0026.pdf page_00026.pdf
INFO:main:Running Command cp pg_0027.pdf page_00027.pdf
INFO:main:Running Command cp pg_0028.pdf page_00028.pdf
INFO:main:Running Command cp pg_0029.pdf page_00029.pdf
INFO:main:Running Command cp pg_0030.pdf page_00030.pdf
INFO:main:Running Command cp pg_0031.pdf page_00031.pdf
INFO:main:Running Command cp pg_0032.pdf page_00032.pdf
INFO:main:Running Command cp pg_0033.pdf page_00033.pdf
INFO:main:Running Command cp pg_0034.pdf page_00034.pdf
INFO:main:Running Command cp pg_0035.pdf page_00035.pdf
INFO:main:Running Command cp pg_0036.pdf page_00036.pdf
INFO:main:Running Command cp pg_0037.pdf page_00037.pdf
INFO:main:Running Command cp pg_0038.pdf page_00038.pdf
INFO:main:Running Command cp pg_0039.pdf page_00039.pdf
INFO:main:Running Command cp pg_0040.pdf page_00040.pdf
INFO:main:Running Command cp pg_0041.pdf page_00041.pdf
INFO:main:Running Command cp pg_0042.pdf page_00042.pdf
INFO:main:Running Command cp pg_0043.pdf page_00043.pdf
INFO:main:Running Command cp pg_0044.pdf page_00044.pdf
INFO:main:Running Command cp pg_0045.pdf page_00045.pdf
INFO:main:Running Command cp pg_0046.pdf page_00046.pdf
INFO:main:Running Command cp pg_0047.pdf page_00047.pdf
INFO:main:Running Command cp pg_0048.pdf page_00048.pdf
INFO:main:Running Command cp pg_0049.pdf page_00049.pdf
INFO:main:Running Command cp pg_0050.pdf page_00050.pdf
INFO:main:Running Command cp pg_0051.pdf page_00051.pdf
INFO:main:Running Command cp pg_0052.pdf page_00052.pdf
INFO:main:Running Command cp pg_0053.pdf page_00053.pdf
INFO:main:Running Command cp pg_0054.pdf page_00054.pdf
INFO:main:Running Command cp pg_0055.pdf page_00055.pdf
INFO:main:Running Command cp pg_0056.pdf page_00056.pdf
INFO:main:Running Command cp pg_0057.pdf page_00057.pdf
INFO:main:Running Command cp pg_0058.pdf page_00058.pdf
INFO:main:Running Command cp pg_0059.pdf page_00059.pdf
INFO:main:Running Command cp pg_0060.pdf page_00060.pdf
INFO:main:Running Command cp pg_0061.pdf page_00061.pdf
INFO:main:Running Command cp pg_0062.pdf page_00062.pdf
INFO:main:Running Command cp pg_0063.pdf page_00063.pdf
INFO:main:Running Command cp pg_0064.pdf page_00064.pdf
INFO:main:
Creating a folder in Google Drive to upload files. Folder Name : OCR-Alochana_Magazine_October_1958.pdf-temp-2018-04-27-22-46-52

INFO:main:Running gdmkdir.py "OCR-Alochana_Magazine_October_1958.pdf-temp-2018-04-27-22-46-52" | tee folder_in_google_drive.log
id: 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t
drive view: https://drive.google.com/drive/folders/167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t
folder view: https://drive.google.com/drive/folders/167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t
INFO:main:

uploading page_00001.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00001.pdf | tee page_00001.log
Uploading file: page_00001.pdf
File size: 608.0KB
Completed!
id: 10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA
drive url: https://docs.google.com/document/d/10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=10G6TKHUMz2wynbGBqz_uv8ZrrG_6zPf1YqH2EUY6pzA&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00001.pdf | tee page_00001.log

File location: /home/wikisource/OCR4wikisource-master/page_00001.txt
File size in bytes: 447
INFO:main:
Creating temp file touch page_00001.upload

INFO:main:

========

INFO:main:

uploading page_00002.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00002.pdf | tee page_00002.log
Uploading file: page_00002.pdf
File size: 274.1KB
id: 1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM
drive url: https://docs.google.com/document/d/1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1N24wzlhvUrZsgHKand_LZx0twBMIkW7l97mcR9SglWM&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00002.pdf | tee page_00002.log

File location: /home/wikisource/OCR4wikisource-master/page_00002.txt
File size in bytes: 1026
INFO:main:
Creating temp file touch page_00002.upload

INFO:main:

========

INFO:main:

uploading page_00003.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00003.pdf | tee page_00003.log
Uploading file: page_00003.pdf
File size: 720.8KB
Completed!
id: 1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds
drive url: https://docs.google.com/document/d/1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1JeMIz-dFCk3CAhXQA0Lar3fUuJuIGMrX0n2BMnTrBds&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00003.pdf | tee page_00003.log

File location: /home/wikisource/OCR4wikisource-master/page_00003.txt
File size in bytes: 3586
INFO:main:
Creating temp file touch page_00003.upload

INFO:main:

========

INFO:main:

uploading page_00004.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00004.pdf | tee page_00004.log
Uploading file: page_00004.pdf
File size: 897.5KB
Completed!
id: 1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU
drive url: https://docs.google.com/document/d/1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1Tt0llBp2uFCmIeQ3uzi56sJ-Dncp6O69AxazqUuTJvU&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00004.pdf | tee page_00004.log

File location: /home/wikisource/OCR4wikisource-master/page_00004.txt
File size in bytes: 4621
INFO:main:
Creating temp file touch page_00004.upload

INFO:main:

========

INFO:main:

uploading page_00005.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00005.pdf | tee page_00005.log
Uploading file: page_00005.pdf
File size: 829.7KB
Completed!
id: 1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY
drive url: https://docs.google.com/document/d/1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1mJAskagjGc4QlD1QCafbYzRscvQBMWzeewXr93FfFwY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00005.pdf | tee page_00005.log

File location: /home/wikisource/OCR4wikisource-master/page_00005.txt
File size in bytes: 4287
INFO:main:
Creating temp file touch page_00005.upload

INFO:main:

========

INFO:main:

uploading page_00006.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00006.pdf | tee page_00006.log
Uploading file: page_00006.pdf
File size: 783.8KB
Completed!
id: 14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc
drive url: https://docs.google.com/document/d/14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=14z6hvfWusT6pGP-EpZNwEpsVhHGA1ikVc6K4KXcaTMc&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00006.pdf | tee page_00006.log

File location: /home/wikisource/OCR4wikisource-master/page_00006.txt
File size in bytes: 3687
INFO:main:
Creating temp file touch page_00006.upload

INFO:main:

========

INFO:main:

uploading page_00007.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00007.pdf | tee page_00007.log
Uploading file: page_00007.pdf
File size: 811.1KB
Completed!
id: 1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM
drive url: https://docs.google.com/document/d/1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1phTVJB5KI_3Vc8Hm2KPB0mu4MWEsxwH4RgclmU8pAXM&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00007.pdf | tee page_00007.log

File location: /home/wikisource/OCR4wikisource-master/page_00007.txt
File size in bytes: 3956
INFO:main:
Creating temp file touch page_00007.upload

INFO:main:

========

INFO:main:

uploading page_00008.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00008.pdf | tee page_00008.log
Uploading file: page_00008.pdf
File size: 912.1KB
Completed!
id: 1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o
drive url: https://docs.google.com/document/d/1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1Ztr62JRjt3vClazoXNlQxelSSwKbMf0O652OmLnKQ7o&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00008.pdf | tee page_00008.log

File location: /home/wikisource/OCR4wikisource-master/page_00008.txt
File size in bytes: 4596
INFO:main:
Creating temp file touch page_00008.upload

INFO:main:

========

INFO:main:

uploading page_00009.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00009.pdf | tee page_00009.log
Uploading file: page_00009.pdf
File size: 762.2KB
Completed!
id: 1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI
drive url: https://docs.google.com/document/d/1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1gtYYJUC76LfqG19mCtGepprR0CPUoeL5Bzsf2NvT3gI&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00009.pdf | tee page_00009.log

File location: /home/wikisource/OCR4wikisource-master/page_00009.txt
File size in bytes: 3857
INFO:main:
Creating temp file touch page_00009.upload

INFO:main:

========

INFO:main:

uploading page_00010.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00010.pdf | tee page_00010.log
Uploading file: page_00010.pdf
File size: 646.3KB
Completed!
id: 1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0
drive url: https://docs.google.com/document/d/1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1whmytr_UOwhMiS2TZqcdlwCrGP2W_ZVXsvI6l_Rt5O0&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00010.pdf | tee page_00010.log

File location: /home/wikisource/OCR4wikisource-master/page_00010.txt
File size in bytes: 2573
INFO:main:
Creating temp file touch page_00010.upload

INFO:main:

========

INFO:main:

uploading page_00011.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00011.pdf | tee page_00011.log
Uploading file: page_00011.pdf
File size: 686.4KB
Completed!
id: 1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY
drive url: https://docs.google.com/document/d/1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1yC0RtMUXr-OFGEwm5h1TqzBOUVr6XTdBDNLNGo3YURY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00011.pdf | tee page_00011.log

File location: /home/wikisource/OCR4wikisource-master/page_00011.txt
File size in bytes: 3273
INFO:main:
Creating temp file touch page_00011.upload

INFO:main:

========

INFO:main:

uploading page_00012.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00012.pdf | tee page_00012.log
Uploading file: page_00012.pdf
File size: 888.0KB
Completed!
id: 1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M
drive url: https://docs.google.com/document/d/1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1ITZhNqyvdGr31mtaJvMhY1BNHDoO-vOTSCjhpdFHF8M&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00012.pdf | tee page_00012.log

File location: /home/wikisource/OCR4wikisource-master/page_00012.txt
File size in bytes: 4493
INFO:main:
Creating temp file touch page_00012.upload

INFO:main:

========

INFO:main:

uploading page_00013.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00013.pdf | tee page_00013.log
Uploading file: page_00013.pdf
File size: 807.2KB
Completed!
id: 1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys
drive url: https://docs.google.com/document/d/1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1WuniJ74q11Hylh0zIyGMh14_mIjqk_MdbSZv0Wl7-ys&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00013.pdf | tee page_00013.log

File location: /home/wikisource/OCR4wikisource-master/page_00013.txt
File size in bytes: 4004
INFO:main:
Creating temp file touch page_00013.upload

INFO:main:

========

INFO:main:

uploading page_00014.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00014.pdf | tee page_00014.log
Uploading file: page_00014.pdf
File size: 430.3KB
id: 1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20
drive url: https://docs.google.com/document/d/1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1V7KrDMysinPU12qnlNKbyWt-kyJD1oDcJzgemkZjo20&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00014.pdf | tee page_00014.log

File location: /home/wikisource/OCR4wikisource-master/page_00014.txt
File size in bytes: 1557
INFO:main:
Creating temp file touch page_00014.upload

INFO:main:

========

INFO:main:

uploading page_00015.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00015.pdf | tee page_00015.log
Uploading file: page_00015.pdf
File size: 863.1KB
Completed!
id: 1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc
drive url: https://docs.google.com/document/d/1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1MmIBHE1uuSGjQhBmvtESQMcGDO9puwpeFxvNf1LvJxc&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00015.pdf | tee page_00015.log

File location: /home/wikisource/OCR4wikisource-master/page_00015.txt
File size in bytes: 4132
INFO:main:
Creating temp file touch page_00015.upload

INFO:main:

========

INFO:main:

uploading page_00016.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00016.pdf | tee page_00016.log
Uploading file: page_00016.pdf
File size: 862.2KB
Completed!
id: 1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94
drive url: https://docs.google.com/document/d/1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1H_VbABP9RbFZWzkZF_AXzc4zS0XJGzuxMjv1t3Abr94&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00016.pdf | tee page_00016.log

File location: /home/wikisource/OCR4wikisource-master/page_00016.txt
File size in bytes: 4023
INFO:main:
Creating temp file touch page_00016.upload

INFO:main:

========

INFO:main:

uploading page_00017.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00017.pdf | tee page_00017.log
Uploading file: page_00017.pdf
File size: 869.2KB
Completed!
id: 1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U
drive url: https://docs.google.com/document/d/1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1nBwb7kxsOLfHE8E3gTEcsj0nDWhZeUSCpQafKUqfp0U&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00017.pdf | tee page_00017.log

File location: /home/wikisource/OCR4wikisource-master/page_00017.txt
File size in bytes: 4218
INFO:main:
Creating temp file touch page_00017.upload

INFO:main:

========

INFO:main:

uploading page_00018.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00018.pdf | tee page_00018.log
Uploading file: page_00018.pdf
File size: 703.2KB
Completed!
id: 1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70
drive url: https://docs.google.com/document/d/1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1hv3OnzN5A9yiIR60i0EbmV4n-aTlg5iXRhoTQihtq70&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00018.pdf | tee page_00018.log

File location: /home/wikisource/OCR4wikisource-master/page_00018.txt
File size in bytes: 3287
INFO:main:
Creating temp file touch page_00018.upload

INFO:main:

========

INFO:main:

uploading page_00019.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00019.pdf | tee page_00019.log
Uploading file: page_00019.pdf
File size: 853.7KB
Completed!
id: 1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I
drive url: https://docs.google.com/document/d/1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1JOjpgT2gxwOEqT_V0_0_h9fu4w9jbxiPEWvgaqFQl4I&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00019.pdf | tee page_00019.log

File location: /home/wikisource/OCR4wikisource-master/page_00019.txt
File size in bytes: 4535
INFO:main:
Creating temp file touch page_00019.upload

INFO:main:

========

INFO:main:

uploading page_00020.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00020.pdf | tee page_00020.log
Uploading file: page_00020.pdf
File size: 872.6KB
Completed!
id: 19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk
drive url: https://docs.google.com/document/d/19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=19ZV_NLBBY8TaMNr3JrkMoc_AHj7bwwV5e9muWLRCqXk&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00020.pdf | tee page_00020.log

File location: /home/wikisource/OCR4wikisource-master/page_00020.txt
File size in bytes: 4319
INFO:main:
Creating temp file touch page_00020.upload

INFO:main:

========

INFO:main:

uploading page_00021.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00021.pdf | tee page_00021.log
Uploading file: page_00021.pdf
File size: 851.6KB
Completed!
id: 1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw
drive url: https://docs.google.com/document/d/1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1mVCpbyYEtJ2A9MdbU7rpBr9xClYaaX7ghEL3wrXtIsw&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00021.pdf | tee page_00021.log

File location: /home/wikisource/OCR4wikisource-master/page_00021.txt
File size in bytes: 4240
INFO:main:
Creating temp file touch page_00021.upload

INFO:main:

========

INFO:main:

uploading page_00022.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00022.pdf | tee page_00022.log
Uploading file: page_00022.pdf
File size: 849.1KB
Completed!
id: 1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds
drive url: https://docs.google.com/document/d/1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1VeqRf5rZw4m22VNx9UBduIBsZib9MfIkMF0ZFSylzds&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00022.pdf | tee page_00022.log

File location: /home/wikisource/OCR4wikisource-master/page_00022.txt
File size in bytes: 4228
INFO:main:
Creating temp file touch page_00022.upload

INFO:main:

========

INFO:main:

uploading page_00023.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00023.pdf | tee page_00023.log
Uploading file: page_00023.pdf
File size: 866.5KB
Completed!
id: 1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo
drive url: https://docs.google.com/document/d/1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1-shSd7suF7i5nDy73b-JLASBguDqEI0_GB2QEJqeImo&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00023.pdf | tee page_00023.log

File location: /home/wikisource/OCR4wikisource-master/page_00023.txt
File size in bytes: 4322
INFO:main:
Creating temp file touch page_00023.upload

INFO:main:

========

INFO:main:

uploading page_00024.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00024.pdf | tee page_00024.log
Uploading file: page_00024.pdf
File size: 896.2KB
Completed!
id: 13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8
drive url: https://docs.google.com/document/d/13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=13ImOeSFHGq15bGSHO5zR0LB1C-H4IseWMIzyRLasyO8&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00024.pdf | tee page_00024.log

File location: /home/wikisource/OCR4wikisource-master/page_00024.txt
File size in bytes: 4249
INFO:main:
Creating temp file touch page_00024.upload

INFO:main:

========

INFO:main:

uploading page_00025.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00025.pdf | tee page_00025.log
Uploading file: page_00025.pdf
File size: 893.7KB
Completed!
id: 13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk
drive url: https://docs.google.com/document/d/13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=13crZnvb26Ve3AQG9ITZSwfosGToUWOSWWITtpzjcZYk&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00025.pdf | tee page_00025.log

File location: /home/wikisource/OCR4wikisource-master/page_00025.txt
File size in bytes: 4419
INFO:main:
Creating temp file touch page_00025.upload

INFO:main:

========

INFO:main:

uploading page_00026.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00026.pdf | tee page_00026.log
Uploading file: page_00026.pdf
File size: 894.5KB
Completed!
id: 1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY
drive url: https://docs.google.com/document/d/1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1iGCEafd-DN2-4V7BbSEfnlHL82OesLHGhKNPdj4OtrY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00026.pdf | tee page_00026.log

File location: /home/wikisource/OCR4wikisource-master/page_00026.txt
File size in bytes: 4421
INFO:main:
Creating temp file touch page_00026.upload

INFO:main:

========

INFO:main:

uploading page_00027.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00027.pdf | tee page_00027.log
Uploading file: page_00027.pdf
File size: 877.7KB
Completed!
id: 1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw
drive url: https://docs.google.com/document/d/1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1nhRDb7MnareaClDuFI1ho4Mov2JUSrBHjLTJsjK9Ifw&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00027.pdf | tee page_00027.log

File location: /home/wikisource/OCR4wikisource-master/page_00027.txt
File size in bytes: 4350
INFO:main:
Creating temp file touch page_00027.upload

INFO:main:

========

INFO:main:

uploading page_00028.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00028.pdf | tee page_00028.log
Uploading file: page_00028.pdf
File size: 864.0KB
Completed!
id: 1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY
drive url: https://docs.google.com/document/d/1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1kkj8Uqa_zMj4zXVpHB9Pk6aesdwYILNukJ_EQyE_NLY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00028.pdf | tee page_00028.log

File location: /home/wikisource/OCR4wikisource-master/page_00028.txt
File size in bytes: 4198
INFO:main:
Creating temp file touch page_00028.upload

INFO:main:

========

INFO:main:

uploading page_00029.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00029.pdf | tee page_00029.log
Uploading file: page_00029.pdf
File size: 865.1KB
Completed!
id: 1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI
drive url: https://docs.google.com/document/d/1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1IBnY_6XC15GreskGabEJ_DDCvGPGMaBVMrmkN1eXwZI&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00029.pdf | tee page_00029.log

File location: /home/wikisource/OCR4wikisource-master/page_00029.txt
File size in bytes: 4332
INFO:main:
Creating temp file touch page_00029.upload

INFO:main:

========

INFO:main:

uploading page_00030.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00030.pdf | tee page_00030.log
Uploading file: page_00030.pdf
File size: 643.2KB
Completed!
id: 1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0
drive url: https://docs.google.com/document/d/1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1arKayQl7vnhHGyBiRGZ3IBlqeNJjQkWwun3qStXI7Y0&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00030.pdf | tee page_00030.log

File location: /home/wikisource/OCR4wikisource-master/page_00030.txt
File size in bytes: 3068
INFO:main:
Creating temp file touch page_00030.upload

INFO:main:

========

INFO:main:

uploading page_00031.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00031.pdf | tee page_00031.log
Uploading file: page_00031.pdf
File size: 704.8KB
Completed!
id: 1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg
drive url: https://docs.google.com/document/d/1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1Es7uzETXmp9eRVHbbpzr4uJSXfBfAw_uaG2elzH_5bg&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00031.pdf | tee page_00031.log

File location: /home/wikisource/OCR4wikisource-master/page_00031.txt
File size in bytes: 3309
INFO:main:
Creating temp file touch page_00031.upload

INFO:main:

========

INFO:main:

uploading page_00032.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00032.pdf | tee page_00032.log
Uploading file: page_00032.pdf
File size: 921.4KB
Completed!
id: 1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E
drive url: https://docs.google.com/document/d/1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1gK4HjraEBWCrq5qrkDp3GhncxWtz6UatoT8yuzgw10E&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00032.pdf | tee page_00032.log

File location: /home/wikisource/OCR4wikisource-master/page_00032.txt
File size in bytes: 4429
INFO:main:
Creating temp file touch page_00032.upload

INFO:main:

========

INFO:main:

uploading page_00033.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00033.pdf | tee page_00033.log
Uploading file: page_00033.pdf
File size: 891.0KB
Completed!
id: 1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8
drive url: https://docs.google.com/document/d/1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1dX6dtfy6n483lTys998iTXlxTc6VV9IUtzlY6h2LPm8&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00033.pdf | tee page_00033.log

File location: /home/wikisource/OCR4wikisource-master/page_00033.txt
File size in bytes: 4450
INFO:main:
Creating temp file touch page_00033.upload

INFO:main:

========

INFO:main:

uploading page_00034.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00034.pdf | tee page_00034.log
Uploading file: page_00034.pdf
File size: 897.4KB
Completed!
id: 196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4
drive url: https://docs.google.com/document/d/196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=196OXre0kjSzisRX776gWMiFGbG36dPJwz56XBxcwrL4&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00034.pdf | tee page_00034.log

File location: /home/wikisource/OCR4wikisource-master/page_00034.txt
File size in bytes: 4127
INFO:main:
Creating temp file touch page_00034.upload

INFO:main:

========

INFO:main:

uploading page_00035.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00035.pdf | tee page_00035.log
Uploading file: page_00035.pdf
File size: 894.5KB
Completed!
id: 1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54
drive url: https://docs.google.com/document/d/1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1IgDyyE8yFuBAH6xIQvku6yw2mXpKSMRIZJ0U0XfUI54&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00035.pdf | tee page_00035.log

File location: /home/wikisource/OCR4wikisource-master/page_00035.txt
File size in bytes: 4598
INFO:main:
Creating temp file touch page_00035.upload

INFO:main:

========

INFO:main:

uploading page_00036.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00036.pdf | tee page_00036.log
Uploading file: page_00036.pdf
File size: 719.8KB
Completed!
id: 1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58
drive url: https://docs.google.com/document/d/1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1tK8ruOb_fFDoMxwxs_fNyW8rN7eQwliIvd3YLiu8k58&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00036.pdf | tee page_00036.log

File location: /home/wikisource/OCR4wikisource-master/page_00036.txt
File size in bytes: 3480
INFO:main:
Creating temp file touch page_00036.upload

INFO:main:

========

INFO:main:

uploading page_00037.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00037.pdf | tee page_00037.log
Uploading file: page_00037.pdf
File size: 683.4KB
Completed!
id: 1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM
drive url: https://docs.google.com/document/d/1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1rnCfFReXTYGwsf4PIyL9nSZUmrextPf-5dXwGaMp7iM&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00037.pdf | tee page_00037.log

File location: /home/wikisource/OCR4wikisource-master/page_00037.txt
File size in bytes: 3375
INFO:main:
Creating temp file touch page_00037.upload

INFO:main:

========

INFO:main:

uploading page_00038.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00038.pdf | tee page_00038.log
Uploading file: page_00038.pdf
File size: 913.7KB
Completed!
id: 1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98
drive url: https://docs.google.com/document/d/1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1rsvrCNgCyaao53sigKqJU6RuKAj9Z9orXI8KwNZGD98&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00038.pdf | tee page_00038.log

File location: /home/wikisource/OCR4wikisource-master/page_00038.txt
File size in bytes: 4463
INFO:main:
Creating temp file touch page_00038.upload

INFO:main:

========

INFO:main:

uploading page_00039.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00039.pdf | tee page_00039.log
Uploading file: page_00039.pdf
File size: 732.9KB
Completed!
id: 18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0
drive url: https://docs.google.com/document/d/18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=18-aYbMkUlped9f6HGxnYc_tlDS3X7fSoOooalHIDKw0&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00039.pdf | tee page_00039.log

File location: /home/wikisource/OCR4wikisource-master/page_00039.txt
File size in bytes: 3435
INFO:main:
Creating temp file touch page_00039.upload

INFO:main:

========

INFO:main:

uploading page_00040.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00040.pdf | tee page_00040.log
Uploading file: page_00040.pdf
File size: 585.6KB
Completed!
id: 1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k
drive url: https://docs.google.com/document/d/1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1HuN3CaZmteOauULBTmQBnUYcQsOK5a5HD-v_ytMIC_k&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00040.pdf | tee page_00040.log

File location: /home/wikisource/OCR4wikisource-master/page_00040.txt
File size in bytes: 2537
INFO:main:
Creating temp file touch page_00040.upload

INFO:main:

========

INFO:main:

uploading page_00041.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00041.pdf | tee page_00041.log
Uploading file: page_00041.pdf
File size: 535.5KB
Completed!
id: 14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE
drive url: https://docs.google.com/document/d/14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=14lDRKrPa51g2gUfGbknd9ES8MpqFpFX1SDgRoY4e-SE&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00041.pdf | tee page_00041.log

File location: /home/wikisource/OCR4wikisource-master/page_00041.txt
File size in bytes: 2351
INFO:main:
Creating temp file touch page_00041.upload

INFO:main:

========

INFO:main:

uploading page_00042.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00042.pdf | tee page_00042.log
Uploading file: page_00042.pdf
File size: 535.8KB
Completed!
id: 10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30
drive url: https://docs.google.com/document/d/10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=10KNBy5lsdJpj2T8lXgZYdySlPtsyYM9x2qXUl4__F30&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00042.pdf | tee page_00042.log

File location: /home/wikisource/OCR4wikisource-master/page_00042.txt
File size in bytes: 2202
INFO:main:
Creating temp file touch page_00042.upload

INFO:main:

========

INFO:main:

uploading page_00043.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00043.pdf | tee page_00043.log
Uploading file: page_00043.pdf
File size: 485.5KB
id: 184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY
drive url: https://docs.google.com/document/d/184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=184jd_RZhTG9jxTzNGXCJan_VVwiMOFFiZMzMsu1GavY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00043.pdf | tee page_00043.log

File location: /home/wikisource/OCR4wikisource-master/page_00043.txt
File size in bytes: 2189
INFO:main:
Creating temp file touch page_00043.upload

INFO:main:

========

INFO:main:

uploading page_00044.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00044.pdf | tee page_00044.log
Uploading file: page_00044.pdf
File size: 508.1KB
id: 1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30
drive url: https://docs.google.com/document/d/1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1_Nsfdgm9dc-rewuU54a1Nex9YpUEzbMbo8meTZzMO30&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00044.pdf | tee page_00044.log

File location: /home/wikisource/OCR4wikisource-master/page_00044.txt
File size in bytes: 2168
INFO:main:
Creating temp file touch page_00044.upload

INFO:main:

========

INFO:main:

uploading page_00045.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00045.pdf | tee page_00045.log
Uploading file: page_00045.pdf
File size: 526.1KB
Completed!
id: 1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ
drive url: https://docs.google.com/document/d/1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1PLNJ9ufleL0yx30Sj8NATOZP5htNGn_2dGswXp5tHBQ&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00045.pdf | tee page_00045.log

File location: /home/wikisource/OCR4wikisource-master/page_00045.txt
File size in bytes: 2328
INFO:main:
Creating temp file touch page_00045.upload

INFO:main:

========

INFO:main:

uploading page_00046.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00046.pdf | tee page_00046.log
Uploading file: page_00046.pdf
File size: 505.5KB
id: 1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0
drive url: https://docs.google.com/document/d/1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1O5lLfeNCVwKxwMGSY72j0ZuLJU0Y1etFUfNOBdhR2h0&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00046.pdf | tee page_00046.log

File location: /home/wikisource/OCR4wikisource-master/page_00046.txt
File size in bytes: 2093
INFO:main:
Creating temp file touch page_00046.upload

INFO:main:

========

INFO:main:

uploading page_00047.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00047.pdf | tee page_00047.log
Uploading file: page_00047.pdf
File size: 535.8KB
Completed!
id: 1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ
drive url: https://docs.google.com/document/d/1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1RuV4HtrqAftcNHJyxqn5RSgAUdXPxKVn4cH7eeumqgQ&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00047.pdf | tee page_00047.log

File location: /home/wikisource/OCR4wikisource-master/page_00047.txt
File size in bytes: 2334
INFO:main:
Creating temp file touch page_00047.upload

INFO:main:

========

INFO:main:

uploading page_00048.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00048.pdf | tee page_00048.log
Uploading file: page_00048.pdf
File size: 545.9KB
Completed!
id: 1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs
drive url: https://docs.google.com/document/d/1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1cwqGeWZ3TZC5dMD69IpGxgACaBtBZFYGDKR-M9omZrs&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00048.pdf | tee page_00048.log

File location: /home/wikisource/OCR4wikisource-master/page_00048.txt
File size in bytes: 2279
INFO:main:
Creating temp file touch page_00048.upload

INFO:main:

========

INFO:main:

uploading page_00049.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00049.pdf | tee page_00049.log
Uploading file: page_00049.pdf
File size: 527.1KB
Completed!
id: 19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY
drive url: https://docs.google.com/document/d/19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=19Jx2BoAmYUaEpqQO1Qr24sLBMuuhuY42C_2lXOw-qAY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00049.pdf | tee page_00049.log

File location: /home/wikisource/OCR4wikisource-master/page_00049.txt
File size in bytes: 2343
INFO:main:
Creating temp file touch page_00049.upload

INFO:main:

========

INFO:main:

uploading page_00050.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00050.pdf | tee page_00050.log
Uploading file: page_00050.pdf
File size: 447.8KB
id: 1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM
drive url: https://docs.google.com/document/d/1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1ZQILYUxmRe2qslHb1crzTlmXu7Jqi4KZ9RMV3YXEnSM&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00050.pdf | tee page_00050.log

File location: /home/wikisource/OCR4wikisource-master/page_00050.txt
File size in bytes: 1735
INFO:main:
Creating temp file touch page_00050.upload

INFO:main:

========

INFO:main:

uploading page_00051.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00051.pdf | tee page_00051.log
Uploading file: page_00051.pdf
File size: 664.8KB
Completed!
id: 1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA
drive url: https://docs.google.com/document/d/1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1mh96XsLuno0yPeB7H1rZ1hdrqM1B7-kr3dKHgkePlBA&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00051.pdf | tee page_00051.log

File location: /home/wikisource/OCR4wikisource-master/page_00051.txt
File size in bytes: 3123
INFO:main:
Creating temp file touch page_00051.upload

INFO:main:

========

INFO:main:

uploading page_00052.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00052.pdf | tee page_00052.log
Uploading file: page_00052.pdf
File size: 908.6KB
Completed!
id: 19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0
drive url: https://docs.google.com/document/d/19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=19cD_FjKBs-pO6z1HH1ck1BzD2uVECYiscY3ZA-uCug0&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00052.pdf | tee page_00052.log

File location: /home/wikisource/OCR4wikisource-master/page_00052.txt
File size in bytes: 4403
INFO:main:
Creating temp file touch page_00052.upload

INFO:main:

========

INFO:main:

uploading page_00053.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00053.pdf | tee page_00053.log
Uploading file: page_00053.pdf
File size: 875.9KB
Completed!
id: 1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o
drive url: https://docs.google.com/document/d/1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1QAKVQsO9SGMa6rx2J10FoDAzBD17XWzXC_T8N9-sQ8o&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00053.pdf | tee page_00053.log

File location: /home/wikisource/OCR4wikisource-master/page_00053.txt
File size in bytes: 4221
INFO:main:
Creating temp file touch page_00053.upload

INFO:main:

========

INFO:main:

uploading page_00054.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00054.pdf | tee page_00054.log
Uploading file: page_00054.pdf
File size: 896.2KB
Completed!
id: 1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k
drive url: https://docs.google.com/document/d/1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1-V4ijFiI0vic981jxN-i85BF9jIxDDYMCjkzUVD7-2k&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00054.pdf | tee page_00054.log

File location: /home/wikisource/OCR4wikisource-master/page_00054.txt
File size in bytes: 4246
INFO:main:
Creating temp file touch page_00054.upload

INFO:main:

========

INFO:main:

uploading page_00055.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00055.pdf | tee page_00055.log
Uploading file: page_00055.pdf
File size: 862.8KB
Completed!
id: 1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE
drive url: https://docs.google.com/document/d/1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1QVgL7NLYZ_N8dJ4mdhj_C2z5Tne6_LEG6tycxS435RE&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00055.pdf | tee page_00055.log

File location: /home/wikisource/OCR4wikisource-master/page_00055.txt
File size in bytes: 4382
INFO:main:
Creating temp file touch page_00055.upload

INFO:main:

========

INFO:main:

uploading page_00056.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00056.pdf | tee page_00056.log
Uploading file: page_00056.pdf
File size: 675.1KB
Completed!
id: 1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg
drive url: https://docs.google.com/document/d/1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1HAwwlDQ3aTt63CDZdP586KxDr30mp8CQXG7Oqu9cYkg&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00056.pdf | tee page_00056.log

File location: /home/wikisource/OCR4wikisource-master/page_00056.txt
File size in bytes: 3090
INFO:main:
Creating temp file touch page_00056.upload

INFO:main:

========

INFO:main:

uploading page_00057.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00057.pdf | tee page_00057.log
Uploading file: page_00057.pdf
File size: 904.7KB
Completed!
id: 1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY
drive url: https://docs.google.com/document/d/1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1v5IeC23puEDVQNM73zIzBn8jBXGUx2v6RkDCkAr5yHY&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00057.pdf | tee page_00057.log

File location: /home/wikisource/OCR4wikisource-master/page_00057.txt
File size in bytes: 4457
INFO:main:
Creating temp file touch page_00057.upload

INFO:main:

========

INFO:main:

uploading page_00058.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00058.pdf | tee page_00058.log
Uploading file: page_00058.pdf
File size: 917.6KB
Completed!
id: 1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk
drive url: https://docs.google.com/document/d/1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1NTHufIzdw5fwb_2LP_G81R2QBiJ49MwuUIULnxyp0Nk&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00058.pdf | tee page_00058.log

File location: /home/wikisource/OCR4wikisource-master/page_00058.txt
File size in bytes: 4515
INFO:main:
Creating temp file touch page_00058.upload

INFO:main:

========

INFO:main:

uploading page_00059.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00059.pdf | tee page_00059.log
Uploading file: page_00059.pdf
File size: 903.7KB
Completed!
id: 1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE
drive url: https://docs.google.com/document/d/1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1LUkyWR5bPk9He1dl1DZd6xaivICT3BN0NNMA3jrJHsE&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00059.pdf | tee page_00059.log

File location: /home/wikisource/OCR4wikisource-master/page_00059.txt
File size in bytes: 4466
INFO:main:
Creating temp file touch page_00059.upload

INFO:main:

========

INFO:main:

uploading page_00060.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00060.pdf | tee page_00060.log
Uploading file: page_00060.pdf
File size: 890.5KB
Completed!
id: 1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE
drive url: https://docs.google.com/document/d/1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1Dit4nP3wm6EG615Ij3BsoHtcbOsvsq_1cZT-U2YgdRE&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00060.pdf | tee page_00060.log

File location: /home/wikisource/OCR4wikisource-master/page_00060.txt
File size in bytes: 4481
INFO:main:
Creating temp file touch page_00060.upload

INFO:main:

========

INFO:main:

uploading page_00061.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00061.pdf | tee page_00061.log
Uploading file: page_00061.pdf
File size: 876.3KB
Completed!
id: 1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM
drive url: https://docs.google.com/document/d/1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1JcqsgCMN3PDCPp0sNU0cj6ZbcNMHE166-fQLm_Wu6RM&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00061.pdf | tee page_00061.log

File location: /home/wikisource/OCR4wikisource-master/page_00061.txt
File size in bytes: 4456
INFO:main:
Creating temp file touch page_00061.upload

INFO:main:

========

INFO:main:

uploading page_00062.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00062.pdf | tee page_00062.log
Uploading file: page_00062.pdf
File size: 844.1KB
Completed!
id: 1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI
drive url: https://docs.google.com/document/d/1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1trX7Atvs_Dx5cix6GJh7asS8B6ar6B3qvA9kK8vw2ZI&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00062.pdf | tee page_00062.log

File location: /home/wikisource/OCR4wikisource-master/page_00062.txt
File size in bytes: 4051
INFO:main:
Creating temp file touch page_00062.upload

INFO:main:

========

INFO:main:

uploading page_00063.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00063.pdf | tee page_00063.log
Uploading file: page_00063.pdf
File size: 358.0KB
id: 1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c
drive url: https://docs.google.com/document/d/1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1jINzyblPHZytWCKVk4_nm8ZpSBlWJZNYlmsgZAIXZ0c&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00063.pdf | tee page_00063.log

File location: /home/wikisource/OCR4wikisource-master/page_00063.txt
File size in bytes: 1224
INFO:main:
Creating temp file touch page_00063.upload

INFO:main:

========

INFO:main:

uploading page_00064.pdf to google Drive.
INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00064.pdf | tee page_00064.log
Uploading file: page_00064.pdf
File size: 661.6KB
Completed!
id: 1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ
drive url: https://docs.google.com/document/d/1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ/edit?usp=drivesdk
download url: https://drive.google.com/uc?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&export=download
text/html: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=html
text/plain: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=txt
application/vnd.openxmlformats-officedocument.wordprocessingml.document: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=docx
application/zip: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=zip
application/vnd.oasis.opendocument.text: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=odt
application/epub+zip: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=epub
application/rtf: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=rtf
application/pdf: https://docs.google.com/feeds/download/documents/export/Export?id=1WFZaF37ZUTXWd6AVf7gnVoijEf1exZ-KIivgCZsVUGQ&exportFormat=pdf
INFO:main:

Downloading the OCRed text

INFO:main:Running gdput.py -t ocr -f 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t page_00064.pdf | tee page_00064.log

File location: /home/wikisource/OCR4wikisource-master/page_00064.txt
File size in bytes: 3149
INFO:main:
Creating temp file touch page_00064.upload

INFO:main:

========

INFO:main:Split the text files to sync with the original images
INFO:main:Joining text files based on Column No
INFO:main:Running cat txt_00001.txt > text_for_page_00001.txt
INFO:main:Running cat txt_00002.txt > text_for_page_00002.txt
INFO:main:Running cat txt_00003.txt > text_for_page_00003.txt
INFO:main:Running cat txt_00004.txt > text_for_page_00004.txt
INFO:main:Running cat txt_00005.txt > text_for_page_00005.txt
INFO:main:Running cat txt_00006.txt > text_for_page_00006.txt
INFO:main:Running cat txt_00007.txt > text_for_page_00007.txt
INFO:main:Running cat txt_00008.txt > text_for_page_00008.txt
INFO:main:Running cat txt_00009.txt > text_for_page_00009.txt
INFO:main:Running cat txt_00010.txt > text_for_page_00010.txt
INFO:main:Running cat txt_00011.txt > text_for_page_00011.txt
INFO:main:Running cat txt_00012.txt > text_for_page_00012.txt
INFO:main:Running cat txt_00013.txt > text_for_page_00013.txt
INFO:main:Running cat txt_00014.txt > text_for_page_00014.txt
INFO:main:Running cat txt_00015.txt > text_for_page_00015.txt
INFO:main:Running cat txt_00016.txt > text_for_page_00016.txt
INFO:main:Running cat txt_00017.txt > text_for_page_00017.txt
INFO:main:Running cat txt_00018.txt > text_for_page_00018.txt
INFO:main:Running cat txt_00019.txt > text_for_page_00019.txt
INFO:main:Running cat txt_00020.txt > text_for_page_00020.txt
INFO:main:Running cat txt_00021.txt > text_for_page_00021.txt
INFO:main:Running cat txt_00022.txt > text_for_page_00022.txt
INFO:main:Running cat txt_00023.txt > text_for_page_00023.txt
INFO:main:Running cat txt_00024.txt > text_for_page_00024.txt
INFO:main:Running cat txt_00025.txt > text_for_page_00025.txt
INFO:main:Running cat txt_00026.txt > text_for_page_00026.txt
INFO:main:Running cat txt_00027.txt > text_for_page_00027.txt
INFO:main:Running cat txt_00028.txt > text_for_page_00028.txt
INFO:main:Running cat txt_00029.txt > text_for_page_00029.txt
INFO:main:Running cat txt_00030.txt > text_for_page_00030.txt
INFO:main:Running cat txt_00031.txt > text_for_page_00031.txt
INFO:main:Running cat txt_00032.txt > text_for_page_00032.txt
INFO:main:Running cat txt_00033.txt > text_for_page_00033.txt
INFO:main:Running cat txt_00034.txt > text_for_page_00034.txt
INFO:main:Running cat txt_00035.txt > text_for_page_00035.txt
INFO:main:Running cat txt_00036.txt > text_for_page_00036.txt
INFO:main:Running cat txt_00037.txt > text_for_page_00037.txt
INFO:main:Running cat txt_00038.txt > text_for_page_00038.txt
INFO:main:Running cat txt_00039.txt > text_for_page_00039.txt
INFO:main:Running cat txt_00040.txt > text_for_page_00040.txt
INFO:main:Running cat txt_00041.txt > text_for_page_00041.txt
INFO:main:Running cat txt_00042.txt > text_for_page_00042.txt
INFO:main:Running cat txt_00043.txt > text_for_page_00043.txt
INFO:main:Running cat txt_00044.txt > text_for_page_00044.txt
INFO:main:Running cat txt_00045.txt > text_for_page_00045.txt
INFO:main:Running cat txt_00046.txt > text_for_page_00046.txt
INFO:main:Running cat txt_00047.txt > text_for_page_00047.txt
INFO:main:Running cat txt_00048.txt > text_for_page_00048.txt
INFO:main:Running cat txt_00049.txt > text_for_page_00049.txt
INFO:main:Running cat txt_00050.txt > text_for_page_00050.txt
INFO:main:Running cat txt_00051.txt > text_for_page_00051.txt
INFO:main:Running cat txt_00052.txt > text_for_page_00052.txt
INFO:main:Running cat txt_00053.txt > text_for_page_00053.txt
INFO:main:Running cat txt_00054.txt > text_for_page_00054.txt
INFO:main:Running cat txt_00055.txt > text_for_page_00055.txt
INFO:main:Running cat txt_00056.txt > text_for_page_00056.txt
INFO:main:Running cat txt_00057.txt > text_for_page_00057.txt
INFO:main:Running cat txt_00058.txt > text_for_page_00058.txt
INFO:main:Running cat txt_00059.txt > text_for_page_00059.txt
INFO:main:Running cat txt_00060.txt > text_for_page_00060.txt
INFO:main:Running cat txt_00061.txt > text_for_page_00061.txt
INFO:main:Running cat txt_00062.txt > text_for_page_00062.txt
INFO:main:Running cat txt_00063.txt > text_for_page_00063.txt
INFO:main:Running cat txt_00064.txt > text_for_page_00064.txt
INFO:main:
Moving all temp files to OCR-Alochana_Magazine_October_1958.pdf-temp-2018-04-27-22-46-52

INFO:main:Running mv folder*.log currentfile.pdf doc_data.txt pg*.pdf page* txt* "OCR-Alochana_Magazine_October_1958.pdf-temp-2018-04-27-22-46-52"
INFO:main:Merged all OCRed files to all_text_for_Alochana_Magazine_October_1958.pdf.txt
INFO:main:Making a copy of all text files to text-for-Alochana_Magazine_October_1958.pdf
INFO:main:Running cp *.txt text-for-Alochana_Magazine_October_1958.pdf
INFO:main:
Deleting the Temp folder in Google Drive OCR-Alochana_Magazine_October_1958.pdf-temp-2018-04-27-22-46-52

INFO:main:Running gdrm.py 167kUsTMlce7YnDxMes5hvrvSjw7Ftx3t
INFO:main:

Done. Check the text files start with text_for_page_
INFO:main:

The PDF files and result text files are equval. Now running the mediawiki_uploader.py script

Traceback (most recent call last):
File "mediawiki_uploader.py", line 3, in
import wikitools
ImportError: No module named wikitools

I am tried to run python mediawiki_uploader.py this command but it gave again this error

section modifications - i am not able to understand how to do it

Hi,
I have an existing page, built from several sections. I want the robot to modify only one of sections, leaving the rest intact.

Let us say the current content looks like that:

[Category:one]
== section one ==
=== para one
bla ba bla

== section two ==
blablabla

I am generating a new version of the section one in the script:

== section one ==
=== para one
bla ba bla ba bla bla

and trying to commit into mediawiki:

wikipage = page.Page(mymediawiki, title='MyPage" , section="section one")

but the result is that the new section one is removing "[Category:one]" with the new content of section one, putting it on the top of previous section one.

Where am I making a mistake in my script?

Cheers :)

very high lag time: possible causes?

Hello,

I'm experiencing real high lag time.
I even hit the message ('Server lag, sleeping 14 seconds').

Could you please suggest which could possible reasons be ?

I am just running this test from console:

def search_wikipedia_random():
    site = wiki.Wiki("https://en.wikipedia.org/w/api.php") 
    params = {
        'action':'query', 
        'list':'random',
        'rnnamespace' : 0,
        'rnfilterredir' : 'all' ,
        'rnlimit' : 1,
        'redirects' : '',
        'format' : 'json',
        }
    request = api.APIRequest(site, params)
    result = request.query()
    return result['query']['random'][0]['id']

import time

start = time.time()
search_wikipedia_random()
end = time.time()
print(end - start)

and got

16.8418629169
13.1237468719

!

I am not having problems in browsing, so I don't think is problem of the line.. (right now listening to youtube and doing stuff and rassodocks in the evening :) )
I wonder if I could be lagged out for not having configured something (headers?) or if I m missing something.

wikitools.wiki.Wiki for biografias.bcn.cl: Invalid JSON

>>> import wikitools
>>> site = wikitools.wiki.Wiki('http://biografias.bcn.cl/api.php')
Invalid JSON, trying request again
Invalid JSON, trying request again
Invalid JSON, trying request again
Invalid JSON, trying request again
Invalid JSON, trying request again
^CTraceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/federico/.local/lib/python2.7/site-packages/wikitools/wiki.py", line 95, in __init__
    self.setSiteinfo()
  File "/home/federico/.local/lib/python2.7/site-packages/wikitools/wiki.py", line 113, in setSiteinfo
    info = req.query(False)
  File "/home/federico/.local/lib/python2.7/site-packages/wikitools/api.py", line 158, in query
    rawdata = self.__getRaw()
  File "/home/federico/.local/lib/python2.7/site-packages/wikitools/api.py", line 270, in __getRaw
    data = gzip.GzipFile('', 'rb', 9, StringIO.StringIO(data.read()))
  File "/usr/lib64/python2.7/socket.py", line 355, in read
    data = self._sock.recv(rbufsize)
  File "/usr/lib64/python2.7/httplib.py", line 597, in read
    s = self.fp.read(amt)
  File "/usr/lib64/python2.7/socket.py", line 384, in read
    data = self._sock.recv(left)
KeyboardInterrupt

Note, this wiki has some weird webserver configuration which causes several problems. Apparently, api.php returns the HTML main page if you attempt to POST any data instead of POSTing the API parameters as URL parameters. I'm not sure that's the culprit though.

From WikiTeam/wikiteam#314

Allow finer control of API errors in client code.

Hi! Citation Hunt uses wikitools on its (batch) jobs that process Wikipedia articles, both for fetching the article's text and converting wikicode to HTML.

I was bit by the fact that wikitools will sometimes retry forever upon API errors, which was, interestingly enough, mitigated by another hack I have in place for persisting connections (I'll file a separate issue for that later).

For the purposes of Citation Hunt's batch jobs, I don't particularly care if a single request fails, and I can just as well move on to another one, so it would be nice to have that kind of control. Maybe this could just raise an exception for the client to handle cleanly?

In addition to that, for tools that perform live queries against the API to serve requests, it would be nice to have some control over maxlag errors. wikitools will currently time.sleep when one happens, but if an exception was raised instead, the client might be able to handle it some other way. The sleep could be made optional for compatibility and sane out-of-the-box behavior.

What do you think? I'll be happy to try my hand at this, once we agree on the changes to be made :)

Expand documentation and code samples

The documentation that exists is clear and well written; however, it is very limited. Suggestions for improvement:

  • Expand the documentation to modules besides api, including links to the relevant [[API]] subpages
  • Add to the code samples:
    • an example of using APIRequest.query() to make edits
    • usage of Wiki.setUserAgent

COPYING missing

which setup.py refers to via

package_data={'wikitools': ['COPYING']}

GET requests

A site maintainer asked if the requests for content could be sent as GET instead of POST, so I made a couple of minor changes in wikitools to support this.

In case you want to propagate something like this to the master, the patch files wrt the current package version are linked here: https://gist.github.com/nberger-git/7fa9bc527c21a11caba2

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.