Comments (13)
Thanks for reporting this issue. Quite some info on the internet about this problem (https://access.redhat.com/articles/2039753).
I did change https://
to http://
in commit 30312c0. Does this solve your issue?
from recordlinkage.
I also have problem with this line, but I get a "ValueError: Unrecognised compression type: zip" error instead ... the error is different from the above: (this is on Win 7 Pro, 64 bit, running Anaconda)
krebs_data, krebs_match = load_krebsregister(missing_values=0)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "C:\Users\user\AppData\Roaming\Python\Python27\site-packages\recordlinkag
e\datasets\external.py", line 75, in load_krebsregister
data = pandas.concat([_krebsregister_block(bl) for bl in block])
File "C:\Users\user\AppData\Roaming\Python\Python27\site-packages\recordlinkag
e\datasets\external.py", line 123, in _krebsregister_block
compression='zip')
File "C:\Anaconda2\lib\site-packages\pandas\io\parsers.py", line 529, in parse
r_f
return _read(filepath_or_buffer, kwds)
File "C:\Anaconda2\lib\site-packages\pandas\io\parsers.py", line 295, in _read
parser = TextFileReader(filepath_or_buffer, **kwds)
File "C:\Anaconda2\lib\site-packages\pandas\io\parsers.py", line 612, in __ini
t__
self._make_engine(self.engine)
File "C:\Anaconda2\lib\site-packages\pandas\io\parsers.py", line 747, in _make
_engine
self._engine = CParserWrapper(self.f, **self.options)
File "C:\Anaconda2\lib\site-packages\pandas\io\parsers.py", line 1119, in __in
it__
self._reader = _parser.TextReader(src, **kwds)
File "pandas\parser.pyx", line 353, in pandas.parser.TextReader.__cinit__ (pan
das\parser.c:3246)
File "pandas\parser.pyx", line 567, in pandas.parser.TextReader._setup_parser_
source (pandas\parser.c:5787)
ValueError: Unrecognized compression type: zip
from recordlinkage.
Can you show the output of
import pandas
pandas.show_versions()
It looks like an issue with your installation of pandas. Can you read a zipped csv file with pandas.read_csv('yourfile_csv.zip")
?
from recordlinkage.
from recordlinkage.
from recordlinkage.
from recordlinkage.
from recordlinkage.
Problem prevails with recordlinkage version 0.11.2
>>> from recordlinkage.datasets import load_krebsregister
>>> krebs_data, krebs_match = load_krebsregister(missing_values=0)
Start downloading the data.
Issue with downloading the data: [Errno 13] Permission denied: '/usr/local/lib/python3.5/dist-packages/recordlinkage/datasets/krebsregister/block_10.zip'
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/local/lib/python3.5/dist-packages/recordlinkage/datasets/external.py", line 75, in load_krebsregister
data = pandas.concat([_krebsregister_block(bl) for bl in block])
File "/usr/local/lib/python3.5/dist-packages/recordlinkage/datasets/external.py", line 75, in <listcomp>
data = pandas.concat([_krebsregister_block(bl) for bl in block])
File "/usr/local/lib/python3.5/dist-packages/recordlinkage/datasets/external.py", line 123, in _krebsregister_block
compression='zip')
File "/usr/local/lib/python3.5/dist-packages/pandas/io/parsers.py", line 709, in parser_f
return _read(filepath_or_buffer, kwds)
File "/usr/local/lib/python3.5/dist-packages/pandas/io/parsers.py", line 449, in _read
parser = TextFileReader(filepath_or_buffer, **kwds)
File "/usr/local/lib/python3.5/dist-packages/pandas/io/parsers.py", line 818, in __init__
self._make_engine(self.engine)
File "/usr/local/lib/python3.5/dist-packages/pandas/io/parsers.py", line 1049, in _make_engine
self._engine = CParserWrapper(self.f, **self.options)
File "/usr/local/lib/python3.5/dist-packages/pandas/io/parsers.py", line 1695, in __init__
self._reader = parsers.TextReader(src, **kwds)
File "pandas/_libs/parsers.pyx", line 402, in pandas._libs.parsers.TextReader.__cinit__
File "pandas/_libs/parsers.pyx", line 664, in pandas._libs.parsers.TextReader._setup_parser_source
File "/usr/lib/python3.5/zipfile.py", line 1009, in __init__
self.fp = io.open(file, filemode)
FileNotFoundError: [Errno 2] No such file or directory: '/usr/local/lib/python3.5/dist-packages/recordlinkage/datasets/krebsregister/block_1.zip'
from recordlinkage.
Hello, this is a different issue. You do not have permission to write /usr/local/lib/python3.5/dist-packages/recordlinkage/datasets/krebsregister/block_10.zip
.
Is it possible to get write access?
from recordlinkage.
Yes it is, I invoked python3 with sudo. I wasn't aware of what the problem was in this case.
Thanks for the help!
from recordlinkage.
I encountered the same problem when doing the load:
krebs_X, krebs_true_links = load_krebsregister(missing_values=0)
Error Message:
FileNotFoundError: [Errno 2] No such file or directory: 'C:\Users\s59018\AppData\Roaming\Python\Python36\site-packages\recordlinkage\datasets\krebsregister\block_1.zip'
Please help.
from recordlinkage.
This is a permission error. Do you have permission to the given folder?
I will work on a permanent solution.
from recordlinkage.
I rerun it successfully from home rather than in the office. Looks like a connection problem which fails to download the data completely.
from recordlinkage.
Related Issues (20)
- threshold in at compere is broken
- missing values HOT 4
- compare.date
- What languages are supported by this toolkit? only English?
- optimize Performance ?
- fastparquet 0.8.1: writing dataframe to parquet file from a table data field with rtf doc content falls with TypeError exception
- Data Corruptors a la GeCO
- AttributeError: module 'recordlinkage' has no attribute 'SortedNeighbourhoodIndex' HOT 1
- How to utilize prob-related methods of ECM classifier
- Support for pandas datatypes
- missing value is not working and it is default to 0 even if we change the value. HOT 1
- Possible bug with _dedup_index when df has only 1 row.
- For when support for packages like Dask or Ray (or Modin)?
- Candidate pairs issue
- Indexing - performance warning - full index can result in a large number of pairs HOT 3
- `ECMClassifier` returns almost all candidate pairs HOT 2
- Address Matching Conditional on value of another column HOT 1
- Duplicated matching columns with rl_comparer.compute while looping over zip code HOT 2
- automatically check how many components are defined in rl.Compare()
- Length mismatch at
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from recordlinkage.