leoliu0 / cik-cusip-mapping Goto Github PK
View Code? Open in Web Editor NEWprovide cik to cusip links using 13G and 13D filings
provide cik to cusip links using 13G and 13D filings
Thanks for your work here, it is very useful - is there any particular reason why you recommend against using this? It seems to still be a valid method for mapping CIKs to CUSIPs?
Hi
Thanks for making this mapping available to everyone. It is extremely useful. I noticed what I think is a small bug in the output. It seems cusip8 "94311218" occurs 1,388 times in the output file and appears to map to a different CIK each time. cusip8 "94311218" doesn't appear to match any security on https://quotes.fidelity.com/ftgw/fbc/ofquotes/mmnet/SymLookup. Is it a placeholder? Thanks again.
Hi,
Excellent job! Thank you.
I have some questions.
The map between cik and cusip is not 1 to 1. It is many to many. Is this correct?
For instance, cusip G1117K maps to 2 ciks (1769484, 1725206), while cik 1172939 maps to 6 cusips (49900N, 024855, 024857, 027858, 024884, 024856).
Can you explains why?
Thanks!
The download of the files from python dl.py
does not work properly.
The HTML-files are all the same, stating that accessing files on EDGAR is not allowed through an automated tool.
Do you have a solution for this issue?
Hi,
Great work! Thanks
I download your codes for test. But it fails.
I modify dl_idx.py so it only download for 2004 as:
for year in range(2004, 2005):
It product more than 500,000 lines in full_index.csv. I delete 450,000 lines so it only have about 50,000 filing for test.
I then run "python dl.py 13G 13G", it download all html file in 13G folder, such as 1007853_2004-03-23.html. It doesn't download the form 15G filing. When I run "python parse_cusip.py 13G", it product 13G.csv like below:
13G/2004_02/1000045_2004-02-26.html,,
13G/2004_02/1000097_2004-02-03.html,,
13G/2004_02/1000180_2004-02-13.html,,
13G/2004_02/1000180_2004-02-11.html,,
13G/2004_02/1000209_2004-02-04.html,,
13G/2004_02/1000209_2004-02-12.html,,
13G/2004_02/1000209_2004-02-13.html,,
13G/2004_02/1000227_2004-02-12.html,,
13G/2004_02/1000227_2004-02-13.html,,
There are no cik, cusip. etc...
What do I do wrong?
Thank you!
Great package, I was wondering if you would be able to extend it to companies that don't yet have 13-G filings, normally these companies have cusips in their S-1 filings. If you can do that, this would be the most comprehensive mapping tool out there. (I am using the HTML parser currently, for what it is worth).
Hi Leo,
I am using the program to download NPORT-P files. After running dl_index.py, I see roughly 150000 NPORT-P entries from full_index.csv, but I am only able to download about 50000 entries after running dl.py. Could you please help with that? I am new to Python, so maybe I am missing something here.
Looking forward to your reply.
Thanks,
Leo
Hi Dr Liu,
Your cik-cusio-mapping database is so powerful and I am really impressed with it. However, when I try to run code python dl.py 13D 13D, it raises an error that I am missing 'full_index.csv' file. I am wondering whether you could share the full_index.csv file?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.