alions7000 / sec-edgar-text
Text information from US companies' SEC EDGAR electronic filings
License: GNU General Public License v3.0
Hey,
I'm trying to use the text_document module to search filings for specific text, but when I open the file to run it, it shows a traceback error and closes immediately. Any advice? I'm looking to search for keywords like "supply chain issues". How do I use this module from main?
Best,
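For the keyword search itself, here is a minimal sketch that does not depend on the text_document module at all. It assumes the extracts have already been saved as plain-text files somewhere under the storage folder; the function name `find_phrases`, the `*.txt` pattern, and the example folder path are all hypothetical, not part of this repo. It collapses whitespace before matching, so a phrase split across a line break is still found.

```python
from pathlib import Path


def find_phrases(folder, phrases, pattern="*.txt"):
    """Return {file path: [phrases found]} for text files under folder.

    Matching is case-insensitive and ignores line breaks inside a phrase.
    Hypothetical helper, not part of SEC-EDGAR-text.
    """
    lowered = [" ".join(p.lower().split()) for p in phrases]
    hits = {}
    for path in sorted(Path(folder).rglob(pattern)):
        raw = path.read_text(encoding="utf-8", errors="ignore")
        # Collapse all runs of whitespace so wrapped phrases still match.
        text = " ".join(raw.lower().split())
        found = [p for p, low in zip(phrases, lowered) if low in text]
        if found:
            hits[str(path)] = found
    return hits


if __name__ == "__main__":
    # The folder below is an assumed example location.
    results = find_phrases("my_storage_folder", ["supply chain issues"])
    for file_path, found in results.items():
        print(file_path, found)
```

Running the script from a terminal (rather than double-clicking the file) keeps the window open, so any traceback stays readable.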
I installed the requirements.txt file and I'm trying to call SEC-EDGAR-text from its parent folder.
I am following your instructions but the program shows me an sqlite error.
More specifically:
  File "SEC-EDGAR-text/__main__.py", line 20, in <module>
    from src.control import Downloader
  File "SEC-EDGAR-text/src/control.py", line 11, in <module>
    from .download import EdgarCrawler
  File "SEC-EDGAR-text/src/download.py", line 15, in <module>
    from .utils import args, logger, requests_get
  File "SEC-EDGAR-text/src/utils.py", line 99, in <module>
    sql_connection = sqlite3.connect(db_location)
sqlite3.OperationalError: unable to open database file
I saw that you have commented out some lines in the source code where the DB is created.
Can you tell me what the problem is? Thanks.
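One common cause of this exact message is that the folder meant to hold the database file does not exist yet: `sqlite3.connect` creates the file but not its parent directories. Whether that is what happens here is an assumption, since the traceback does not show the value of `db_location`. A minimal sketch of the workaround follows; the helper name `open_metadata_db` is hypothetical, and the default file name is taken from the log line "Opened SQL connection: ...\metadata.sqlite3" in another report.

```python
import os
import sqlite3


def open_metadata_db(storage_folder, filename="metadata.sqlite3"):
    """Create storage_folder if needed, then open the SQLite file inside it.

    Hypothetical helper, not the repo's own code.
    """
    # sqlite3.connect() fails with "unable to open database file"
    # when the parent directory is missing, so create it first.
    os.makedirs(storage_folder, exist_ok=True)
    db_location = os.path.join(storage_folder, filename)
    return sqlite3.connect(db_location)
```

In practice, creating the storage folder by hand before starting the program, and passing it with --storage, should have the same effect if this is indeed the cause.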
Hi,
This repo seemed promising for my work.
I am not able to run it.
python secedgartext --storage=/tmp/my_storage_folder
There is no secedgartext file to run.
Hey,
I tried to run this program to download the text of some SEC filings. Installation went smoothly in Anaconda. However, when I run it, I get no output in the batch_000x\001 folder. Updating all the dependencies did not help. I would appreciate any help. Thanks so much!
One of my attempts is as follows:
(base) C:\Users\x>python SEC-EDGAR-text-master --storage=C:\Users\y --start=20170101 --end=99991231 --filings=10-Q --multiprocessing_cores=8 --start_company=0 --end_company=10
Enter filing report period ccyy, ccyymm etc. (default: all periods): all periods
20210515 10:32:40 INFO =================================================================
20210515 10:32:40 INFO Analysis started at 20210515 10:32:40
20210515 10:32:40 INFO Command line: SEC-EDGAR-text-master
20210515 10:32:40 INFO Arguments: SEC-EDGAR-text-master --storage=C:\Users\y --start=20170101 --end=99991231 --filings=10-Q --multiprocessing_cores=8 --start_company=0 --end_company=10
20210515 10:32:40 INFO =================================================================
20210515 10:32:40 INFO Opened SQL connection: C:\Users\y\metadata.sqlite3
20210515 10:32:40 INFO Traffic Limit Pause (ms): 0
20210515 10:32:40 INFO Using companies list: SEC-EDGAR-text-master\companies_list.txt
20210515 10:32:40 INFO -----------------------------------------------------------------
20210515 10:32:40 INFO Downloading 504 companies: SEC-EDGAR-text-master\companies_list.txt
20210515 10:32:40 INFO Filings period: 20170101 - 99991231
20210515 10:32:40 INFO Filings search: ['10-Q']
20210515 10:32:40 INFO Storage location: C:\Users\y
20210515 10:32:40 INFO -----------------------------------------------------------------
20210515 10:32:40 INFO Saving extracts (if successful) only. Not saving source documents locally.
20210515 10:32:40 INFO SEC filing date range: 20170101 to 99991231
20210515 10:32:40 INFO Batch number: 5, begin downloading company: 1 / 10
20210515 10:32:40 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:40 INFO Query EDGAR database for 10-Q, Search: 0000354950 (HD)
20210515 10:32:41 DEBUG EDGAR search URL: https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0000354950&type=10-Q&datea=20170101&dateb=99991231&owner=exclude&output=html&count=100
20210515 10:32:41 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:41 INFO Identified 0 filings, gathering SEC metadata and document links...
20210515 10:32:42 DEBUG Finished attempting to download all the 10-Q forms for HD
20210515 10:32:42 INFO Batch number: 5, begin downloading company: 2 / 10
20210515 10:32:42 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:42 INFO Query EDGAR database for 10-Q, Search: 0000050104 (TSO)
20210515 10:32:42 DEBUG EDGAR search URL: https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0000050104&type=10-Q&datea=20170101&dateb=99991231&owner=exclude&output=html&count=100
20210515 10:32:42 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:42 INFO Identified 5 filings, gathering SEC metadata and document links...
20210515 10:32:43 WARNING No valid index page, attempt 1: https://www.sec.gov//Archives/edgar/data/50104/000005010418000103/0000050104-18-000103-index.htm
20210515 10:32:57 WARNING No valid index page, attempt 1: https://www.sec.gov//Archives/edgar/data/50104/000005010417000229/0000050104-17-000229-index.htm
20210515 10:33:11 WARNING No valid index page, attempt 1: https://www.sec.gov//Archives/edgar/data/50104/000005010417000189/0000050104-17-000189-index.htm
20210515 10:33:24 DEBUG Finished attempting to download all the 10-Q forms for TSO
...
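A possible explanation, not confirmed by this log: the SEC's fair-access guidance asks automated clients to declare a User-Agent identifying the requester and to stay at or below 10 requests per second, and requests that do not comply can be refused. The log above shows "Traffic Limit Pause (ms): 0". The sketch below shows one way to space out requests and build such a header. The names `Throttle` and `edgar_headers` and the contact string are hypothetical, and this is not the repo's own request code.

```python
import time


def edgar_headers(contact):
    """Build request headers that identify the requester to the SEC.

    `contact` is a placeholder such as "Your Name your.email@example.com".
    """
    return {"User-Agent": contact}


class Throttle:
    """Enforce a minimum interval between successive calls to wait()."""

    def __init__(self, min_interval_s, clock=time.monotonic, sleep=time.sleep):
        self.min_interval_s = min_interval_s
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, None before the first

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval_s - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


# Example wiring (requires the third-party `requests` package):
#   throttle = Throttle(0.2)  # at most 5 requests per second
#   throttle.wait()
#   requests.get(url, headers=edgar_headers("Your Name your.email@example.com"))
```

Opening one of the "No valid index page" URLs in a browser is a quick way to check whether the pages exist and only the scripted requests are being refused.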