sec-edgar-text's People

Contributors

alions7000, jiwooshim

sec-edgar-text's Issues

Help with text_document module

Hey,

I'm trying to use the text_document module to search filings for specific text, but when I open the file to run it, it shows a traceback error and closes immediately. Any advice? I'm looking to search for keywords like "supply chain issues". How do I use this module from main?
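For anyone with the same goal, here is a minimal sketch of a phrase search over extracts that have already been saved to disk. It does not use the text_document module or any other part of this repo; the function name `find_phrase` is made up for illustration, and it assumes the extracts are plain `.txt` files somewhere under the storage folder.

```python
import re
from pathlib import Path


def find_phrase(folder, phrase, context=60):
    """Yield (file name, snippet) for every case-insensitive match of phrase.

    Hypothetical helper, not part of SEC-EDGAR-text. Assumes the extracts
    are plain .txt files located anywhere under `folder`.
    """
    pattern = re.compile(re.escape(phrase), re.IGNORECASE)
    for path in sorted(Path(folder).rglob("*.txt")):
        text = path.read_text(encoding="utf-8", errors="ignore")
        for match in pattern.finditer(text):
            lo = max(0, match.start() - context)
            hi = match.end() + context
            # Collapse whitespace so each snippet prints on one line.
            yield path.name, " ".join(text[lo:hi].split())
```

Calling `list(find_phrase("/tmp/my_storage_folder", "supply chain issues"))` would return one entry per match, each with the file name and a short snippet around the phrase. Running a script from a terminal, rather than double-clicking it, also keeps the window open so the full traceback can be read.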

Best,

SQLite Database Error when running the program

I installed the requirements.txt file and I'm trying to call SEC-EDGAR-text from its parent folder.

I am following your instructions but the program shows me an sqlite error.

More specifically:

File "SEC-EDGAR-text/__main__.py", line 20, in <module>
    from src.control import Downloader
  File "SEC-EDGAR-text/src/control.py", line 11, in <module>
    from .download import EdgarCrawler
  File "SEC-EDGAR-text/src/download.py", line 15, in <module>
    from .utils import args, logger, requests_get
  File "SEC-EDGAR-text/src/utils.py", line 99, in <module>
    sql_connection = sqlite3.connect(db_location)
sqlite3.OperationalError: unable to open database file

I saw that you have commented out some lines in the source code where the DB is created.

Can you specify the problem? Thanks.
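One common cause of `unable to open database file` is that the folder meant to hold the database does not exist or is not writable, because `sqlite3.connect` creates the file but not its parent directories. Whether that is what happens here is an assumption; the traceback only shows that the `connect` call failed. A minimal sketch of the workaround, with a hypothetical helper name and the `metadata.sqlite3` file name taken from the log output in another issue:

```python
import sqlite3
from pathlib import Path


def open_metadata_db(storage_folder, filename="metadata.sqlite3"):
    """Create the storage folder if needed, then open the SQLite file in it.

    Hypothetical helper, not the repo's own code. sqlite3.connect creates
    the database file itself but never its parent directories.
    """
    folder = Path(storage_folder).expanduser()
    folder.mkdir(parents=True, exist_ok=True)
    return sqlite3.connect(str(folder / filename))
```

If this is the cause, creating the folder passed to `--storage` by hand before running the program should have the same effect.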

How to run this?

Hi,

This repo seemed promising for my work.
I am not able to run it.
python secedgartext --storage=/tmp/my_storage_folder

There is no secedgartext file to run.

WARNING No valid index page and No output files in batch_0001\001

Hey,

I tried to run this program to download some SEC filings text. The installation process was smooth in Anaconda. However, when I run it, I just cannot obtain any outputs in the batch_000x\001 folder. I tried to update all the dependencies but it did not help. Appreciate if anyone can help here. Thanks so much!

One of my trials is as follows:

(base) C:\Users\x>python SEC-EDGAR-text-master --storage=C:\Users\y --start=20170101 --end=99991231 --filings=10-Q --multiprocessing_cores=8 --start_company=0 --end_company=10
Enter filing report period ccyy, ccyymm etc. (default: all periods): all periods
20210515 10:32:40 INFO =================================================================
20210515 10:32:40 INFO Analysis started at 20210515 10:32:40
20210515 10:32:40 INFO Command line: SEC-EDGAR-text-master
20210515 10:32:40 INFO Arguments: SEC-EDGAR-text-master --storage=C:\Users\y --start=20170101 --end=99991231 --filings=10-Q --multiprocessing_cores=8 --start_company=0 --end_company=10
20210515 10:32:40 INFO =================================================================
20210515 10:32:40 INFO Opened SQL connection: C:\Users\y\metadata.sqlite3
20210515 10:32:40 INFO Traffic Limit Pause (ms): 0
20210515 10:32:40 INFO Using companies list: SEC-EDGAR-text-master\companies_list.txt
20210515 10:32:40 INFO -----------------------------------------------------------------
20210515 10:32:40 INFO Downloading 504 companies: SEC-EDGAR-text-master\companies_list.txt
20210515 10:32:40 INFO Filings period: 20170101 - 99991231
20210515 10:32:40 INFO Filings search: ['10-Q']
20210515 10:32:40 INFO Storage location: C:\Users\y
20210515 10:32:40 INFO -----------------------------------------------------------------
20210515 10:32:40 INFO Saving extracts (if successful) only. Not saving source documents locally.
20210515 10:32:40 INFO SEC filing date range: 20170101 to 99991231
20210515 10:32:40 INFO Batch number: 5, begin downloading company: 1 / 10
20210515 10:32:40 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:40 INFO Query EDGAR database for 10-Q, Search: 0000354950 (HD)
20210515 10:32:41 DEBUG EDGAR search URL: https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0000354950&type=10-Q&datea=20170101&dateb=99991231&owner=exclude&output=html&count=100
20210515 10:32:41 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:41 INFO Identified 0 filings, gathering SEC metadata and document links...
20210515 10:32:42 DEBUG Finished attempting to download all the 10-Q forms for HD
20210515 10:32:42 INFO Batch number: 5, begin downloading company: 2 / 10
20210515 10:32:42 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:42 INFO Query EDGAR database for 10-Q, Search: 0000050104 (TSO)
20210515 10:32:42 DEBUG EDGAR search URL: https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0000050104&type=10-Q&datea=20170101&dateb=99991231&owner=exclude&output=html&count=100
20210515 10:32:42 INFO ----------------------------------------------------------------------------------------------------
20210515 10:32:42 INFO Identified 5 filings, gathering SEC metadata and document links...
20210515 10:32:43 WARNING No valid index page, attempt 1: https://www.sec.gov//Archives/edgar/data/50104/000005010418000103/0000050104-18-000103-index.htm
20210515 10:32:57 WARNING No valid index page, attempt 1: https://www.sec.gov//Archives/edgar/data/50104/000005010417000229/0000050104-17-000229-index.htm
20210515 10:33:11 WARNING No valid index page, attempt 1: https://www.sec.gov//Archives/edgar/data/50104/000005010417000189/0000050104-17-000189-index.htm
20210515 10:33:24 DEBUG Finished attempting to download all the 10-Q forms for TSO

...
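The log alone does not say why the index pages are rejected. One possible cause, stated here as an assumption, is that the SEC asks automated clients to send a descriptive `User-Agent` header and to limit their request rate, and may refuse requests that do not. The sketch below uses only the standard library and made-up helper names, not the repo's `requests_get`; it rebuilds the index URL seen in the warnings (without the doubled slash) and attaches such a header.

```python
from urllib.request import Request

SEC_BASE = "https://www.sec.gov"


def index_url(cik, accession):
    """Build a filing index URL from a CIK and a dashed accession number."""
    folder = accession.replace("-", "")
    # int() drops the leading zeros of the CIK, as in the logged URLs.
    return f"{SEC_BASE}/Archives/edgar/data/{int(cik)}/{folder}/{accession}-index.htm"


def build_request(url, contact):
    """Return a Request that declares who is making the call.

    `contact` is a placeholder such as "Sample Company admin@example.com";
    replace it with your own details.
    """
    return Request(url, headers={"User-Agent": contact})
```

Passing the result of `build_request(index_url("0000050104", "0000050104-18-000103"), "Sample Company admin@example.com")` to `urllib.request.urlopen` is a quick way to check whether the page is reachable outside the program.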
