govstat

Source code for GovStat.us website.

Webserver is implemented using Flask on Python 3.10.

Install locally using pip install .

Dependencies:

Python dependencies will be pulled in automatically by pip.

Data Sources

Some notes on where the data for this webapp comes from. Congress data on bills and votes comes from scrapers in the unitedstates/congress repo. Budget data comes from excel files published by the White House Office of Management and Budget (OMB).

To obtain congress data, do the following:

From the root of this repo, run:

usc-run votes --congress=XXX --session=YYYY --force=True --fast=True
usc-run govinfo --bulkdata=BILLSTATUS --congress=XXX
usc-run bills

where XXX is the Congress number, and YYYY is the session number.

For example,

usc-run votes --congress=117 --session=2022 --force=True --fast=True
usc-run govinfo --bulkdata=BILLSTATUS --congress=117
usc-run bills

Budget data is carried in this repo via git-lfs.

Flask MySQL DB Creation

Start a MySQL Server.

Simple start for MariaDB:

sudo mariadb-install-db --user=mysql --basedir=/usr --datadir=/var/lib/mysql
sudo systemctl start mariadb.service

Add Configuration Information:

cp app/cfg/config.sample.json app/cfg/config.json

and edit in the appropriate values to app/cfg/config.json.

Initialize and Start `flask`.

flask db init
flask db migrate -m "initial migration"
flask db upgrade

Populate DB

After creating the flask MySQL DB run the following commands to populate it:

python vote_loader.py
python bill_loader.py
python budget_loader.py

Webapp Entrypoint

To launch the webapp:

gunicorn -b localhost:5000 -w 4 govstat:app

Host Name (localhost)
Port Number (5000)
Number of Threads/Handlers (4)
Flask app and entrypoint (govstat:app)

Gunicorn, NGINX, and Supervisor Configuration

See above to run gunicorn.
Specify NGINX port permissions, and forwarding for HTTP and HTTPS requests at /etc/nginx/sites-enabled/
Configure supervisor to run gunicorn app at /etc/supervisor/conf.d/
Create SSL certificates

Directory Structure

congress/
+--	govstat/
    +-- app/
        +--	Bills.py
        +-- Budget.py
        +-- config.py
        +-- __init__.py		[App instantiation, database instantiation, import functions for data loading and retrieval.]
        +-- models.py
        +-- routes.py
        +--	Votes.py
        +-- static/
        +-- templates/
    +-- govstat.py
    +-- setup.py
    +-- bill_loader.py
    +--	vote_loader.py
+--	data/
    +-- 116/
        +-- amendments/
            +-- hamdt/ [House Amendments]
                +-- hamdtN/
                    +-- [JSON and XML files]
            +-- samdt/ [Senate Amendments]
                +-- samdtN/
                    +-- [JSON and XML files]
        +-- bills/
            +-- hconres/
                +-- hconresN/
                    +-- [XML files. After processing, JSON files]
            +-- hjres/
                +-- hjresN/
                    +-- [XML files. After processing, JSON files]
            +-- hr
                +-- hrN/
                    +-- [XML files. After processing, JSON files]
            +-- hres/
                +-- hresN/
                    +-- [XML files. After processing, JSON files]
            +-- s/
                +-- sN/
                    +-- [XML files. After processing, JSON files]
            +-- sconres/
                +-- sconresN/
                    +-- [XML files. After processing, JSON files]
            +-- sjres/
                +-- sjresN/
                    +-- [XML files. After processing, JSON files]
            +-- sres/
                +-- sresN/
                    +-- [XML files. After processing, JSON files]
        +-- votes/
            +-- 2020/
                +-- hN/
                    +-- [JSON and XML files]
                +-- sN/
                    +-- [JSON and XML files]
                +-- 2021/ [One directory per year]
    +-- 117/ ... [One directory per congress session number]
    +--	hist_fy21/ [Historical data through 2021 from Office of Management and Budget (OMB)]
        +-- [51 XLSX files containing data].
    +-- supplemental/
        +-- [XLSX files containing supplemental budget data]
    +--	upcoming_house_floor/
        +-- [JSON files per week containing bill activities that week]
+-- tasks/
    +-- [PY files for each type of data that can be scraped and delivered]
    +--	[amendments, bills, committees, govinfo, nominations, votes, upcoming, etc.]
+-- scripts/
    +-- [SH scripts to transform raw JSON and XML data into forms usable for govtrack and other utilities.]
+-- cache/
+-- test/
    +-- [Test scripts, not exhaustive]
+-- contrib/

`data/hist_fy21` folder empty

Command:
gunicorn -b localhost:5000 govstat:app

[2022-02-20 10:47:40 -0500] [964739] [INFO] Starting gunicorn 20.1.0
[2022-02-20 10:47:40 -0500] [964739] [INFO] Listening at: http://127.0.0.1:5000 (964739)
[2022-02-20 10:47:40 -0500] [964739] [INFO] Using worker: sync
[2022-02-20 10:47:40 -0500] [964743] [INFO] Booting worker with pid: 964743
[2022-02-20 10:47:41 -0500] [964743] [ERROR] Exception in worker process
Traceback (most recent call last):
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/arbiter.py", line 589, in spawn_worker
    worker.init_process()
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/workers/base.py", line 134, in init_process
    self.load_wsgi()
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/workers/base.py", line 146, in load_wsgi
    self.wsgi = self.app.wsgi()
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/app/base.py", line 67, in wsgi
    self.callable = self.load()
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/app/wsgiapp.py", line 58, in load
    return self.load_wsgiapp()
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/app/wsgiapp.py", line 48, in load_wsgiapp
    return util.import_app(self.app_uri)
  File "/home/acxz/venvs/govstat-venv/lib/python3.10/site-packages/gunicorn/util.py", line 359, in import_app
    mod = importlib.import_module(module)
  File "/usr/lib/python3.10/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1050, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1027, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1006, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 688, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 883, in exec_module
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "/home/acxz/vcs/git/github/acxz/govstat/govstat.py", line 1, in <module>
    from app import app
  File "/home/acxz/vcs/git/github/acxz/govstat/app/__init__.py", line 22, in <module>
    from app import routes, models
  File "/home/acxz/vcs/git/github/acxz/govstat/app/routes.py", line 8, in <module>
    import app.Budget as Budget
  File "/home/acxz/vcs/git/github/acxz/govstat/app/Budget.py", line 13, in <module>
    EXCEL_FILES = sorted(os.listdir(EXCEL_DIR))
FileNotFoundError: [Errno 2] No such file or directory: '/home/acxz/vcs/git/github/acxz/us/data/hist_fy21'
[2022-02-20 10:47:41 -0500] [964743] [INFO] Worker exiting (pid: 964743)
[2022-02-20 10:47:42 -0500] [964739] [INFO] Shutting down: Master
[2022-02-20 10:47:42 -0500] [964739] [INFO] Reason: Worker failed to boot.

stevesdawg / govstat Goto Github PK

govstat's Introduction

govstat

Dependencies:

Data Sources

Flask MySQL DB Creation

Start a MySQL Server.

Add Configuration Information:

Initialize and Start flask.

Populate DB

Webapp Entrypoint

Gunicorn, NGINX, and Supervisor Configuration

Directory Structure

govstat's People

Contributors

Stargazers

Watchers

Forkers

govstat's Issues

There are tons of elections in this country.

Federal:

State:

Local:

Recommend Projects

Recommend Topics

Recommend Org

Initialize and Start `flask`.