This repo contains scripts and data for extracting a multipartite graph from EpiGraphDB so that it can be embedded and clustered.
To recreate the data from raw:
- From the base directory, run `scripts/process_raw.sh` to unzip and prepare raw data
- Run `scripts/create_edgelist.py` to generate the edgelist
- (optional) To regenerate the schema image, install graphviz and run the following command: `dot -Tpng scripts/schema.gv -o schema.png`
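
The steps above can also be chained from a small driver script. The following is a minimal sketch, not part of the repo: the function names `build_commands` and `run_pipeline` are hypothetical, and it assumes that `scripts/process_raw.sh` can be run with `bash` and that `scripts/create_edgelist.py` takes no arguments and runs under the current Python interpreter.

```python
import subprocess
import sys
from pathlib import Path


def build_commands(include_schema=False):
    """Return the README steps as argument lists, in the order they should run."""
    commands = [
        # Step 1: unzip and prepare the raw data (assumes a bash-compatible script).
        ["bash", "scripts/process_raw.sh"],
        # Step 2: generate the edgelist (assumes the script needs no arguments).
        [sys.executable, "scripts/create_edgelist.py"],
    ]
    if include_schema:
        # Optional step: regenerate the schema image; requires graphviz's `dot`.
        commands.append(["dot", "-Tpng", "scripts/schema.gv", "-o", "schema.png"])
    return commands


def run_pipeline(base_dir=".", include_schema=False):
    """Run each step from the base directory, stopping at the first failure."""
    for command in build_commands(include_schema):
        # check=True raises CalledProcessError if a step exits with a non-zero status.
        subprocess.run(command, cwd=Path(base_dir), check=True)
```

Calling `run_pipeline(include_schema=True)` from the base directory would run all three steps in sequence. Keeping the command list in a separate function makes it possible to inspect the steps without executing them.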