This application provides a platform for users to learn about the genetics of diseases, as well as genetics of common appearance traits. Users can search for genes that are associated with disease names and phenotype descriptions as well as DNA or protein sequences. It makes the world of genetics easy to explore for younger students or curious adults and would be of great use to high school science classrooms.
-
Clone the repository to your computer
git clone https://github.com/xinyu-g/geneEd.git
-
Create a new conda environment for GeneEd. If you do not have conda, install Miniconda from here first.
conda env create -f environment.yaml
- concatenate all drugsGenes, genePhenoDis, and geneProSeq into 3 files. Remove duplicate headers (column metadata). Place these files in data/.
- Run
create_relations.py
to get 5 csv files representing the relations. - Copy the generated SQL files to MySQL's uploads directory. On windows, this is in
C:\ProgramData\MySQL\MySQL Server 8.0\Uploads\
. This is the only directoru that MySQL trusts by default. - Run
sql/data.sql
in MySQL workbench or a shell to build the schema and insert data into all tables but the gene table. - use
sql/users.sql
to create the user and favoriyes table. - Run
csv2sql.py
on gene.csv to generate gene.sql. This will take some time but reports progress. - Log in to the database with a commandline client. Select the geneEd database. Run
source gene.sql
to populate the database. Some records will fail to insert but this is okay.SELECT COUNT(*) FROM gene
should return 1024. - The database should be complete.
Alternatively, use the dumpfile Hunter created to generate the entire database. (I generated this late last night and did not test).