Coder Social home page Coder Social logo

tkmanabat / sars-cov-2-clade-identifier Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 9.26 MB

SARS-CoV-2 Clade identifier with SNP data using machine learning.

Jupyter Notebook 100.00%
genomics bioninformatics sars-cov-2 clade single-nucleotide-polymorphisms phylogenetics

sars-cov-2-clade-identifier's Introduction

drawing

SARS-CoV-2-Clade-Identifier

Background and Objective ๐Ÿ“ฐ

This is the code repository for SARS-CoV-2 clade identifier with Single Nucleotide Polymorphisms (SNPs) data using machine learning. A project of the Philippine Genome Center's 2021 Internship Program.

General Objective: Use Machine Learning to determine the Clades using SNPs of a given SARS-CoV-2 given sequence.

Specific Objectives:

  • Use Machine Learning as another lens to understand SARS-CoV-2.
  • Provide an alternative way for identifying clades.
  • Proof of concept for future studies.

Project Workflow:

Code Organization ๐Ÿคนโ€โ™‚๏ธ

This repository is divided into five parts:

  • Dataset Gathering/: contains steps on how SARS-CoV-2 sequences was collected/downloaded.
  • Variant Calling Workflow/: contains information on was processing done to get the SNPs from the sequences.
  • Clade Assignment/: contains steps on how clade assignment and processing was done.
  • Concatenation and Data Preprocessing/: contains the notebook for concatenating the SNPs from the variant calling workflow and the clade from the clade assignment workflow.
  • Machine Learning/: contains the machine learning models trained for this projects.

Setup ๐Ÿ’ป

To get started:

  • Clone the repository
git clone https://github.com/tkmanabat/SARS-CoV-2-Clade-Identifier.git
  • In order that to run the notebooks, all dependencies must be installed. We provided a list of dependencies in requirements.txt
pip install -r requirements.txt
  • To open with Jupyter Notebook
jupyter notebook

sars-cov-2-clade-identifier's People

Contributors

tkmanabat avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.