Coder Social home page Coder Social logo

published in: Journal of Proteome Research

MegaGO

Calculate semantic distance for sets of Gene Ontology terms.

Getting Started

These instructions will get you a copy of the project up and running on your local machine.

Prerequisites

Scripts are written in python 3. One easy way to get started is installing miniconda 3.

On linux:

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh

Installing

Clone the repository:

git clone https://github.com/MEGA-GO/Mega-Go.git

Install package:

cd Mega-Go
pip install -U .

Execute example analysis:

megago sample7.txt sample8.txt

These files can be found here:

How does it work?

MegaGO calculates the similarity between GO terms with the Lin semantic similarity (simLin) metric 1.

where:

  • MICA: most informative common ancestor.
  • IC(goi): information content of the term goi.

The information content of a go term is calculated as follows:

The frequency p of a term go is defined as:

where:

  • c: children of go.
  • N: total number of terms in GO corpus.
  • ngo': number of occurences of a term go' in a reference data set.

To calculate the similarity of two sets of terms, the best match average (BMA)1 is used.

where:

  • m,n: number of terms in set gi and gj, respectively
  • sim(go1i,go2j): similarity between two GO terms

1: Lin, Dekang. 1998. β€œAn Information-Theoretic Definition of Similarity.” In Proceedings of the 15th International Conference on Machine Learning, 296β€”304.

Interpretation

The relative similarity ranges between 0 and 1.

sim(go1i,go2j) value Interpretation
>0.9 highly similar functions
0.3-0.9 functionally related
<0.3 not functionally similar

License

This project is licensed under the MIT License - see the LICENSE file for details

MegaGO's Projects

megago icon megago

Calculate semantic distance for sets of Gene Ontology terms

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.