cmu-safari / grim


Source code of the processing-in-memory simulator used in the GRIM-Filter paper published in BMC Genomics in 2018: "GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping using Processing-in-Memory Technologies" (preliminary version at https://arxiv.org/pdf/1711.01177.pdf)


grim's Introduction

GRIM-Filter

GRIM-Filter is an algorithm optimized to exploit 3D-stacked memory systems that integrate computation within a logic layer stacked under memory layers, to perform processing-in-memory (PIM). GRIM-Filter quickly filters seed locations by 1) introducing a new representation of coarse-grained segments of the reference genome, and 2) using massively-parallel in-memory operations to identify read presence within each coarse-grained segment.
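The two ideas above can be illustrated with a minimal software sketch. It is not the code in this repository: it assumes each coarse-grained segment (bin) is summarized by a bitvector with one presence bit per possible token, and that a bin passes the filter when enough of a read's tokens are present in it. The names `TOKEN_SIZE`, `build_bin_bitvector`, `count_read_tokens_in_bin`, and `bin_passes_filter` are hypothetical, and the sketch runs sequentially on a CPU rather than in a 3D-stacked memory logic layer.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define TOKEN_SIZE 3                          /* hypothetical; kept small for illustration */
#define NUM_TOKENS (1u << (2 * TOKEN_SIZE))   /* 4^TOKEN_SIZE possible tokens */
#define BV_BYTES   ((NUM_TOKENS + 7) / 8)     /* one presence bit per token */

/* Map A/C/G/T to 0..3; return -1 for any other character (e.g. N). */
int base_code(char c) {
    switch (c) {
    case 'A': case 'a': return 0;
    case 'C': case 'c': return 1;
    case 'G': case 'g': return 2;
    case 'T': case 't': return 3;
    default: return -1;
    }
}

/* Encode TOKEN_SIZE bases starting at s as a 2-bit-per-base index,
   or return -1 if the window contains a non-ACGT character. */
long token_index(const char *s) {
    long idx = 0;
    for (int i = 0; i < TOKEN_SIZE; i++) {
        int code = base_code(s[i]);
        if (code < 0) return -1;
        idx = (idx << 2) | code;
    }
    return idx;
}

/* Build the bitvector of one bin: set the bit of every token that occurs in it. */
void build_bin_bitvector(const char *bin_seq, uint8_t bv[BV_BYTES]) {
    size_t len = strlen(bin_seq);
    memset(bv, 0, BV_BYTES);
    for (size_t i = 0; i + TOKEN_SIZE <= len; i++) {
        long idx = token_index(bin_seq + i);
        if (idx >= 0) bv[idx / 8] |= (uint8_t)(1u << (idx % 8));
    }
}

/* Count how many of the read's tokens are marked present in the bin. */
int count_read_tokens_in_bin(const char *read, const uint8_t bv[BV_BYTES]) {
    size_t len = strlen(read);
    int hits = 0;
    for (size_t i = 0; i + TOKEN_SIZE <= len; i++) {
        long idx = token_index(read + i);
        if (idx >= 0 && ((bv[idx / 8] >> (idx % 8)) & 1u)) hits++;
    }
    return hits;
}

/* A bin passes the filter when at least `threshold` read tokens are present. */
int bin_passes_filter(const char *read, const uint8_t bv[BV_BYTES], int threshold) {
    assert(threshold >= 0);
    return count_read_tokens_in_bin(read, bv) >= threshold;
}
```

In this sketch a caller would build one bitvector per bin once, then call `bin_passes_filter` for each candidate bin of a read and skip the seed locations in bins that fail. The per-bin checks are independent of each other, which is what makes this kind of filter a natural fit for parallel in-memory execution.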

Our code baseline is taken from mrFAST_v2.6.1.0, which is described in detail in the mrFAST publications.

While we use mrFAST as a baseline, GRIM-Filter can be adapted to run with any other read mapper.

The GRIM-Filter algorithm is described in: J.S. Kim et al., "GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping using Processing-in-Memory Technologies", BMC Genomics, 2018.

Prerequisites

To run GRIM-Filter, you need the following files:

Human Genome FASTA file (e.g., Human_g1k_v37 Genome)
Read Sequence data sets (FASTA file)

Getting Started

To build mrFAST with GRIM-Filter, run:

$ make

To build the hash table used by mrFAST, run the following command:

./mrfast --index <Genome FASTA File>

The mrFAST User Manual describes the parameters for hash table generation in more detail.

To build the bitvectors that are referenced by GRIM-Filter, run the following command:

./mrfast --index <Genome FASTA File> -t 0 -k <Number of Bins> -b <Token Size> -f <Number of Tokens the Bitvector can Count (1)>

This will generate a .bv file in the same directory as your Genome FASTA File.
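To get a feel for how the number of bins (`-k`) and the token size (`-b`) affect the amount of bitvector data, the following sketch estimates the size under one assumption: each bin stores exactly one presence bit per possible token, i.e. 4^(token size) bits per bin, tightly packed. The actual layout of the .bv file is not described here and may differ (for example, it may contain extra metadata), so treat the result as a rough estimate. The function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Bits for one bin, assuming one presence bit per possible token:
   4^token_size = 2^(2 * token_size). */
uint64_t bits_per_bin(unsigned token_size) {
    assert(token_size > 0 && token_size < 32); /* keep the shift within 64 bits */
    return (uint64_t)1 << (2 * token_size);
}

/* Estimated bytes for all bins, assuming tightly packed bits and no header.
   The multiplication is not overflow-checked in this sketch. */
uint64_t total_bitvector_bytes(uint64_t num_bins, unsigned token_size) {
    uint64_t bits = num_bins * bits_per_bin(token_size);
    return (bits + 7) / 8;
}
```

Under this assumption, increasing the token size by one multiplies the data per bin by four, while the total grows linearly with the number of bins.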

You can then use the bitvectors by running mrfast with the following command:

./mrfast --search <Genome FASTA File> -b <Token Size> -t 1 -e <Error Tolerance (%)> -k <Number of Bins> -q 1 --seq <Read Sequences FASTA File>
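The token size and error tolerance interact: the more errors are allowed, the fewer tokens of a read are guaranteed to appear in the reference segment it maps to. The sketch below works through this with the standard q-gram lemma, under which a read of length L has L - b + 1 tokens of size b and each error can destroy at most b of them. How this code base derives its own threshold is not stated here, so both the rounding in `allowed_errors` and the bound in `min_matching_tokens` are assumptions for illustration, and the function names are hypothetical.

```c
#include <assert.h>

/* Hypothetical conversion from an error tolerance in percent to an
   error count: floor(read_len * error_percent / 100). */
int allowed_errors(int read_len, int error_percent) {
    assert(read_len > 0 && error_percent >= 0 && error_percent <= 100);
    return (read_len * error_percent) / 100;
}

/* q-gram lemma lower bound on the number of read tokens that must still
   be present after `errors` edits; clamped at 0, where the bound no
   longer allows rejecting anything. */
int min_matching_tokens(int read_len, int token_size, int errors) {
    assert(token_size > 0 && token_size <= read_len && errors >= 0);
    int total = read_len - token_size + 1;   /* tokens in the read */
    int bound = total - errors * token_size; /* each error hits at most token_size tokens */
    return bound > 0 ? bound : 0;
}
```

For example, a 100-base read with a token size of 5 has 96 tokens. With a 5% tolerance (5 errors under the assumed rounding), at least 96 - 25 = 71 of them must still be present in the correct segment.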

Contributors

Jeremie S. Kim (Carnegie Mellon University)
