Coder Social home page Coder Social logo

mars-update-pipeline's People

Contributors

fnyasimi avatar

Watchers

 avatar  avatar  avatar

mars-update-pipeline's Issues

Database update

Hi @kipkurui, I have managed to write an sql script of the database here.

Kindly review it and give feedback. I hope you should be able to run it well from your side.

Description

The script starts with the sql commands for building the database and the required tables then this is followed by inserting the data into the tables which takes in majority of the script.

Challenge

I have found a challenge in updating the foreign key and and constrains which have run me into multiple errors.

I ended up commenting them out of the code. Kindly help me address this problem and updating all the required and necessary keys & constrains into the script.

A quick question on PBM_Data table I didnt understand what PBM_DEBRU meant...

Looking forward to hearing from you.

Fnyasimi/Msc_Project#22

bed2chipseq.sh

`#!/usr/bin/env bash

function get_fasta(){

#check the size of the input sequence of reduce the processing time form large files (assumes sorted file)

lenbed=$(wc -l $bed_in | cut -f1 -d " ")
lenuse=$(($lenbed/10))
if [ "$lenbed" -gt 5000 ];
    then
        cut -f 1,2,3 $bed_in | head -$lenuse | bed-widen -width $len  >$bed_wide.bed
 else
        cut -f 1,2,3 $bed_in | bed-widen -width 100  >$bed_wide.bed
fi

#Extract negative bed and the fasta
python extractnegative.py $bed_wide.bed $bed_wide.negbed 500

fastaFromBed -tab -fi $hg -bed $bed_wide.bed -fo $bed_wide.fas
fastaFromBed -tab -fi $hg -bed $bed_wide.negbed -fo $bed_wide.negfas

cut -f 4 $bed_in >/tmp/f1
cut -f 1 $bed_wide.fas >/tmp/f2
cut -f 2 $bed_wide.fas >/tmp/f3
paste /tmp/f2 /tmp/f1 /tmp/f3  >$bed_wide.fas

cut -f 4 $bed_in >/tmp/f1
cut -f 1 $bed_wide.negfas >/tmp/f2
cut -f 2 $bed_wide.negfas >/tmp/f3
paste /tmp/f2 /tmp/f1 /tmp/f3  >$bed_wide.negfas

python removemasked.py $bed_wide.fas $bed_wide.fa
python removemasked.py $bed_wide.negfas $bed_wide.negfa

# use the length of the available sequences to determine the size of test and negative sequences

lenfa=$(wc -l $bed_wide.fa | cut -f1 -d " ")
len_negfa=$(wc -l $bed_wide.negfa | cut -f1 -d " ")
lenuse=$(($lenbed/20))

if [ $lenuse -gt 500 ];
    then
        lenuse=$lenuse
 else
        lenuse=500
 fi


if [ "$lenfa" -gt $lenuse ] &&  [ "$len_negfa" -gt $lenuse ];
    then
        cutoff=$lenuse
elif [ "$lenfa" -lt "$len_negfa" ] ;
    then
        cutoff=$lenfa
 else
        cutoff=$len_negfa
fi

head -$cutoff $bed_wide.fa >$bed_wide.posneg
head -$cutoff $bed_wide.negfa >>$bed_wide.posneg

#clean up the temporary files
#rm $bed_wide.neg*
#rm $bed_wide.fa*
rm /tmp/f*

}
bed_in=$1
bed_wide=$2
hg=$3
len=${4:-100} #Use a default f 100 if not provided
get_fasta`

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.