Make conda environment.yml (chipseq, CLOSED)

nf-core commented on May 31, 2024
Make conda environment.yml

from chipseq.

Comments (12)

ewels commented on May 31, 2024

Super cool! Yes, phantompeakqualtools is part of spp, so just installing that should do the job. Great work @apeltzer - thanks! That's quite a relief that we don't need to package stuff ourselves.

There are a few other issues that need testing / thinking about now... I've updated the original comment with some checkboxes.

apeltzer commented on May 31, 2024

Just checked and everything should work now (at least from what I see in main.nf). @ewels I will send in a PR from the bioconda branch to master and we'll see what the tests say?

apeltzer commented on May 31, 2024

I figured out 99% of all packages (I hope at least) in bioconda or conda-forge (see the bioconda branch which I committed to).

The missing one is "phantompeakqualtools", but if I'm not mistaken (please check this out, @ewels), this is now also packaged in a CRAN R package as "spp", and is therefore available in bioconda:

https://bioconda.github.io/recipes/r-spp/README.html

If this is the case, we just need to add:

  - bioconda::r-spp=1.15.2

and we would be done with the new environment.yml.
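
A minimal sketch of what that could look like, written as a shell snippet so it can be pasted into a terminal. Only the r-spp pin comes from this thread, and the environment name matches the path quoted elsewhere in the discussion. The channel list and the output location are assumptions, and the real file on the bioconda branch lists many more packages.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical output location; the real file lives in the pipeline repo.
env_file="${ENV_FILE:-environment.yml}"

# Write a stripped-down environment file. Only the r-spp pin is taken from
# the thread; the channel order is an assumption for illustration.
cat > "$env_file" <<'EOF'
name: nfcore-chipseq-1.4dev
channels:
  - conda-forge
  - bioconda
  - defaults
dependencies:
  - bioconda::r-spp=1.15.2
EOF

# Creating the environment needs conda installed, so it is left commented out:
# conda env create -f "$env_file"
echo "wrote $env_file"
```

Prefixing the package with `bioconda::` pins the channel for that one dependency, so the channel order above matters less for r-spp itself.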

apeltzer commented on May 31, 2024

See also here for the last missing package: http://compbio.med.harvard.edu/Supplements/ChIP-seq/

apeltzer commented on May 31, 2024

Only very small tools missing:

  • run_spp.R
  • ngs.plot.R

These are highlighted in the bioconda branch. The docker image builds fine, but I don't know where to get the run_spp.R script from, as the bioconda package doesn't install that unfortunately.

/opt/conda/envs/nfcore-chipseq-1.4dev/lib/R/library/spp/

ewels commented on May 31, 2024

run_spp.R is from phantompeakqualtools. We were downloading this from https://code.google.com/archive/p/phantompeakqualtools/ but it looks like the latest version of the code is now at https://github.com/kundajelab/phantompeakqualtools

So my saying that "phantompeakqualtools is part of spp" was probably wrong, sorry.

ngs.plot.R is found at https://github.com/shenlab-sinai/ngsplot

ewels commented on May 31, 2024

Installation script for these tools by @tiagochst in #10:

ENV NGSPLOT_VERSION="2.63"
RUN curl -fsSL https://github.com/shenlab-sinai/ngsplot/archive/${NGSPLOT_VERSION}.tar.gz -o /opt/ngsplot_${NGSPLOT_VERSION}.tar.gz && \
    tar xvzf /opt/ngsplot_${NGSPLOT_VERSION}.tar.gz -C /opt/ && \
    rm /opt/ngsplot_${NGSPLOT_VERSION}.tar.gz
ENV PATH=${PATH}:/opt/ngsplot-${NGSPLOT_VERSION}/bin
ENV NGSPLOT=/opt/ngsplot-${NGSPLOT_VERSION}/

RUN wget "https://drive.google.com/uc?export=download&id=0B5hDZ2BucCI6SURYWW5XdUxnbW8" -O ngsplotdb_hg19_75_3.00.tar.gz && \
    echo y | ngsplotdb.py install ngsplotdb_hg19_75_3.00.tar.gz && \
    rm -rf  ngsplotdb_hg19_75_3.00.tar.gz && \
    wget "https://drive.google.com/uc?export=download&id=0B5hDZ2BucCI6S3E4dVprdlF2YW8" -O ngsplotdb_hg38_76_3.00.tar.gz && \
    echo y | ngsplotdb.py install ngsplotdb_hg38_76_3.00.tar.gz && \
    rm -rf  ngsplotdb_hg38_76_3.00.tar.gz && \
    wget "https://drive.google.com/uc?export=download&id=0B5hDZ2BucCI6NXNzNjZveXdadU0" -O ngsplotdb_mm10_75_3.00.tar.gz && \
    echo y | ngsplotdb.py install ngsplotdb_mm10_75_3.00.tar.gz && \
    rm -rf  ngsplotdb_mm10_75_3.00.tar.gz 

RUN git clone https://github.com/kundajelab/phantompeakqualtools  && \
    mv phantompeakqualtools /opt/  && \
    echo 'library(caTools)' | cat - /opt/phantompeakqualtools/run_spp.R > temp && mv temp /opt/phantompeakqualtools/run_spp.R && \
    chmod 755 /opt/phantompeakqualtools/* && \
    echo 'alias run_spp.R="Rscript /opt/phantompeakqualtools/run_spp.R"' >> ~/.bashrc 
ENV PATH=${PATH}:/opt/phantompeakqualtools 

Need to think hard about how to handle the reference genomes for ngsplot.

ewels commented on May 31, 2024

The three ngsplot references are actually not super huge:

35M ngsplotdb_hg19_75_3.00.tar.gz
34M ngsplotdb_hg38_76_3.00.tar.gz
21M ngsplotdb_mm10_75_3.00.tar.gz

That's roughly 90M in total, a lot less than I was expecting. This is small enough that it may be possible to add these to the bioconda recipe (in discussion with that community). Alternatively, we could consider adding these to the pipeline repo. Finally, it would be very easy to add these to the AWS-iGenomes reference.

ewels commented on May 31, 2024

xref #10 (comment):

We just need to keep in mind that the run_spp.R has a line missing (library(caTools)) to make it work.

This is something that we can add in using a patch in the bioconda release.
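
One way such a patch step could look, as a rough sketch: prepend the line only if it is not already there, so running it twice is harmless. The function name `ensure_catools` and the demo file are hypothetical, and a real bioconda recipe would more likely carry a proper patch file.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: make sure an R script loads caTools before anything else.
ensure_catools() {
    local script="$1"
    local tmp
    # Nothing to do if a line already starts with the library call.
    if grep -q '^library(caTools)' "$script"; then
        return 0
    fi
    tmp="$(mktemp)"
    { echo 'library(caTools)'; cat "$script"; } > "$tmp"
    # Write back through the existing file so its permissions are kept.
    cat "$tmp" > "$script"
    rm -f "$tmp"
}

# Demo on a throwaway stand-in file, not the real run_spp.R.
demo="$(mktemp)"
echo 'args <- commandArgs(trailingOnly=TRUE)' > "$demo"
ensure_catools "$demo"
ensure_catools "$demo"   # second call changes nothing
head -n 1 "$demo"
```

This does the same job as the `echo ... | cat - run_spp.R > temp && mv temp run_spp.R` line in the installation script from #10, with a guard added so repeated builds do not stack up duplicate `library(caTools)` lines.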

@apeltzer - is this packaging something that you're keen / able to work on?

apeltzer commented on May 31, 2024

Hi everyone! I just asked in the bioconda gitter whether they feel it's alright, and I will do it once it's confirmed that it's okay to package something like this!

Absolutely happy to create a bioconda package for that specific purpose - shouldn't be too difficult to do that!

apeltzer commented on May 31, 2024

So all conda packages are there and I added run_spp.R to the bin folder of the bioconda branch.
Will adjust calls to tools now!

ewels commented on May 31, 2024

Sounds good 👍
