mm9 vs. mm10 (cistopic issue, closed)

JBreunig commented on July 19, 2024
mm9 vs. mm10

from cistopic.

Comments (5)

cbravo93 commented on July 19, 2024

Hi @JBreunig !

Regarding rGREAT: I can only reproduce this error when my regions are in mm10 but I select mm9 as the genome (or vice versa). Which genome assembly is your data in, mm9 or mm10? If it is mm10, this code should work:

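# Run GREAT with the assembly the regions are actually in (mm10 here)
cisTopicObject <- GREAT(cisTopicObject, genome='mm10', fold_enrichment=2, geneHits=1, sign=0.05, request_interval=10)

# The feather database used here is for mm9, so the mm10 regions are lifted over to mm9 first
pathToFeather <- '/media/data/lcb/icistarget/data/rankings/feather/mm9-DHS_enh_prom_pennstate-9species.all_regions.mc9nr.feather'
library(R.utils)      # gunzip()
library(rtracklayer)  # import.chain() and liftOver()

# Download and unpack the mm10 -> mm9 chain file
url <- "http://hgdownload.soe.ucsc.edu/goldenPath/mm10/liftOver/mm10ToMm9.over.chain.gz"
mm10Tomm9.chain <- "mm10Tomm9.over.chain"
download.file(url, destfile = paste0(mm10Tomm9.chain, ".gz"))
gunzip(paste0(mm10Tomm9.chain, ".gz"))
mm10Tomm9.chain <- import.chain(mm10Tomm9.chain)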

# Obtain liftOver dictionary (as list)
mm10_coord <- cisTopicObject@region.ranges
mm10_to_mm9_list <- liftOver(mm10_coord, mm10Tomm9.chain)

cisTopicObject <- binarizedcisTopicsToCtx(cisTopicObject, liftOver=mm10_to_mm9_list, genome='mm9')
cisTopicObject <- scoredRegionsToCtx(cisTopicObject, liftOver=mm10_to_mm9_list, genome='mm9')
cisTopicObject <- topicsRcisTarget(cisTopicObject, genome='mm9', pathToFeather, reduced_database=FALSE, nesThreshold=3, rocthr=0.005, maxRank=20000, nCores=4)
cisTopicObject <- getCistromes(cisTopicObject, annotation = 'Both', nCores=5)

Let me know if this is not the case!

C
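
A quick sanity check that may be worth running after the liftOver step above: count how many regions failed to map or were split across the two assemblies. This is only a sketch. `summarize_liftover` is a hypothetical helper, not part of cisTopic or rtracklayer, and it assumes `lengths()` gives the number of mapped ranges per input region, which should hold for the list returned by `liftOver`. The toy list below is a stand-in so the snippet runs without Bioconductor.

```r
# Hypothetical helper (not part of cisTopic): summarise a liftOver-style result,
# i.e. a list with one element per source region holding its mapped ranges.
summarize_liftover <- function(lifted) {
  n <- lengths(lifted)  # number of target ranges per source region
  c(unmapped = sum(n == 0),  # regions with no counterpart in the target assembly
    unique   = sum(n == 1),  # regions that map to exactly one range
    split    = sum(n > 1))   # regions broken into several ranges
}

# Toy stand-in for a liftOver result (made-up coordinates, for illustration only)
toy <- list(
  r1 = "chr1:100-200",
  r2 = character(0),
  r3 = c("chr2:5-50", "chr2:60-90")
)
liftover_summary <- summarize_liftover(toy)
print(liftover_summary)
```

With the real object this would be called as `summarize_liftover(mm10_to_mm9_list)`; a large `unmapped` count would suggest the wrong chain file or assembly.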

JBreunig commented on July 19, 2024

This is almost certainly my issue as we used mm10. Thanks for the code and help solving it.

JBreunig commented on July 19, 2024

Hi Carmen,
Your liftover code worked fine. However, I'm running out of memory at the topicsRcisTarget line:

cisTopicObject <- topicsRcisTarget(cisTopicObject, genome='mm9', pathToFeather, reduced_database=FALSE, nesThreshold=3, rocthr=0.005, maxRank=20000, nCores=4)

(Running Ubuntu 18.04 on a dual Xeon workstation with 128 GB of RAM and a 32 GB swap disk.) Memory use starts at 8 GB of RAM after running the command but slowly grows until it exhausts both RAM and swap, and then the command fails with an error. This is a dataset of 2,695 mouse tumor cells.

Initial settings:

cisTopicObject <- runModels(cisTopicObject, topic=c(20), seed=987, nCores=14, burnin = 200, iterations = 250, returnType = "selectedModel", addModels=FALSE)

cisTopicObject <- selectModel(cisTopicObject, select=20)

I should add that I've had issues with corrupted feather databases with SCENIC but it doesn't look like I can checksum the mouse feather file mm9-regions-9species.all_regions.mc9nr.feather with the sha256sum.txt file. (It takes 5 days to download each feather file.)

Any suggestions? Thanks in advance!
Josh
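
For the checksum question above, one possible approach is to look the file up in the manifest and compare hashes from within R. The sketch below assumes `sha256sum.txt` uses the standard `sha256sum` layout (a 64-character hex hash, whitespace, then the file name) and that the `digest` package is installed. `read_manifest` and `verify_file` are hypothetical helper names. If the file is not listed in the manifest, as seems to be the case here, `verify_file` returns `NA` rather than guessing.

```r
# Pattern for one manifest line: "<64 hex chars><whitespace>[*]<file name>"
manifest_pattern <- "^([0-9A-Fa-f]{64})[[:space:]]+\\*?(.*)$"

# Hypothetical helper: read a sha256sum-style manifest into a named vector
# (names = base file names, values = lower-case hashes).
read_manifest <- function(manifest_path) {
  lines <- readLines(manifest_path, warn = FALSE)
  lines <- lines[grepl(manifest_pattern, lines)]  # skip blank or malformed lines
  hash <- sub(manifest_pattern, "\\1", lines)
  file <- sub(manifest_pattern, "\\2", lines)
  setNames(tolower(hash), basename(file))
}

# Hypothetical helper: TRUE/FALSE if the file is listed, NA if it is not.
verify_file <- function(path, manifest_path) {
  expected <- read_manifest(manifest_path)[basename(path)]
  if (is.na(expected)) return(NA)  # file not listed in the manifest
  if (!requireNamespace("digest", quietly = TRUE)) {
    stop("the 'digest' package is needed to hash the file")
  }
  actual <- digest::digest(path, algo = "sha256", file = TRUE)
  identical(tolower(actual), unname(expected))
}
```

Usage would look like `verify_file(pathToFeather, "sha256sum.txt")`, with both paths pointing at local copies.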

cbravo93 commented on July 19, 2024

Hi @JBreunig !

Can you try reducing the number of cores?

Cheers!

C

JBreunig commented on July 19, 2024

Reducing the cores worked...thanks!
Best,
J
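
The fix in this thread suggests that memory use scales with the number of workers. A rough way to pick `nCores` up front is to divide the available memory by the peak memory of a single worker. The sketch below is only that arithmetic: `pick_cores` is a hypothetical helper, and the 50 GB per-worker figure is an illustrative assumption, not a measurement from this thread. Measure a single-core run on your own data to get a real number.

```r
# Hypothetical helper: cap the worker count by a memory budget.
# total_gb      - RAM on the machine
# per_worker_gb - peak memory of ONE worker (assumed; measure with nCores=1)
# max_cores     - the number of cores you would otherwise request
# reserve_gb    - headroom left for the OS and the main R session
pick_cores <- function(total_gb, per_worker_gb, max_cores, reserve_gb = 8) {
  stopifnot(total_gb > 0, per_worker_gb > 0, max_cores >= 1)
  usable <- max(total_gb - reserve_gb, 0)
  fit <- floor(usable / per_worker_gb)  # workers that fit in usable memory
  as.integer(max(1, min(max_cores, fit)))  # always allow at least one worker
}

# 128 GB machine as in this thread; 50 GB per worker is a made-up example value
n_cores <- pick_cores(total_gb = 128, per_worker_gb = 50, max_cores = 4)
print(n_cores)
```

The result can then be passed as `nCores=n_cores` in the `topicsRcisTarget` call.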
