
A 1 mb bubble - fact or fiction? (minigraph issue, closed)

lh3 commented on August 18, 2024
A 1 mb bubble - fact or fiction?

from minigraph.

Comments (4)

ASLeonard commented on August 18, 2024

Following up on this, I found an example with a bubble in which both paths are approximately 1.2 Mb (bottom left), and another in which both paths are 13.5 Mb (right). This long region appears in both the --call BED file and the GAF file. The bottom arm of the huge bubble comes from a single assembly (the 2nd used in construction), while the top arm comes from two assemblies (the reference and the 3rd assembly).
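For anyone wanting to screen their own output for bubbles of this size, here is a minimal Python sketch. It assumes only that the first three tab-separated columns of the BED file are chromosome, start, and end, as in standard BED. The function name `large_bubbles`, the 1 Mb threshold, and the example coordinates are made up for illustration. Note that it measures the span on the reference, which is not necessarily the length of every path through the bubble.

```python
from typing import Iterable, List, Tuple

Interval = Tuple[str, int, int]


def large_bubbles(bed_lines: Iterable[str], min_len: int = 1_000_000) -> List[Interval]:
    """Return (chrom, start, end) for intervals spanning at least min_len bp.

    Only the first three BED columns are read; any further columns are ignored.
    """
    hits: List[Interval] = []
    for line in bed_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split("\t")
        if len(fields) < 3:
            continue  # not a usable BED record
        chrom, start, end = fields[0], int(fields[1]), int(fields[2])
        if end - start >= min_len:
            hits.append((chrom, start, end))
    return hits


# Hypothetical records, not taken from the graph discussed in this thread.
example = [
    "chr1\t1000\t5000\textra",
    "chr1\t2000000\t3200000\textra",
    "chr2\t10000000\t23500000\textra",
]

for chrom, start, end in large_bubbles(example):
    print(f"{chrom}:{start}-{end}\t{(end - start) / 1e6:.1f} Mb")
```

With a real file, the lines could be passed in directly, e.g. `large_bubbles(open("sample.bed"))`, where `sample.bed` is a placeholder name.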

[Image: graph view showing the two bubbles described above]


lh3 commented on August 18, 2024

Sorry, I missed the original question. No, this is not an artifact. The alignment allows indels of up to 100 kb, but downstream steps may join long poorly aligned regions.


ASLeonard commented on August 18, 2024

As a follow-up, after swapping the order of the samples used in construction (ref sample_3 sample_2), the 13.5 Mb bubble disappears. As you suggested, this looks like a series of poorly aligned regions that keep jumping locations (within +/- 5 Mb) and orientations for sample_2, which may be why it formed one huge bubble when mapped 2nd but aligns in smaller chunks when mapped 3rd.

I tried increasing the parameters set by -xasm (-g1m -r1m --max-gap-pre=50k --min-cov-blen=1m --min-cov-mapq=20 --max-lc-skip=100 --max-gc-skip=100) to see whether that would overcome the small, poor alignments. There were slightly fewer nodes/edges/links in the graph, but the 13.5 Mb bubble was unchanged.

Are there any parameters or heuristics I could adjust to avoid the case where "downstream steps may join long poorly aligned regions" and instead recover the long stretches of consensus plus the many smaller bubbles seen when swapping the sample order?


lh3 commented on August 18, 2024

You can apply --gg-match-pen=5 or smaller (default value: 10) to reduce large bubbles. However, most of these small bubbles will be sensitive to alignment parameters and input ordering and are probably not biological.
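One way to get a rough sense of how sensitive the bubbles are is to compare the bubble intervals from two runs (for example two sample orders, or two --gg-match-pen values) and see which ones reappear. The Python sketch below is only an illustration under stated assumptions: both runs use the same reference so coordinates are comparable, the first three tab-separated BED columns are chromosome, start, and end, and "stable" is defined as an exact coordinate match, which is a deliberately strict criterion. The names `parse_bed` and `split_by_stability` and the example records are hypothetical.

```python
from typing import Iterable, List, Set, Tuple

Interval = Tuple[str, int, int]


def parse_bed(bed_lines: Iterable[str]) -> Set[Interval]:
    """Collect (chrom, start, end) from the first three BED columns."""
    intervals: Set[Interval] = set()
    for line in bed_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3 or fields[0].startswith("#"):
            continue  # skip comments, blank lines, and short records
        intervals.add((fields[0], int(fields[1]), int(fields[2])))
    return intervals


def split_by_stability(
    run_a: Iterable[str], run_b: Iterable[str]
) -> Tuple[List[Interval], List[Interval], List[Interval]]:
    """Return (shared, only_in_a, only_in_b), each sorted by coordinate."""
    a, b = parse_bed(run_a), parse_bed(run_b)
    return sorted(a & b), sorted(a - b), sorted(b - a)


# Hypothetical records standing in for two runs with different settings.
run_a = ["chr1\t100\t200", "chr1\t5000\t9000", "chr2\t10\t50"]
run_b = ["chr1\t100\t200", "chr1\t5000\t7000", "chr1\t7500\t9000"]

shared, only_a, only_b = split_by_stability(run_a, run_b)
print("shared:", shared)
print("only in run A:", only_a)
print("only in run B:", only_b)
```

Bubbles that appear in only one run are the ones most likely to depend on parameters or input order. A looser criterion, such as reciprocal overlap, would be a reasonable alternative to exact matching.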
