Coder Social home page Coder Social logo

Comments (4)

oschwengers avatar oschwengers commented on August 18, 2024

Hi Ahmed,
Of course, short/long read sequencing indeed can have a strong impact on assembly lengths. Especially short read draft assemblies can suffer from too many contig edges which make it very difficult to detect all CDS proximate or even spanning contig edges. The coding density is only summing up all bases that are part of an annotated genome feature divided by the genome length.
I hope this helps. Otherwise, can you elaborate a bit more?

from bakta.

AhmedElsherbini avatar AhmedElsherbini commented on August 18, 2024

Thank you Oliver for your response.

Summing up all the bases of the annotated genome is (~the sum of the whole length of CDS) / the sum of the total contigs length, right?
if we focus on long read assemble genomes and NO statistical difference between the two species's CDS number, nor total length,
so my guess now for the difference in coding density, could be longer CDS in the more species with high density, or pseudogenes, / transposons (they are short in lenght anyway) in the less dense species. does it make sense now?

Best,
Ahmed

from bakta.

oschwengers avatar oschwengers commented on August 18, 2024

Hi, the coding density is not limited to CDS but comprises all genomic features, e.g. non-coding RNA genes, regulatory elements, DNA motifs, etc.

Regarding your question, I guess in theory yes, that could be, but I'd be rather reluctant to use these kind of statistics. I'd rather directly compare certain genes presence/absence, etc.

from bakta.

AhmedElsherbini avatar AhmedElsherbini commented on August 18, 2024

Absolutely, you are right.

I followed this with tools like Panaroo and PPanGGolin for gene presence/absence comparison. Mobilome ( insertion elements ) is the main thing being enriched in the sister species with lower coding density.

Just, I wanted to investigate the causality of this coding density, as I get a lot of questions regarding this 2 % difference.

from bakta.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.