Coder Social home page Coder Social logo

Comments (15)

childers avatar childers commented on July 21, 2024

Oops, left out the insertion:

Scaffold1   WebApollo   insertion   875615  875615  .   +   .   Name=024CEEAB2AE0C4F9AE1490815C67FDC3;

##FASTA
>024CEEAB2AE0C4F9AE1490815C67FDC3
AAAAAAAAAAAAAAAAACCCCCCCCC

from apollo.

monicacecilia avatar monicacecilia commented on July 21, 2024

@nathandunn Expected output should be "same as above", not a non-coding feature (tRNA in this case) with CDSs.

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

Interesting, when you delete the insertion, everything else stays.

Also, if the insertion precedes the annotation you don't get the extra CDS calculations.

from apollo.

monicacecilia avatar monicacecilia commented on July 21, 2024

just fyi -- and I haven't looked to see if it applies here -- current, sequence alterations cannot overlap each other.

On Nov 26, 2014, at 1:50 PM, Nathan Dunn [email protected] wrote:

Interesting, when you delete the insertion, everything else stays.

Also, if the insertion precedes the annotation you don't get the extra CDS calculations.


Reply to this email directly or view it on GitHub.

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

@monicacecilia Do we only want to calculate for "mRNA" transcripts, or other ones, as well?

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

@monicacecilia It is calculating the CDS when setting the longest ORF. I am telling it to do this only if it is a transcript of type mRNA. Not sure if snRNA, miRNA, etc. should also be included. . . for calculating or if we should exclude tRNA, ncRNA, etc.

from apollo.

selewis avatar selewis commented on July 21, 2024

yes, also exclude tRNA (any kind of ncRNA in other words). Only protein
coding should have the protein coding calculated.

p.s. it would be difficult to allow genomic sequence alterations to overlap
because then we'd have to know the order with which to apply the
alterations. However when/if we reuse this code for indicating natural
variation/alterations we'll need to allow overlaps coming from different
individuals, even if the alterations for each individual are
non-overlapping... (need to draw a picture)

-S

On Wed, Nov 26, 2014 at 2:08 PM, Nathan Dunn [email protected]
wrote:

@monicacecilia https://github.com/monicacecilia It is calculating the
CDS when setting the longest ORF. I am telling it to do this only if it is
a transcript of type mRNA. Not sure if snRNA, miRNA, etc. should also be
included. . . for calculating or if we should exclude tRNA, ncRNA, etc.


Reply to this email directly or view it on GitHub
#30 (comment).

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

@cmdcolin @childers Sorry, I should have fixed this during the break as it was failing the regression tests. The original code was correct, we want to calculate the CDS if a coding transcript. The original fix should do this. I just need to create a test that handles both cases. Unfortunately the default is "transcript", which should not be encoding.

from apollo.

selewis avatar selewis commented on July 21, 2024

to be finicky

"transcript" is not necessarily encoding.

if it is a transcript of type mRNA (a subclass of transcript in SO) then it
is an encoding transcript.

the transcripts of protein coding genes should properly be typed using the
sub-class mRNA

On Mon, Dec 1, 2014 at 8:50 AM, Nathan Dunn [email protected]
wrote:

@cmdcolin https://github.com/cmdcolin @childers
https://github.com/childers Sorry, I should have fixed this during the
break as it was failing the regression tests. The original code was
correct, we want to calculate the CDS if a coding transcript. The original
fix should do this. I just need to create a test that handles both
cases. Unfortunately the default is "transcript", which should not be
encoding.


Reply to this email directly or view it on GitHub
#30 (comment).

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

You are exactly correct. “Transcript” is actually used with pseudogenes and MRNA (at least in our current system) has no sub-classess.

I’m pretty sure this implementation is correct, but I will need others to test. Once I get the rest of the 1.0.3 bugs sorted out, I’ll put it up for testing.

Nathan

On Dec 1, 2014, at 11:02 AM, selewis [email protected] wrote:

to be finicky

"transcript" is not necessarily encoding.

if it is a transcript of type mRNA (a subclass of transcript in SO) then it
is an encoding transcript.

the transcripts of protein coding genes should properly be typed using the
sub-class mRNA

On Mon, Dec 1, 2014 at 8:50 AM, Nathan Dunn [email protected]
wrote:

@cmdcolin https://github.com/cmdcolin @childers
https://github.com/childers Sorry, I should have fixed this during the
break as it was failing the regression tests. The original code was
correct, we want to calculate the CDS if a coding transcript. The original
fix should do this. I just need to create a test that handles both
cases. Unfortunately the default is "transcript", which should not be
encoding.


Reply to this email directly or view it on GitHub
#30 (comment).


Reply to this email directly or view it on GitHub #30 (comment).

from apollo.

monicacecilia avatar monicacecilia commented on July 21, 2024

works now. Thanks!

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

@monicacecilia I want to retest this in 2.0.0 I looked and it appears that we have the correct code in there, but I want some separate eyes to verify.

from apollo.

monicacecilia avatar monicacecilia commented on July 21, 2024

@nathandunn You are correct, genomic insertions are no longer causing the generation of a CDS in tRNA features. However, while investigating this issue, I found a few other things. Please see #263

from apollo.

nathandunn avatar nathandunn commented on July 21, 2024

@deepakunni3 Could this be relevant?

from apollo.

deepakunni3 avatar deepakunni3 commented on July 21, 2024

@nathandunn This bug doesn't occur in Apollo 2.0

The rationale holds for recalculating CDS only when the transcript is an instance of mRNA.

from apollo.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.