Coder Social home page Coder Social logo

Comments (4)

satta avatar satta commented on May 30, 2024

What error message did you receive exactly? The -protein option (as well as the -dna option) forces the encoder to use a specific alphabet definition, while otherwise the alphabet is determined from the characters encountered in the input sequence. Check core/alphabet.c for the exact DNA and protein alphabet definitions, they are hardcoded there and used when you specify -protein or -dna. If there are no lower case characters in these definitions then they are counted as invalid. If that is not desired then either a different alphabet definition must be used or the hard-coded alphabet must be extended (maybe the better option).
Regarding the wrong line number: I assume that is an simple off-by-one error due to 0- vs. 1-based indexing. I am sure I can fix this easily if you tell me which error you got.

from genometools.

kowsky avatar kowsky commented on May 30, 2024

The error message is:
"bin/gt encseq encode: error: illegal character 's': file "encseq_error_minex.fasta", line 3"

There are indeed only upper case letters hardcoded in alphabet.c.

from genometools.

satta avatar satta commented on May 30, 2024

I see. It may be worth the effort of extending it by lowercase characters then (I assume that's the way it is already done for DNA).

from genometools.

gordon avatar gordon commented on May 30, 2024

These alphabet definitions are pretty old, I think @stefan-kurtz defined them way before GenomeTools was started. I vaguely remember that there was a reason why protein characters have to be upper case, but I can't recall it. Maybe Stefan knows more.

from genometools.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.