Comments (4)
What error message did you receive exactly? The -protein
option (as well as the -dna
option) forces the encoder to use a specific alphabet definition, while otherwise the alphabet is determined from the characters encountered in the input sequence. Check core/alphabet.c
for the exact DNA and protein alphabet definitions, they are hardcoded there and used when you specify -protein
or -dna
. If there are no lower case characters in these definitions then they are counted as invalid. If that is not desired then either a different alphabet definition must be used or the hard-coded alphabet must be extended (maybe the better option).
Regarding the wrong line number: I assume that is an simple off-by-one error due to 0- vs. 1-based indexing. I am sure I can fix this easily if you tell me which error you got.
from genometools.
The error message is:
"bin/gt encseq encode: error: illegal character 's': file "encseq_error_minex.fasta", line 3"
There are indeed only upper case letters hardcoded in alphabet.c.
from genometools.
I see. It may be worth the effort of extending it by lowercase characters then (I assume that's the way it is already done for DNA).
from genometools.
These alphabet definitions are pretty old, I think @stefan-kurtz defined them way before GenomeTools was started. I vaguely remember that there was a reason why protein characters have to be upper case, but I can't recall it. Maybe Stefan knows more.
from genometools.
Related Issues (20)
- Running LTRharvest/digest with parallel HOT 5
- gt gff3 -sortlines fails on particular gff HOT 11
- Difficulty extracting intronic regions HOT 2
- There is no -retainids when using ltrdigest, Why don't add one? HOT 2
- We need to run tirvish two times for a single genome? HOT 1
- Install error: cairo.h: No such file or directory on RedHat 7 HOT 2
- How to test whether the GenomeTools library is installed properly HOT 1
- After installation, how to use and test whether the function is normal? HOT 4
- bed_to_gff3 bug HOT 9
- error: ignoring return value of 'fwrite', declared with attribute warn_unused_result [-Werror=unused-result] HOT 3
- LtrPipeline: GenomeTools failed to run ltrharvest. Error code: 139 HOT 4
- gff3validator "Sequence Ontology" out of date? HOT 3
- Assertion error: gt_feature_node_remove_leaf HOT 2
- gt gff3 loss of intergenic regions HOT 1
- sketch all non-overlapping genes on one track HOT 4
- -Werror is for CI + developer only HOT 4
- Fails to build on macOS Ventura / Xcode 14.x HOT 3
- New build warnings on GCC 13 turned errors are blocking builds HOT 1
- Build time issue with GenomeTools 1.5.9 HOT 5
- Aborted (core dumped) with LTR harvest HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from genometools.