Coder Social home page Coder Social logo

mitefinder's People

Contributors

jhu99 avatar nwpuzhengyan avatar wangjingru avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

mitefinder's Issues

Errorneous MITEFinderII output

I am trying to identify MITE in a published genome sequence using MITEFinder II in Ubuntu14.04 LTS OP system. The following script was used:

./miteFinder -input my/fast/file.fa -output my/mitefinder/out -pattern_scoring ./profile/pattern_scoring.txt -threshold 0.5

The program ran smoothly to produce a fasta seq output of probable MITEs. However, the file consisted of some invalid characters.

image

Please advise to sort the above problem.

Moreover, it will be good if you can throw some light on (i) explaining the header line of the output fasta sequences; (ii) Possible methods to identify TSD and TIR sequences; (iii) How to classify MITEs based on family; and (iv) tool for design primers from MITE-flanking sequence.

Thanks
Dip

How to keep the actual chromosome name in the output?

Hi,
Thanks for this good tool.
I was wondering, how do you keep the original chromosome name in output fasta header?
miteFInder renames chromosomes to 1,2,3... But it is confusing when we want to do some downstream analysis
do you have any suggestions?
Thanks

for example
`>mite|1|17871177|17871187|17871261|17871271|t4|17871184|m1|ave_score:0.849149
ATTGTTGTTTGGATTGTTGTTTAAGTTTTTTTTTGGAATTATTTTGTGTTTTTGTTAAAA
TTCAAAACGTTTTCCTGTTTTTAAAACGTTTTTTGATTTTTAAAAACGTTTTTCAAGTTT
CCAATAATAAACTATGTATAATAAAACGTTTTTAAAAAATTTTAAAAATATTTAACAACA
TTTTTCTATAATTATAAAAATTTATAATTTTGTAT

mite|1|19986949|19986959|19987279|19987289|t3|19986953|m1|ave_score:0.707483
ACAAAAAAGTGAAACATAAAATATTTCTAAAAGACTAATTTCTTTTTGATGATATTGATA
CTCATGAATATATTTGATATAAATAATAAAGAAGTGTACGAAAAAAACATAAAAAAAAAA`

expected fasta header

mite|Bna.chrA1|17871177|17871187|17871261|17871271|t4|17871184|m1|ave_score:0.849149
mite|Bna.chrA2|17871177|17871187|17871261|17871271|t4|17871184|m1|ave_score:0.849149

Issues in program

Dear Jialu Hu,

Thank you for your great software!
In your paper you mentioned that you clustered the mite candidates into families. However, I did not find the
code for clustering in your program. Could you give me more details about the clustering procedure?

Best regards,
Max

No header in output

Hello,

I am processing some genomes with mitefinder and here is my commands:
miteFinder_linux_x64 -input genome.fa -output MITE.fa -pattern_scoring ~/Softwares/miteFinder/profile/pattern_scoring.txt -threshold 0.5

In the output of some genomes, there are sequences looks like this:

>mite|17671|2089|2098|2192|2201|t6|2093|m1|ave_score:0.621227
CACACAATAATGTTTAACACACAATAGTGTTTAACACACAATAATGTTTAACACACAATA
GTGTTTAACAAACAATAATGTTTAACACACAATAATGTTTAACACACAATAATGTTTAAC
ACACAATAATGTTTAACACACAATAATGTTTAACACACAATAATGTTTAACACACAATA
CGCGCGAGTAGGTCACCAAGGGGATCTTGTTCCTGCACGACAATG
CCCCGGTTCATAGGGCACTTACAACCCAGAAGAAACTGGCCTACCTGGGCTTCCAGTGTC
TTGATCACCCACCCTATTCTCCGGATCTGGCCCCGTCGGACTATCCCCTGTTCCCTGGAT
TGAAAAACAATTCACAGTTCGCCATTTTTGTCTGATACGCATGCCATTGCTGCGGAGACC
CAGTCGGACGGAAAACTTTCTGATTTTTTTGAGTGACCTGCAAAAGTTAGTGTTTTGAGC
TTCGTGAGAAGTAAGTTGAATAAACCCCGAGTTTGGTCGCTGTAGCTTGTTACCTTCCTG
GTCAGGCCCTATCAGCACCCCCTTGTACACAACAGAACAAGACAGGCAATGTACTTATAA
CATAACAATGAGATGCGTTAGCGTAATCACTGTTGTCATGGAAAGCTATTAAGTATTAAA
TCTTCAGAGTGTGTGCTTTATTTTTAATGTATGAAAAGCGCATGCGCCGAATTATGTTTC
TACCTCTTGCCTATCTGGCTCTGCCACATTTTCCACATTATCTCATAAACGGCACGGTTT
TCCGAAGAGGGGGATATTGAAAATAAAATGCGTTTTTTATTTTCTCTAAAAATATGCTCA
TTCTAAGAGAAATTCAGCGAGATATTATAAAAAATGTACGCAGGTATTCATATACATTAC
GCGTTATTCCTTCAAACTTTAACAAAACTTGAATTCTCTCGATAGATTTTAATGAAACTT
AAATTTTATCGATAGATTTTAATGGAACATGGATTTCCTCGATAGATTTTAATGAAACTT
AAACTTTGTCGATAGATTTTAATGAAACTTTAATTTTATCAATAGATTTTAACGAAACTT
GGATTTTATCAATATATTTCAATGAAACTTAAATTTTATCGATGAAACCTGAATTTTTTC
GAGAGATTTTAATGAACCTTAAATTCTATTCATATATTTTAATGTAACTTGGATTTTATT
GATAGATTCTCGAAAAATATTCCCATACGAAATTGCATAAAAGTCGGTGCAGTGGTAGCC
GATTTGTTCCATGTGTACAGACGGACGGATATGACGAAGCTAATAGTAGTTTTCGTAATA
TTCCCAACGCGCCTGAAACCTGGTTTACGTTTCTTTAAGAAGTGCGCATATGACCGCTCT
TTCTCGTACAGTCTGTCCTGCTGTCATTGTTACATTTTAACCAGAACTGGAACATATATA
GACGCATTTCATTAAACCTCTCTCCCCATGACGATCAGTAAGCCAATCTCGATTCACGTC
ACGCACACACAGGGTACCGGACAAATTTAAAGGATATCTGAAAGCCTGCCTCATCGTGTC
TTGTGTTTCGTTTCTTAATAGCAAATTTTACAGAAAATAATATAATAGTAATATGGGTTT
ATTATTGCTCATTATGAACGTTCCGCTCCACTTAATAACACGAACTTTTATGTATTTACC
GATTATTTCAATACTGCAAGGTTTATAAAGTGTTTGGCAGAGTTTCAAAGAGATCAGAAC
TGGAAAGGGCTCATAAGACATCCAAGAAATATTGATTAAAAACCCTGACATCATCCGGAA
GAAAGCACCACGTAATTTCAGAAAATAACAACTTTCAACAATATAAACAATGACCTAAGA
AGCATATAATTCATCCTATCTGCATTTAAGGACTGAGTTCCTCATACTTCTCAGGCAACA
CACTGCGTCTCCATACCACTTCATAAATTTAAGATGAGAAATTCCCGCTGGTGTAAACAA
ATCACAAGTAAATACGTTAAATCAGTCACACACAACACCGGTTTTGTAATCAAATGCCTT
TAATAAAGTTGATAAAAACTAGTATTGTGTGCTACTGATTTAATACACACATTTACAAGC
ACATAATGCTGTTTAGGGTAACAATCGCTTTTTATGTGACCAACATAAGTTACGCACTAA
CACTAAGTGTTGCATAATGCTGAGTTTCCTGCTGCTGCAGTGTGTGGTACCAACCAATGT
GCTATCCAGGTTCAAAATAAACGTTTGACATGTTTTATAAGACATTTTCATCTTCATTCC
TCCCATATTTTTTGAAAGAAATTCTACACTTTTTATTTATGGTCGCCTATCTCTAAAAAA
AAGTATGTAACTTTAGTGACGCTGTTTGACGTAGACCCATGAGTGTGGTTCCCTGTCACC
ACCGCTGCAGGAGCTAAATAATATCTATAATACTTGCTACGTA
CACTGCTTCATAGCACGCTGCAGGAGCTAAATAATTTCTATAATACTTGCTCCGTACACT
GCTTCACGGCACGCTGCTGGAGCTAAATAATTTCTATAACACTTGCTCCGTACACTGCTT
CACAGCACGCTGCAGGAGCTAAATAATTTCTATAATACTTGCTCAGTACACTGCTTCAGA
GCACGGTGCAAAAGCTAAATCATTTCTACCCGGTACTTATCTGTACTCAGTTCATAGCCG
CACAAATTCACAATTTACAGAACTTGTCATATTTATTTTATTTTTTTTCAATCTCTTTGC
GCAGCACTTGGAGATGCTTCGTGGAGCACTTGGTGCTCCTAGAACACAGATTTGAGAATA
TCTGCACTAGACCACCGAAGCCGCTAGGGGACGTATGGTACACAGCATTCTACGTAATAC
ACAGTGAGTGTCCAAAACACAATATCACACTCAACCCAGTGGTGATGAAAAGCACTATTG
TGATTTTATTCAGGTTAAAAATAAATACAATACAATACAATACAATACAATACAGTACAA
TACACTGCAATACACTGTGATAAAATACAATACAATACAATACAATACAACGCAATACAA
TACAATACAATACAATACAAGATGATACAATGCAGCGTAATACAACGCAATACAATACAA
TGCAAGACAATACAATACAATACAATATATTGCAGCGCAATACAATGCAATGCAATACAA
TACAATAAAATTTAATACAATACAATGCAGGGCAATGTAATACAATACAATGTAAAAATA
CAATACAATACAATGCAATACAATGCAATACAATAAAATTCAATACAATAAATTACAAGG
CAATACAATACAACACAATACACACCGAACACCTATACTGGATATGGCTTGATACATGAA
AATCCTGCCCTTTGTTGCCTGTTATCTGATTTATTCTTCACCATCCTTGTAAAGCCATAT
TGGATGCCATAGTGCGCTAATATTTCATATGTTACAGATTGTATTGAATTGTTAGTTAAC
CGTTAGCCTACCGAATTAACCGTGTAACGCCGAACCACTTATAAAGACGTAGTGCAGTCA
GACCTGTAACAATTAAAATTCCCAGTAAAAATATGCGTGAAAAGCCAACAAATACACCAA
ATATTGATTCAGTTTATTAATTATGTATGGTATCTCCTACATATTTCGGCCGTATATTTC
CATCTTCAGTGCCATCTGAGAGATGCTCAATTGAGGAGCAGTCGATAGAATATTGTGGTT
GGGCCTGTTGTGTCTAGTGACGTGGTGCGCCGAGCGTACCACGTCACTAGACAAAATGGC
ATTGTAATGCTGAAACATGTTGGAACTGCCATAAGTAATTACTAAAGTGAATGAATAATT
GGTGTATTTGTTGGTTTTCATGCATCCCTTTAAACGTTGAATAAGATCCCATTCGCCATC
CATTACTAGGAGGTATTACTATAATGGTCGTTAGCAGGTTAAGGGTTAATTTAGTCCTTG
GGTTGCATACCTGCATTGACTATGTTAGATAACATCCATCCTTCTGTGTTATGAATATGA
CATACATAAGTAATTACACAGTGTTCAGTCTACCAAACTCAGTTTTTTTATTATTGACAA
CTATTACAGACACAACATGTTTCGGTGTTTTCAAACCACCATCTTCAGGCCTTCTATTGG
TCGAATATTTTACTAGAATGCTACATACAGTAGCCACATATTTCGGAGTTTTCAAACCAC
CTTCTTCAAGACTTTTATTGGACGAATATTTTACTATAATCCTACCTACAGGGTCCACAT
ATTTCGGAGTTTTAAACCGCCATTTTCAGGACTTCTATTAGACGAATATTGTACTAGAAT
GCTACCTACAGAGGCCACATATTTCGGAGTTTTCAAACCACCATCATCAGGACTTCTATT
GGACGAATATTTCATTAGAATCCTACCTACAGGGGCCATATATTTCGTAGTTTTCAAACC
ACCATCTTCAGGACTCCTATTAGACGAATATTGTACTATAATACTACCTACAGCGTCCAC
ATACTTCGGAGTTTTCAAACCGCCATCTTCAGGACTTCTATTGGATGAGTAATTTACTAG
AATCCTACATACATGGTCCACATATTTCGGAATTTTCTAACCGCCATCTTCAGGGCTTCT
ATATTGTCGGCTCTATAAAGTAAAGATAGAGAGAAAGAAAAGG
AGGGTGGTGCAATACAGTGTTGAAGACCTGTCGTATTGCGTATTTGAAGTACTTGCCTGC
TCGGGCCTACGCTATCTGCTGCGGGTAGGAGTGCTGACGTAAGAGAATACACGGTTTTCG
AGCGAAGGAATTCCACTGCTCTCGTTCAAAAAACATCTACTTTATTTAGCTACTCCTACA
CTAACCTATATAATGCTAGGCGACTCCCGTAGAACAAGTTAGGGGTTGCGTCGCTGTATT
ACGCTGGCTTATCAATTTTAACGCACTTTGCTTTAGTTGGCACGTAACTTTCACACGACT
ATTCACTGAGTAAAGATTTTCCACTCACCTGTGATTAAGTAATTAATCCTCATTGCGTGT
GTTAATCCAGTGTGGTTCACTTATAAATGGTTACACTTTTCTATTCACTGCGGGTTAATT
TGGCGTCTATCTAGCACACGTGTTGTACTTTAGCGTATTTAATTAATAATTCCAATTCAA
TCTTTAATTGTTAATCTCGATGTATCAAAACGTTCAACAATAAATTATTAACTGAATTAC
TTAGTTATAAACTTTCAATTTCGTAGTTATAAGACCTAATAAATTGTCGTCCTTAGAGAA
GGTACGATTATTCAGAACATGTTAACGCGTTGAAGACTTGTGGCGGACTCTCTCAAACCG
ACGCTCCTAACATCCGCTCAGCCCATGCTGTCCACCAAGGCAAGCGTCTAATTCAGCATA
ATTCTTGCAAAATCGTGTTTTCCACGGCACGAACAACAACTGGAAACAAGCTCTGGGAAG
TAGCAACTCGGCAATCCCAACTGCTCTACAGGAAGTCTGACTATAGTTTCAACAAACAAG
TGTCTGAAATCTAACTTCTACTGAAACAATTCTACCTGCAAGAAAATTCAAGTAAAACCA
CAAGGTTTAAACAGCGTTATCTGTTACAATATTAATTTAAACCATTCGGCACCCTCCATC
AAAGACCTCATAAAACTGCAAAAACAGCACTTACCCATACGCTCGGTCATAAACTGGAAG
AAAGCCCCAGCTTACAAGCTGACAAAACCTCTAACACAGAGAATAAAAGAGCTGTCTCTC
CTAACATATGCCTTTAACGTGAAGAATTCCACACAACTGATACACGACGTCAAGAAGACC
CCTTTCCAACCATCACACACTCTGGCCACTCTAGACATATCAAACATATCAAACTCCAAC
ATACCACTGACGGAAACCAGACACATCCTGAACCGGTCTCTAGTGAATAACATGGTAGAA
AACAATATTACAAAAGAACTGCTGGCATGGTACGACACCGTCACCGAGCAGAACTTCACG
TTCAAGGAACACACGGACATCCAAACAGACGGACTAGCGATGGGGGCTCCTACATCCAGC
GTCTTATCAAAATTCTTCCTACAACAGATGGAGCACACACACATCCCAAACTTTGCAAGA
AAACACACTCTAGTGAACTACTTTCGATATGTTGACGACATACTCATAATCTTTGACTCT
AAAGCCACTGACATAAAATCCATCCTGATCGAATTCAACGCCATACACCCAAACCTCAAG
TTCACAGCAGAAGTGGAACACAATAACGCGATCAACATCCTGGACACTACCATACAGAAA
ACAAAAAACAACTTAAAGATATCAATCTACAGGAAATGCACGTTTACTAAACCATCATCC
CGTATACCTCAAATCACCCACCACAACACAAACACGCCGCAGGCAGGTTCCTGTATAAGA
GATTGAACACTTACCAACTACAAACAGAAGAATACAAACAAGAAGAAAACCTTATCCACA
ACATATTTCATAGTACCTCCTTTTCTATTCGACCATGTAAGGACCCCTCCAAACAAATAC
AAAAACAATCAACATCCCAACAAACTCCGATCCAGAAGTGAGCTACCTTTACTTACACTG
GTAGAGAGACTAAATACATCACCAGCCACTTCAAGAACTCTAATATAAGAATTTCTTTCC
AAACAAAAACTCCATACTAAACCACATAACAAACCGCAATCACGGCCACAGAGACCCATA
CACTTCCTCAGGAATACACAAGCTGACAAGCCCTGACTGCGGCAAAGCATACGTAGGCCA
AACTGGCGTAATGTTCTCCATAAGTTTCAAAGAACACAAACAAGCCTTCCGTAACAACAG
CCCAACTTTTTTGGATTTATTTTTGCTCCGTGTATTTCACG
CACCACCGTACTGTTCATTTTACCAGTCATCTCACTACCTGCTTCACAAAGTTATCCTAA
ATACCGTACTTCAACACTGCAGCAGTACAGTACACATTTACACACAAACAGTACACAGAA
CAACACAGTGCAGCAGTACAGTACACATTTCACACACAAACAGTACACAGAACAACACAG
TGCAGCAGTACAGTACACATTTACACACAAACAGTACACAGAACAACACAGTGCAGCAGT
ACAGTACACATTTACACACAAACAGTACACAGAACAACACAGTGCAGCAGTACATTACAC
ATTTACACACAAACAGTACACAGAACAACACAGTGCAGCAGTACATTACACATTTACACA
CAAACAGTACACAGAACAACACAGTGCAGCAGTACAGTACAAATTTACACACAAACAGTA
CACAGAACAACACAGTGCAGCAGTACAGTACACATTTACATACAAACAGTACACAGGACA
ACACAGTGCAGCAGTACAGTACACATTTATACACAAACAGTACACACAACAACACAGTGC
AGCAGTACAGTACACATTTACACACAAACAGTACACAGAACAACACAGTGCAGCAGTACA
GTACACATTTACACACAAACAGTACACAGAACAACACAGTACAGCCAGTACAGTACACAT
TTACACACAAACAGTACACAGAACAACACAGTGCAGCAGTACAGTACACATTTACACACA
AACAGTACACAGAACAACACAGTGCAGCAGTACAGTACACATTTACACACAAACAGTACA
CAGAACAACACAGTGCAGCAGTACAGTACACATTTACACACAAACGGTACACAGAACAAC
ACAGTGCAGCAGTACAGTACACATTTACACACAAACAGTACACAGAACAACACAGTGCAG
CAGTACAGTACACATTTACACACAAACAGTACACAGAACAACACAGTGCAGCAGTACAGT
ACACATTTACACACAAACAGTACACAGAATACAGAGAACGGAACATACATAACAATAAAA
AAATTAAACATACATAACAATAAAAATTGAACATACATAACAATAAAAAATTGAATATAC
ATAACAATAAAAAAATTTAACATACATAATAATAAAAAATTGTACATAACAATAAAAAAT
TGAACATACATAACAATAAAAAATTAAACATACATAACAATAAAAATTAAACATACATAA
CAATAAAAATTGAACATACATAACAATAAAAAATTAAACATACATAACAATAAAAAATTA
AACATACATAACAATAAAAATTGAACATACATAACAATAAAAATTAAACATACATAACAA
TAAAAAGTTATACATACATAACAATAAAAAATTGAACATACATAACAATAAAAATTAAAC
ATACATAACAATAAAAAAAGTAATCAAAATCAAGCCGGAACACGTGACTGTACTGCATTT
GGCAACTCTGCTTGCACAACCACGCCATGACCCACACTACGTTAATATACAGCATGGCGC
ACGAGATGTCATACCATTTTATCATACCATTAAAATTGTAACATCATAGTATCGATGTTG
CAAACGTGCGTGTGAATGTTGAGGTTCATTGAAAATCCGATAGATGTCACCAGTTTGTAA
AATGGCGGACAACGGACGGTTGACTGTAGACCGAAATGCTTTTGTAATAATAAAACTGTG
TAACGTGTCCAATCAATGGTATGACATTTCGTGCGCCAACCTGTCTCATTTTTGCTATAT
ACAGGGTTATTCATAAGTCCTTCCGGGATTTCCGAAATCGACTGCGCAACAACCAAGACA
GACACGGCAGAAAGGAGCATATCAATAGGTAGAGAATCTCTCCAAGTTTTTTTTTGTACT
AGGGGCCTTGACGTACTTGCAGATTCCACCGCTAGGGGGTAGTCGTGACGAAAAATGGCG
TTCACAGTGAATAAGAAAGCGTTCTGTTTCTTGGAATTTGCCAAAACTGAGTCAATTGTG
ACAGTGCAACGGAGGTTTAGGATCATGTACCCCTAGTACAAAAAAACTTGGAGATATTCT
CTGCCTATTGATATGCTCCTTTATATCGTGTCTGTCTTGGTTGTTGCGCAGTCGAGTTCG
GAAATCCCGGAGGGACTTATGTATAACCCTTGTATATACAGGGTGTCAACACGTAACCGA
ACGAAGCACAATTGCAACACAAATACCATATGTTCGTTTTG
GGCTCTGAAAAAGCAAATTCCAACGACACACGCAATACATCGATGGAAAACCACAATGAT
TGAGACAAATACTAAATTTTCGTTTTTGGCCCTGAAAAGCAAATTCCCAACGACACACGC
AATAAATCGATGGAAAACGACAATGGTTGAGACCCAAATACCATATGTTCGTTTTGGGGT
CTGAAAAGCTAATTCCAACGACACTCGCAATACATCGATGGAAAACCACAATGATTGAGA
CATAAATCCTAATGTTCGTTTTGGGATATGAAAAGCAAATTTCCAACGACACAGGCAATA
CATCGATGGAAAACCACATTGGTTGCGACACAAATACCATATGTTCGTTTTGGGCTCTGA
AAAGCAAATTCCCAACGACACACGCAATACATCGATGGAAAACCACAATGATTGAGACAT
AAATACTAAATGTTCGTTTTGGGCTCTGAAAAGCAAATTCCCAACAACACACGCAATAAA
TCGATGGAAAAACACAATGGTTGAGAGACAAATACCATATGTCGTTTTGGACTCTGAAAA
GCAAATTGTCAACGACACACGCAATACATCGATGAAAAACCAGAATGGTTGAGACGAAAA
AACCATACGTTAGTTTTGGGCTCTGAAAAACAAATTCCCAACGACACACAAAATACATCG
ATGGAAAACCACAATGGTTGAGATACAATTACCATATGTCAATTTTGGTCTCTGAAAAGC
AAATTCCCAACGACAGACACAATAAATGGGTGGAAAACCCCAATGGTAGAGACACAAATA
CTATATGTTCGTTTTGGGCTCTGAAAAGCAAATTATCAACGACACACGCATTACATCGAA
AGAAAAGCACTATGGTTGAGACACAAATACCATAAGTTCGTTTTGGGCTCTGAAAAGCAA
TTTCCCAACGACACACAAAATACATCGATGGAAAACCACAATGGTTGAGACACAATTACC
ATATGTCAGTTTTAATCTCTGAAAAGCAAATTCCCAACGACACACGCAATACATCGATGA
AAAACCAGAATGGTCGAGACACAAATACTATATGTTCGTTTTGGGCTCTGAAAAGCAAAT
TATCAACGACACACGCATTACATCGAAAGTAAAGCTCTATGGTTGAGACACAAATACCAT
ATGTTCGTTTTGGGCTCTGAAAAGCAAATTCTCAACGACAAACGCATTACATCGATGGAA
AACCACAATGGTTGAGAGCCACATACCATATGTTCGTTTGGGGCTATGAAAAGGAATTTC
CTAACGACACCCCCAATACATCGATGGAAAACCAAAATGGTCGAGACGCAAATACCATAT
GTTACTTTTGGGCTCTGAAAAGCAAATTCCCAACGACACACGCAATACATCGATGTAGAA
CCACAATGGTTGAAACACAAGTATCATATGTTCGTTATGGTCTCTGAAAAGCAAATTCCC
AACGACACTCCTAATAAATCGATGGAAAACCACAATGGTTGAGACACATATACCGTATGT
TCGTTTTGGGCTTTGAAAAGCAAATTCCCAACGACACACGCTATACATCGATGGAAAACC
ACAATGGTTGAGACACAAAAACTATATGTTCGTTTTGGGCTCTGAAAAGCAAATTATTAA
CGACACACGCATTACATCGAAGGAAAAGCACTATGGTTGAGACACAAATAATATATGTTC
CTTTTGGGCTCTGAAAAGCAATTTCCCAACGACACACGCAATACATCGTTGGAAAACCAC
AATGGTCGAGACACAAATACTATATATGTTCGTTTTGGGCTCTGAAAAGCAAATTGTCAA
CGACACACGCATTACATCGATGGAAAATCACAATGGTTAAGACACAAATACCATATGTTC
GTTTTGGGCTCTGAAAAGCAATTTCCCCACGACACAGGCAATACATCGATTGAAAACCAC
AATGGTCGAGACACAAATACCGTATGTTCGTTTTGGGCTCTCAAAAGCAAATTGTCAACG
ACACTCGCAATACATCCATGGAAAACCACAATGGTTGAGACACAAATACCATATGTTCGT
TTTGGGCTCTGAAAAGCAAATTGTCAACGACACACACAATACATCGATGGAAAACCAGAA
TGGTCGAGACACAAATACCATATGTTCGTTTTGGGCTCTGAAAGGCAAATTATCAACGAC
ACACGCATTACATCGAAGCACTAGGTAAATTGAAATTGTGT
GGCGTCCAGTGGGAAAGTACAGGGCTGTTGGTCTTCACATAGCCGCTAGCTAAATTGAAA
TTCTGTGGCCTACAGGGGAAAGTACAGGGCAGTGGTTTTTCACATAGCATTAGGGCCTAA
AATATATGTTTAATAGCACAACTATGACTCTCGATTTGATTTAACATCTAAAAGAAATAA
GTGCCAATAATATATCCTTGGGAAAATTCGGCCGGTTCTGTAGGGCTGACAACCTTTCCA
CCTTCATGTGCCGATTTTCTTAAAATCTGGTAGCCCCAGCTCCTGGAACTCTAAAGGGTC
TGTCATGGGCTGTATTATTATTATTATTGGTTATTATTATTATTATTATTATTATTATTA
TTTTCTCTGGCTCTGCAGCCCAGCTCGTGCCTATGGCCTCCTCTTTCACGATGTTTCTTA
ATCAAACCTAACGACGGGCCATATTCTGTAGGACTCCTCTGGACGAGTGATCGGCCCGAC
CCAGAGACTACTATCTGACAACAAACAACCCACACAACAGACAAACATCCCTGCTCCCGG
TGGGATTCGAACCCACGATCGCAGCAGGCGAGCGGCCGTCGACCTACGCCTTAGACCGCA
CGGCCACTGGGACCGGCTCTTAAGCCGGTAATGGGTTTTTTAACATTTAAATTAACAATA
TGGTACATATATGTATCATTATATACAGGGTGATTCAGGAGAAATCTGCAATAATTTGGG
AAATGATAGTATGTGTGATTCTAAGCAAAAAGTTCATATGAACTTGGGTACGGTTTTGAA
CGTTTATGGAAGATACGGACCATCGGTTCGAAATCCCTAGTAAAAAATCTCGTCCATTAT
ACAGGGTGATTCAGGAGAAATATGCAATACTTTGGTAAATTATAGTATGTGTGATTCTAA
GCAAAAATGTTCATATGAACATGGGTCCGATTTTTAATGGTTACGGAAAATACTGGGCAT
CCGTTCAAAATCCCCACTAAAAAATCTCGCCCATTATAACAGGATGATTCAGGAGAAATC
TTGCAATAGTTTGGAAAATGACAGTATGTGTGATTCTAAGCAAAAACGTTCATATGAACA
TGGGTCCGATTTCGAACCTTTACCGAGATATGGTAAAAGGAAAATTCAGACCATCCGTTC
CAAAATCGGACCTAGGTCCATATGAACTCTTTTGCTTAGAATCACACATACTATTCTTTT
CCCAAAGTATGCAAGATTTCACCTGAACTCACCCTGTTTATATATATATATATATATATA
TATATATATATATATATATATATATTCCTTCACCGTTTGGCTCCCGAAAATGTTCATCCC
ATTCTACTTCCAATAAAACTCCGTTATGGTAGCTGATTCCGGGAAAAATATTACGAGAGA
TGAGCGAACCTTCAGGTCTGAGAAGATATACAAAGTGCGACCCAAAAGTTTTTAGACGAG
TTGTTATTTTTTTTTCGACGCAGAAGGTGTAATTGATCGATAGTTTGTCCCTGAAGGGCA
GAAAGTAAATGCAGAATTCTACGTGGGTGTTTTGGATCGGCTACTGAAGAGAATTGAACG
CGTTAGAACGATCAAATTCCAACCCATTGAGTGGGTTCTACTTCACAATAACGCGCCATC
TCATAACGTTCCAATTGTACAGAATTTTCTGGCGAACAGGAACGTTGCTCTTCTCCACCA
TCAAGAGTGCATACCAATGTGAGCTGTCGTTCCACCATGTTCTCGTTGTGTTGGGCTGTC
GATGATCAGATGTACCAAATTGCTTACTCAGTTGTAACATACAATAAAAAGCAAACACCA
CTGAATGAGTTGGGGCGTGTTTACAGCGCGGTAGGGATTGATTTCTTACATTAAAAAATT
TATAAACCGTAGAGGAAAGTGTTTACAGCGCGGTACGGAATGATTGCTTACATAAAGCAG
ATATAACCGTGGTGGGAAGTTTTTAGAGCGGGGTAACGAACTGATTGCTTATATAAAGCG
GATTTATAACCGAGGTTGAAAGTAGCTTATCAGCGCGGTACAAACTGATTTCTTATAAGA
AAGCAGGTTTATAACCTCTGGTAGATAGTGTTTAAAGCGCGTGTAAGGACTGATTGTTAT
ATAAAGCAGATTTATGGACCGAGTTGGAATGTGGCTTGACAGCGCGGTACAGCAACAACG
CACCGAACGCAGCCACACCTGGTGTATTTGTTGGTTTTCC
ACCCATATATTAACGAAATGCACGGTTCAAGGAGCAAAATACCCAGTAAAAAAATCTCGT
CAGGCAGCGTTGCGCGGAGGGGTTTAATTCCGTCTTTTAAAGGTTAAACCCCATGGAAGT
TAAAGAGTTTGGCCTTTTTTAAACAAAAGACCACACTCTACATTTCCCTCCGCCCCGGGC
TGTGACAAGAGGTCGACTTGTAAATGAAGCGCAAGTTCTTATTAGTGGACCCCCCCCCCC
ACTCCAACTCCCCTTTTACGCTTTTCATGTAACGTACAATGCAGCCATGAGGTAAGGTTC
TGATCTTAAACTCTCTGCGCAAAGATTGTAATGAATTTTTGACATATAAGCTTTTGAGTT
TTAAAGACCAAGATATATAGAACTATAATTTTGCCTGTTGTTTTGTATGGGTGTGAAACT
TGGGTCTAAGAGGGACGAGGTGACAGGGGATTGGAAAAGTTACATAACGAGAAGCTGAAT
GATCTGTATTCCTTACCCAATATTGTGCGGGTGGTTAAATCGAGAAAAATGAGATGGGCG
GGGCATGTGGCGCGTATGGAGGAGGAGAGAGGGGTGCACAGGGGGTTGGTGGGGAAACCT
GAGGGAAAGAGGCCTTTGGGGAGACCCAGACTTAGATGGGAGTATAATATTAAGATGGAT
CTTCAGGAAGTTGAAGGGAGTCTTGGGGACTGGATAGAGTTGGCTCAGGATAGGCACGGG
TGGTGGACACTTGTGAGTACGGTAATGAACTTTCGGCTTCCATAAAAATGAGGGGAATTT
CTTGACTAGCTGCAAAGACTGGTCGGCTTTTCAAGAAGGACTCTGCTCCATGGACTATGT
AAGTAAGTAACCTTTTTAGTTTTATGTTCTGAACAGTCAGTATATAAAACATGCTACCGT
TCTTACAGTGGCTTAGCTAACCGGACGTACTTCCTCCCCGCTCACAATTATTACAGAAAT
TATGTTTTGTTCACCGTTCATCTCGCTATATGCGTGTAATGAAACCAACTTCGTTCACTA
TTCATCTTCATTTTATTCCGTAAATATACCTCTACATGTTTCGGGCTTGCTAGTTGCCCA
TCATCAGGAGGTAACAATGTACGTAGGTGAGTCAAATGAAAACCTTAAATACTTATTAAT
TTATTAAATATTAAATATTTATAGAACTGCAAAAAATTATAGACGATATGGTGTGCTGAT
CCATGGCTCATATCCAGATGTGCGGCTATTTCGTTCACTGTTACGCGGTGATTTTCCTGC
ACGATGGCTTCAACTGCCGCAATGGCCTCTGGTGTCACTACTCGCTGTGCCTGACCAGGT
CGAGGAGAGTCTTTTGCAGAACAAATGCCGTTCATGAAATTCTTAGTCCACTCATACACT
TGCTGTAGTGACAGACATGCATCACCTTACTGAACTTTCATTCTTCGATGAATTTCGTTG
GGTTTCATTCCTTCAGTATTTAAAAAACGGATGGTGCATGTTGCACGCGGAGCCGCCATC
TTTACACCAATACTGCGCCGTCATGTTGCATCCCTGCATCATACTGCCACCTGTCAGTCA
CATTTCAAACCATGAGTATTTTTGTTGTCAACTTACAAGACAATCGAGCTGTGTTTCGAA
TTTTTATAGCACTTTTAAAGTTTTCATTTGACTCTCCCTCGTATATACTCGTATGTGTGT
GTGTGTGTGTGTGTGTGTGGTGTGTGTGTGTGTAACAAATGTCACGTGTTGTACCTGTAT
TAGCCGACTGTCGCATCTCAACCAGGCCAGATGACAGTCAACTAAAACCTACAACACGTA
CCAATTGTTGCATATATACTTTGTTACCTCCTGATGATGGGCATCTAGCAAGCCCGAAAC
ATGTAGAAGTATCTTGATATGATCCATCCCGTTGTGTGTTGAAACAATAGTAACAAAGTA
GAACCCGTCAAACACAACACTTTGCATGGATTAAGATGTGAGCAAAGGTGGCTACACGTT
TCGGCCTAACATTTCATGTATTTTGCTGATCAATTGTTCCATAATGAATGACCTGATGAT
GGCCTCTTGCTAGGCCGAAACATGTAGCCAACTTTGCTCACA
>mite|17742|79|90|135|146|t7|85|m1|ave_score:0.614806
ATATACGGGCATCAGAACATGCTATGTCATTAATATGCACAAGGTCATAGAAGAATTTTA
ATATGACCTAATAGGACCTAATCTAATAAAAGTCTTTATATATACATTTTTAATTAATTA
GGTCTTATAATTTTATATTTATGTTCTAATTCTATCAAAAGAGTAACTGTACACTAGGGA
CCCATTAG
>mite|17987|1773|1783|1834|1844|t4|1778|m1|ave_score:0.678183
GTGAACGAGGAGGCCATAACACGCGCTGGGCTGCAGTGCCAGAGAAAATAAAACAAATAA
TATTCTCTAGAAAATAAAAACAAAAAATATTTCTATAGAAAATAAAAACAAATAATATTT
CTCTAGAAAATAATAACAAATAATATTTCTCTAGAAAATAAAAACAAATAATATTTCTCT
AGAAAATAAAAA

Looks like some header lines are missing?

Sincerely,

Cong

how to parse the output result

Hi,
Through I see the readme, I counld not know which the exact position of MITE, and the header is like this: mite|1|170025|170037|170078|170090|t2|170026|m1|ave_score:0.711664, how can I get the MITE position?
I waiting for your reply, thank you.

manual

Hi,

Could you please provide more info on how to install and run? After compiling it, I had miteFinderTest. However, miteFinderTest seems to be running forever without producing any results.

below is the code I used
/scratch/luohao/software/miteFinder/miteFinderTest -input bb.fa -threshold 0.5 -output bb

Thanks!

Issue in detection

Dear Jialu Hu,
Greetings!
I am using miteFinder in Transposon Ulitimate, but it generates 0 KB resutls.
Then when I run it alone it generates 2.7 MB results of 2.Gb size.
could you give me some suggestions?

with kind regards

Ramky

About pattern_scoring.txt

Hello

Thank you so much for developing this software.

I am trying to run the inicial command, however, I will like to know more about the pattern_scoring.txt

What is this file and can we used for any sample?

Sincerely,
Loi

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.