ncbi-hackathons / thehumanpangenome

A Strategy for Building and Using a Human Reference Pangenome

Home Page: http://bit.do/TheHumanPangenome

License: MIT License

Languages: R 1.06%, Shell 1.52%, Python 2.21%, CMake 0.56%, C++ 1.43%, WDL 0.62%, HTML 0.53%, Jupyter Notebook 91.09%, Makefile 0.98%

thehumanpangenome's Introduction

TheHumanPangenome -- Creation of and Toolsets for Human Pangenomic Analysis

The path of GRCh38 through a graph! For details check the DS folder!

https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/DS

A Faster, Better Short-Read Mapper with Hit Chaining. Now with more corn! See the Giraffe folder for details!

https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/Giraffe

Most Genome Annotations can be Imported from gff3 (to ggff!) quite easily! See the annotation folder for details!

https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/annotation

Allele-Specific Expression can be calculated in a lightweight fashion! Grab the workflow in the RNA folder!

https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/RNA

Calculate Mutations In, Outside of, and at Breakpoints of Structural Variants! See the SV folder for Details!

https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/SV/HG002

We Mapped the MHC Region of HG002 to a Graph with Long Reads, Long Reads, 10X and love. Check out the MHC folder!

https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/MHC

thehumanpangenome's People

Contributors

6br, adamnovak, apregier, arkarachai, carlthewebmaster, cmarkello, cschin, dailydreaming, dcgenomics, eblerjana, evanbiederstedt, iminkin, jeizenga, jmonlong, jonassibbesen, maickrau, mehelmy, mlin, omicsnut, paudano, pjbradbury, tobiasmarschall, twrightsman, vaschn, yassines

thehumanpangenome's Issues

IndexError: string index out of range

Hi,
When I run the maf_to_gfa1.py script to convert alignment.maf to GFA format, the following error occurs:

Traceback (most recent call last):
  File "/home/cuixb/tools/biosoft/SibeliaZ-1.2.1/SibeliaZ-LCB/maf_to_gfa1.py", line 177, in <module>
    blocks, sequence = split_maf_blocks(args.maf)
  File "/home/cuixb/tools/biosoft/SibeliaZ-1.2.1/SibeliaZ-LCB/maf_to_gfa1.py", line 102, in split_maf_blocks
    next_profile = profile(maf, next_column)
  File "/home/cuixb/tools/biosoft/SibeliaZ-1.2.1/SibeliaZ-LCB/maf_to_gfa1.py", line 46, in profile
    return [group[i].body[column] == '-' for i in xrange(len(group))]
IndexError: string index out of range

Here is part of the alignment.maf file:

##maf version=1
# sibeliaz v.1.2.1
# cmd=-f 64 -t 28 -o westar_kale_chrA01 data/westar.fa.split/westar.id_chrA01.fa data/kale.fa.split/kale.id_kale_chrA01.fa

a
s kale_chrA01 19067038 227 + 40689054 GTTTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCCTTTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 18728550 226 + 40689054 >1_1
s kale_chrA01 21852872 224 - 40689054 GTTTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 21847912 224 - 40689054 >1_2
s kale_chrA01 18894209 224 + 40689054 --TTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 18905069 226 + 40689054 >1_3
s kale_chrA01 18937683 224 + 40689054 --TTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 18942636 226 + 40689054 >1_4
s kale_chrA01 21656164 226 - 40689054 --TTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 19062092 225 + 40689054 >1_5
s kale_chrA01 18723593 225 + 40689054 GTTTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 21620759 224 - 40689054 >1_6
s kale_chrA01 21380478 224 - 40689054 --TTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACATGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 21368346 224 - 40689054 >1_7
s kale_chrA01 21317989 224 - 40689054 GTTTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCGGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 21307298 224 - 40689054 >1_8
s kale_chrA01 19477728 226 + 40689054 GTTTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGGTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCCCTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 19488373 226 + 40689054 >1_9
s kale_chrA01 19756575 226 + 40689054 --TTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCCTTTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TT
GCACTAGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s kale_chrA01 20912571 212 - 40689054 >1_10
s chrA01 28065206 226 + 46056803 -TTTACAAGTATTAATAGAGAGAGCACCAAGGAAATTCGAAATGGTTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TTGCACT
AGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCACTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-
s chrA01 27205390 226 + 46056803 >1_11
s chrA01 27210347 200 + 46056803 --TTACAAGTATTAATAGAGAGAGCAACAAGGAAATTCGAAATGGGTAAGCATGTGTAGTCAAAGGACAGGCTGGAACTCC-TTTTGAATCACTTGGCTGTGCTTTCTCACATGC-TTGCACT
AGTAT--AAAGGTAACTTCTCCTTTCCAGCATCATACAGGCTGTC-AAAGTGATCCCTTATCCTTCCTTAACCTCCCTTATCCTCTTTGGTCGAGTTTCCTCTCTTCT-

How can I solve this error?
Thank you in advance!
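
The traceback shows `profile` indexing every row of a block at the same column (`group[i].body[column]`), so a row whose text is shorter than the others would raise exactly this IndexError. The excerpt above contains rows whose text field is a short token such as `>1_1`. The following is a minimal Python 3 sketch for locating such blocks; it is not part of maf_to_gfa1.py, the names `Row`, `is_ragged` and `find_ragged_blocks` are hypothetical, and it assumes the usual seven-field layout of MAF `s` lines with the aligned text in the last field.

```python
from collections import namedtuple

# Hypothetical helper type: one 's' (sequence) row of a MAF alignment block.
Row = namedtuple("Row", ["line_no", "name", "body"])


def is_ragged(rows):
    """True when the rows of one block do not all share the same text length."""
    return len({len(r.body) for r in rows}) > 1


def find_ragged_blocks(lines):
    """Return (block_start_line, rows) for every block with unequal row lengths."""
    problems, start, rows = [], None, []
    for line_no, line in enumerate(lines, start=1):
        fields = line.split()
        if fields and fields[0] == "a":
            # An 'a' line opens a new block, so check the one just finished.
            if is_ragged(rows):
                problems.append((start, rows))
            start, rows = line_no, []
        elif len(fields) >= 7 and fields[0] == "s":
            # Assumed layout: s name start size strand srcSize text
            rows.append(Row(line_no, fields[1], fields[6]))
    # Check the final block, which has no following 'a' line.
    if is_ragged(rows):
        problems.append((start, rows))
    return problems


# Tiny made-up input: the second block mixes a sequence with a short token.
sample = [
    "##maf version=1",
    "a",
    "s seqA 0 4 + 10 ACGT",
    "s seqB 0 4 + 10 AC-T",
    "a",
    "s seqA 4 6 + 10 ACGTAC",
    "s seqB 4 4 + 10 >1_1",
]

for start, rows in find_ragged_blocks(sample):
    print("block at line", start, [(r.line_no, len(r.body)) for r in rows])
```

To check a real file, an open file handle can be passed in place of `sample`, for example `find_ragged_blocks(open("alignment.maf"))`. Any block it reports has rows that cannot be indexed column by column, which is where to look first.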
