jorussell-idm / magude Goto Github PK
View Code? Open in Web Editor NEWmagude cotransmission scenarios
Home Page: HomePage
magude cotransmission scenarios
Home Page: HomePage
Rework slides from 2.25 presentation deck into a high level
Illustrator magic on https://www.dropbox.com/home/Malaria%20Team%20Folder/projects/parasite_genetics/genomics/synthetic_genomes/eir_sweep/plots?preview=variants1000_af0.01_LMH_077_year_repDistribution.png
Should be private but shareable and to contain documented .ipynb scripts
rerun the Magude sim framework from GR
Three pipelines GenEpi+DTK
Questions of interest to collabs:
Questions:
What is the right threshold for informative contributions to polygenomic infections? (2?) By use case?
What is the right classification for polygenomic infections?
What is sensitivity of genetic metrics to the threshold of observability or classification?
What is a prototypical use case to demonstrate?
Q for Bryan: Is the first step the deconvolution of microhaplotype frequencies across sites? If low diversity between strains, you will underestimate true clonality because of relatedness. Does r refer to relatedness calculated by site or r by strain representation/deconvolution
Q: If you don't know how many unique strains you have (given sequence data from a polygenomic infection), how do you make estimate of COI, without accommodating relatedness in the parasite population you would tend to underestimate COI? Baseline estimate is unique contributors to the diversity at sites. Because of unknown phasing, you will almost certainly be underestimating true COI.
ROLE for MODEL!: Simulate a true COI to help calculate the effective scale factor for converting effective COI to true COI across range of COI (for example effCOI of 2 vs effCOI of 10, also dependent on number of sites and their diversity). Calculation of scale factor may depend on assumptions of transmission intensity and initialization of diversity, structure and linkage of diversity?
What is the most appropriate statistic to calculate for IBD relatedness when there is within host and between host relatedness (average by host? rank ordered?) Does this apply when true IBD is simulated
A collection of work items related to the GenMoz collaboration, designing and evaluating sample strategy in and around Magude as well as in the higher transmission Northern sites.
Try using a larger population, lower seasonal amplitude, and consistent outbreaks as a method for sustaining lower prevalence scenarios for use in GenEpi.
From Meeting with Jessica and Albert on representations (and calculations) of COI from EMOD+GenEpi
Sample id, locus id, haplotype id, unique values for each column
Download analyzer here: C:\git\malaria-cotransmission\analyzers\download_analyzer.py
Stored output here: C:\Users\jorussell\Dropbox (IDM)\Malaria Team Folder\projects\parasite_genetics\DTK\example_with_cotransmission\transmission_report_outputs\EIR_sweep
Deliverable: white paper in Q2
Strategy of sampling over size of sampling, this is what we can do in this time frame
Hash out in Wed MPG
in main reporters dir
header
arg less constructor
constructor
Serialize
Deserialize
for loop
When done building
flatten_config
Run regressions 79,80
After run, do diff checker and manual inspection of values in conInfIds
Copy over the new JSON into outputs
Create migration csv specifying two way migration pattern and generate binaries
See line 188 in genetics.variants.py
def make_dummy_24snp_panel(allele_ratio=0.5):
nsnps = 24
panel = SimpleVariantPanel()
panel.add_chromosomes(np.ones(14) * 100) # 14 dummy chromosomes of 100 "sites" each
root_properties = pd.DataFrame(index=range(nsnps))
root_properties['variant_frequencies'] = np.array([[allele_ratio, 1. - allele_ratio]] * nsnps).tolist()
panel.add_sites([20, 80, 190, 330, 470, 508, 570,
615, 630, 632, 640, 641, 642, 655, 695,
740, 840, 904, 980, 1005, 1015, 1205, 1250, 1320],
['0'] * nsnps,
[['0', '1']] * nsnps,
None,
metadata=['test' + str(i) for i in range(nsnps)],
root_properties=root_properties)
return panel
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.