Coder Social home page Coder Social logo

demografr's Introduction

> whoami

I'm a researcher and programmer at the Globe Institute, University of Copenhagen, where I develop new tools for spatial simulations and data analysis in population genetics and ecology.

An important focus of my work is facilitating reproducibility through the design of intuitive, programmable interfaces to complex computational methods. I enjoy thinking about ways to help experts work on hard problems more efficiently while lowering the barrier of entry to novice users.

Previously, I did my PhD in ancient DNA genomics at the Max Planck Institute for Evolutionary Anthropology in Leipzig where I studied the evolutionary histories of Neanderthals, Denisovans, and anatomically modern humans (but also extinct elephants!).


For more details about my work, check out my website at bodkan.net.

If you would like to get in touch, send me an email at [email protected].

demografr's People

Contributors

bodkan avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

demografr's Issues

How to handle sequencing error and ancient DNA damage?

One major selling point of this R package is efficient, fast ABC inference using tree-sequences and slendr.

For lots of data sets, in particular low-coverage and/or ancient DNA data, erroneous SNP calls present an issue.

In the context of ABC, it is generally assumed that the summary statistics come from data which is reasonably clean and where various errors (especially aDNA damage) have been taken care of via filtering.

For many (?) analyses and summary statistics, aDNA errors would add noise around the true values of summary statistics. In cases like this, simulating summary statistics from perfect, clean tree sequences would not be a problem.

Still, it might be interesting to add an option to sprinkle artificial mutations that correspond to damage or sequencing errors on top of standard tree sequences. Then, summary statistics computation would proceed as normal, with the exception that it would be mutation-based rather than branch-based (which would be normally the mode of operation).

Although useful more generally, it doesn't make sense to suggest to add this functionality to tskit or msprime. Perhaps this package could have a tiny built-in Python submodule which would add damage or errors on top of mutated tree sequence.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.