Rosalind

Practicing Python with a focus on genetics

I am an aspiring data scientist building up my portfolio. My first project, of many to come, was simply to learn the Python programming language, which I did with the help of Rosalind, "a platform for learning bioinformatics and programming through problem solving" (https://rosalind.info/problems/locations/). Next, I plan to explore Python packages, specifically those frequently used by data scientists.
I am interested in working with biological data with an eye toward global climate resiliency, which is why I chose to learn through Rosalind and its focus on biological data. Moving forward, however, I plan to work with data that is more directly applicable to my field. My long-term goal is to work in biotech to address environmental issues.

Individual Project Descriptions

Python Village

Six lessons in basic Python.

Counting DNA Nucleotides

Given: A DNA string s of length at most 1000 nt.
Return: Four integers (separated by spaces) counting the respective number of times that the symbols 'A', 'C', 'G', and 'T' occur in s.
My Code
Sample Data that I Worked With
Problem Description in Rosalind
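
For illustration, one way to approach this problem is sketched below. It is a minimal example, not necessarily the approach taken in the linked code, and the function name `count_nucleotides` is hypothetical.

```python
def count_nucleotides(s: str) -> tuple[int, int, int, int]:
    """Return the counts of 'A', 'C', 'G', and 'T' in s, in that order."""
    # str.count scans the string once per symbol, which is fine for <= 1000 nt.
    return tuple(s.count(base) for base in "ACGT")


# Print the four counts separated by spaces, as the problem asks.
print(*count_nucleotides("AACGT"))
```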

Transcribing DNA into RNA

Given: A DNA string t having length at most 1000 nt.
Return: The transcribed RNA string of t.
My Code
Sample Data that I Worked With
Results from Sample Data
Problem Description in Rosalind
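
Transcription here amounts to replacing every 'T' with 'U'. A minimal sketch, assuming uppercase input (the function name `transcribe` is hypothetical and may differ from the linked code):

```python
def transcribe(t: str) -> str:
    """Return the RNA string obtained by replacing every 'T' in t with 'U'."""
    return t.replace("T", "U")


print(transcribe("GATGGAACTTGACTACGTAAATT"))
```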

Complementing a Strand of DNA

Given: A DNA string s of length at most 1000 bp.
Return: The reverse complement s^c of s.
My Code
Sample Data that I Worked With
Problem Description in Rosalind
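
The reverse complement can be built by swapping each base for its partner (A with T, C with G) and then reversing the result. The sketch below is one possible approach under the assumption of uppercase input; `reverse_complement` is a hypothetical name.

```python
# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(s: str) -> str:
    """Complement every base in s, then reverse the string."""
    return s.translate(COMPLEMENT)[::-1]


print(reverse_complement("AAAACCCGGT"))
```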

Rabbits and Recurrence Relations

Given: Positive integers n≤40 and k≤5.
Return: The total number of rabbit pairs that will be present after n months, if we begin with 1 pair and in each generation, every pair of reproduction-age rabbits produces a litter of k rabbit pairs (instead of only 1 pair).
My Code
Problem Description in Rosalind
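
This problem follows the recurrence F(n) = F(n-1) + k * F(n-2) with F(1) = F(2) = 1, since the pairs alive two months ago are the ones old enough to reproduce. An iterative sketch, with the hypothetical name `rabbit_pairs`, might look like this:

```python
def rabbit_pairs(n: int, k: int) -> int:
    """Return the number of rabbit pairs after n months with litters of k pairs."""
    prev, curr = 1, 1  # pair counts in months 1 and 2
    for _ in range(n - 2):
        # Every pair alive two months ago (prev) now produces k new pairs.
        prev, curr = curr, curr + k * prev
    return curr


print(rabbit_pairs(5, 3))
```

Iterating instead of recursing keeps the work linear in n, which matters little for n≤40 but avoids recomputing earlier months.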

Computing GC Content

Given: At most 10 DNA strings in FASTA format (of length at most 1 kbp each).
Return: The ID of the string having the highest GC-content, followed by the GC-content of that string. Rosalind allows for a default error of 0.001 in all decimal answers unless otherwise stated; please see the note on absolute error below.
My Code
Sample Data that I Worked With
Problem Description in Rosalind
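
A sketch of one possible solution follows. It assumes every record has a non-empty sequence, and the helper names (`parse_fasta`, `gc_content`, `highest_gc`) are hypothetical rather than taken from the linked code.

```python
def parse_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into a dict mapping each ID to its full sequence."""
    chunks: dict[str, list[str]] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            chunks[name] = []
        elif name is not None:
            # Sequences may span several lines, so collect the pieces.
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


def gc_content(seq: str) -> float:
    """Return the percentage of bases in seq that are 'G' or 'C'."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def highest_gc(text: str) -> tuple[str, float]:
    """Return the ID and GC-content of the record with the highest GC-content."""
    records = parse_fasta(text)
    best = max(records, key=lambda name: gc_content(records[name]))
    return best, gc_content(records[best])


name, gc = highest_gc(">a\nAAAA\n>b\nGGCC\nAT\n")
print(name)
print(f"{gc:.6f}")
```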

Counting Point Mutations

Given: Two DNA strings s and t of equal length (not exceeding 1 kbp).
Return: The Hamming distance d_H(s, t).
My Code
Sample Data that I Worked With
Problem Description in Rosalind
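
The Hamming distance is the number of positions at which the two strings differ. A minimal sketch, assuming the inputs have equal length as the problem guarantees (`hamming_distance` is a hypothetical name):

```python
def hamming_distance(s: str, t: str) -> int:
    """Count the positions at which s and t differ (equal lengths assumed)."""
    # zip pairs up corresponding symbols; True counts as 1 when summed.
    return sum(a != b for a, b in zip(s, t))


print(hamming_distance("GAGCCTACTAACGGGAT", "CATCGTAATGACGGCCT"))
```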

Mendel's First Law

Given: Three positive integers k, m, and n, representing a population containing k+m+n organisms: k individuals are homozygous dominant for a factor, m are heterozygous, and n are homozygous recessive.
Return: The probability that two randomly selected mating organisms will produce an individual possessing a dominant allele (and thus displaying the dominant phenotype). Assume that any two organisms can mate.
My Code
Problem Description in Rosalind

Translating RNA into Protein

Given: An RNA string s corresponding to a strand of mRNA (of length at most 10 kbp).
Return: The protein string encoded by s.
My Code
Sample Data that I Worked With
Problem Description in Rosalind
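
Translation reads the mRNA three bases at a time and looks each codon up in the standard genetic code, stopping at the first stop codon. The sketch below builds the codon table from a compact 64-character string; the names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to contain only the bases U, C, A, and G.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code, listed with the third base varying fastest
# (UUU, UUC, UUA, UUG, UCU, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(rna: str) -> str:
    """Translate an mRNA string into a protein string, stopping at a stop codon."""
    protein = []
    for i in range(0, len(rna) - 2, 3):
        amino_acid = CODON_TABLE[rna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("AUGGCCAUGGCGCCCAGAACUGAGAUCAAUAGUACCCGUAUUAACGGGUGA"))
```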

Finding a Motif in DNA

Given: Two DNA strings s and t (each of length at most 1 kbp).
Return: All locations of t as a substring of s.
My Code
Sample Data that I Worked With
Problem Description in Rosalind
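
Rosalind reports positions starting from 1, and matches may overlap, so the search has to resume one character after each hit rather than after the end of the match. A minimal sketch with the hypothetical name `find_motif`:

```python
def find_motif(s: str, t: str) -> list[int]:
    """Return the 1-based start positions of every occurrence of t in s."""
    positions = []
    start = s.find(t)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        # Resume one character later so overlapping matches are found.
        start = s.find(t, start + 1)
    return positions


print(*find_motif("GATATATGCATATACTT", "ATAT"))
```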

Consensus and Profile

Given: A collection of at most 10 DNA strings of equal length (at most 1 kbp) in FASTA format.
Return: A consensus string and profile matrix for the collection. (If several possible consensus strings exist, then you may return any one of them.)
My Code
Sample Data that I Worked With
Problem Description in Rosalind
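
The profile matrix counts how often each base appears at each position, and a consensus string takes a most common base per position. The sketch below assumes the sequences have already been read from FASTA into a list of equal-length strings; `profile_matrix` and `consensus` are hypothetical names, and ties are broken in favor of the base that comes first in "ACGT".

```python
def profile_matrix(seqs: list[str]) -> dict[str, list[int]]:
    """Count occurrences of each base at each position across seqs."""
    length = len(seqs[0])
    profile = {base: [0] * length for base in "ACGT"}
    for seq in seqs:
        for i, base in enumerate(seq):
            profile[base][i] += 1
    return profile


def consensus(profile: dict[str, list[int]]) -> str:
    """Pick a most frequent base at each position of the profile."""
    length = len(profile["A"])
    return "".join(
        max("ACGT", key=lambda base: profile[base][i]) for i in range(length)
    )


seqs = ["ATCCAGCT", "GGGCAACT", "ATGGATCT", "AAGCAACC",
        "TTGGAACT", "ATGCCATT", "ATGGCACT"]
profile = profile_matrix(seqs)
print(consensus(profile))
for base in "ACGT":
    print(f"{base}:", *profile[base])
```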
