Coder Social home page Coder Social logo

bbtools's Introduction

06/09/2009, 11:20am:
====================
fasta2datastore.py works, with robust logging. python fasta2datastore -h for instructions; note that you will need to
have AppEngine (i.e., whatever directory contains google/appengine/...) in your PYTHONPATH. 

06/09/2009:
===========
Working on a rather hackish little script, fasta2datastore.py, which will populate a datastore using the local
development server; this can then be exported and uploaded en masse to appspot. I don't recommend that anyone else use 
the approach I'm using, btw; I shouldn't be using it myself. :P

05/09/2009:
===========
sbdtp/properties.py defines a custom property, DNAProperty, for use with Google AppEngine. It stores DNA sequences using
Bio.Seq from biopython. Note that for some reason, Bio.Seq doesn't actually validate whether the sequence conforms to the
chosen Alphabet, but this property does take care of that, using its .validate() method.

AppEngine doesn't provide a facility for filtering on custom properties, but I'm going to modify the fulltext search tool
(see http://github.com/DocSavage/appengine-search/tree/master) so that we can search for substrings in a sequence, using
the .find() method of Bio.Seq.

04/09/2009:
===========
getattributes.py and secondarygetattributes.py are a couple of python scripts for scraping the DAS server at
partsregistry.org (or any DAS server, really). 

allbricks-DDMMYYYY.tar.bz2 is the features and sequence scraped for all bricks that these scripts could find on DDMMYYYY.

Usage:

$ python getattributes.py http://path/to/DAS/entry_points

This will pull the entry points list from the DAS server and then fetch the features and sequence for each entry point,
putting them into subdirectories (features/ and sequences/) of whatever directory you run the script in.

secondarygetattributes.py is BioBricks-specific. For some reason, not every valid BioBrick appears in the entry_points
list, but a lot of them appear as features. This script finds named BioBricks in features files and grabs their features
and sequences from the DAS server.

secondarygetattributes.py takes no arguments, just run the script.

Dependencies: lxml (easy_install lxml if your system has easy_install, http://codespeak.net/lxml if not)

Known issues:
 - lxml chokes on certain characters in text elements, e.g. '<' and '>'. This means that secondarygetattributes can't do
   anything with those files, so we may still be missing some data.
 - Not currently checking for any kind of HTTP errors, so any response other than 200 and the script dies. If this happens,
   just restart it; it won't bother to redownload anything that it already has both the features and sequence for.
 - Tags to use for identifying BioBrick names (e.g. SEGMENT and TYPE) are hardcoded; this should really be a commandline
   option.
 - Lots of duplicated code which really ought to be refactored out into a module.
 - Some form of commandline help / usage notes would be polite.

bbtools's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.