Coder Social home page Coder Social logo

dgreenhoe / symbolic-sequence-processing Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 788 KB

C++ code for symbolic sequence processing, including genomic sequence processing (GSP)

License: MIT License

C++ 94.29% C 2.09% CMake 1.12% Batchfile 0.14% Makefile 2.12% TeX 0.25%
sequence-analysis gsp genomic-data-analysis c-plus-plus cpp

symbolic-sequence-processing's Introduction

Readme file for Symbolic Sequence Processing repository

Daniel J. Greenhoe

Project Desription

This project is C++ code to support symbolic sequence processing, as described in the paper Greenhoe(2016). Below is an abstract for that paper:

Abstract for Greenhoe(2016)

A real-valued random variable X is a measurable function that maps from a probability space (Omega,E,P) to (R,<=,d,+,.,B) where B is the usual Borel sigma-algebra on the "real line" (X,<=,d,+,.), and where R is the set of real numbers, <= is the standard linear order relation on R, d(x,y)=|x-y| is the usual metric on R, and (R, +, .) is the standard field on R. Greenhoe(2015) has demonstrated that this definition of random variable is often a poor choice for computing statistics when the stochastic process that X maps from has structure that is dissimilar to that of the real line. Greenhoe(2015) has further proposed an alternative statistical system, that rather than mapping a stochastic process to the real line, instead maps to a weighted graph that has order and metric geometry structures similar to that of the underlying stochastic process. In particular, ideally the structure X maps from and the structure X maps to are, with respect to each other, both isomorphic and isometric. Such a mapping is useful for analysis of a single random variable---for example the expectation EX of X can be defined simply as the center of its weighted graph. However, the mapping has limitations with regards to a sequence of random variables in performing sequence analysis (using for example Fourier analysis or wavelet analysis) or in performing sequence processing (using for example FIR filtering or IIR filtering). Rather than mapping to a weighted graph, this present paper proposes instead mapping to an ordered distance linear space Y=(R^n,<=,d,+,.,R,+,x), where (R,+,x) is a field, + is the vector addition operator on R^n x R^n, and . is the scalar-vector multiplication operator on R x R^n. The linear space component of Y provides a much more convenient (as compared to the weighted graph) framework for sequence analysis and processing. The ordered set and distance space components of Y allow one to preserve the order structure and distance geometry inherent in the underlying stochastic process, which in turn likely provides a less distorted (as compared to the real line) framework for analysis and sequence processing.

References

symbolic-sequence-processing's People

Contributors

dgreenhoe avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.