Coder Social home page Coder Social logo

katosh / uap Goto Github PK

View Code? Open in Web Editor NEW

This project forked from fraunhofer-izi/uap

0.0 1.0 0.0 30.9 MB

Universal Analysis Pipeline - A python tool for controlled and coordinated data analysis

License: GNU General Public License v3.0

Shell 4.54% Python 94.79% R 0.67%

uap's Introduction

uap -- Universal Analysis Pipeline

Authors

Christoph Kämpf, Michael Specht, Alexander Scholz, Sven-Holger Puppel, Gero Doose, Kristin Reiche, Jana Schor, Jörg Hackermüller

uap: reproducible and robust HTS data analysis. BMC Bioinformatics 20, 664 (2019)

Introduction

The uap package is a framework to configure, run, and control large data multi-step analyses. Its main focus is on the analysis of high-throughput sequencing data.

The aim of this data processing pipeline is to enable robust and straightforward bioinformatics data evaluation. It is implemented in Python, runs under GNU/Linux and can be controlled from the command-line interface. Although the primary focus is the evaluation of sequencing data, its design allows for a variety of other applications.

About this Repository

This repository contains the development status of uap at Fraunhofer IZI. It is based on the uap repository as published in Kämpf, C., Specht, M.,Scholz, A. et al. uap: reproducible and robust HTS data analysis. BMC Bioinformatics 20, 664 (2019), which is located here.

This version v2.0.0rc2 contains the following changes (for a complete list see the CHANGELOG):

  • code conversion from Python2 to Python3
  • improved user interaction
  • enhanced error detection for configuration
  • validation of existing results by using annotation as configuration and recalculation of SHA256
  • status --details completely lists errors or changes caused by adaptation of the configuration
  • enhanced detection of changes (software version, output files, sha256 of results (optional))
  • improved error-management
  • removed checksum suffix in output directories
  • extended backward-compatible connection-management
  • Source_controller step to check input data
  • no need to configure uap internal scripts, GNU coreutils and lmod
  • improved job-execution (signal handling, array jobs, enhanced logging, changes to configuration do not impact running jobs)
  • processes are executed in temporary directories
  • error fixing and code improvement

Please note, the version v2.0.0rc2 of uap requires Python >= 3.5 and is only tested on SLURM.

Contacts

Christoph Kämpf [email protected]

License

Copyright (C) 2019 Gesellschaft zur Foerderung der angewandten Forschung e.V. acting on behalf of its Fraunhofer Institute for Cell Therapy and Immunology (IZI).

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.

uap's People

Contributors

katosh avatar specht avatar alexanderscholz avatar kmpf avatar mai00fti avatar jhackermueller avatar boll3 avatar tiennes avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.