syndeseq's Introduction

Synechocystis RNAlocSeq Analysis

This repository contains a pipeline to analyze RNAloc-Seq Data from Synechocystis. The Pipeline can be run automatically via installing the conda environment from env.yml

conda env create -f env.yml

If you have the Data folder you can run the pipeline via

Warning The Data is not publically available yet

snakemake -s deseqpipeline.smk --cores 1 -k

This produces the PipelineData directory where you can find all important files.

The files within this directory contain wildcards, which map the to the corresponding analysis pipeline. The following Table describes the abbreviations used in file names:

Abbreviation	Values	Description
set	Light, LightDark, LightDarkLight	The Dataset used. Light contains everything but the Puromycin columns from the Light Dataset. LightDark represents the Light part of the LightDark Dataset LightDarkLight is the merged Dataset containing all Light columns from the Light and LightDark Dataset
design	rf	A mapping mapping the abbreviation to the design formula used in DESeq2. rf: "~ replicate + fraction"
cond	M, TC, C	Which fraction to compare: M: Membrane C: Cytosol TC: Total Cell
base	M, TC, C	The base to compare cond to. See cond for description
ud	upregulated, downregulated	Only present in Enrichment files. Shows whether the file shows up or downregulated enriched set.
ont	MF, BP, CC	Ontology subcategory: Molecular Function, Biological Process, Cellular Compartment

Recommend Projects