Coder Social home page Coder Social logo

ricetfs's People

Contributors

timedreamer avatar

Stargazers

 avatar

Watchers

 avatar  avatar

ricetfs's Issues

OsNAC127/129 few peaks

In the paper, it says:

After sequencing, 185, 6125, 220, 455 peaks were finally obtained respectively in OX127H, OX127N, OX129H and OX129N.

However, I got: 52--> NAC127H; 179--> NAC127N; 132-->NAC129H; 139-->NAC129N.

I tried some several settings in masc2, but always have fewer peaks than reported.
The method that got more peaks would be analyzing three reps individually with the control. Then combine all the peaks.

GSE72186 OsNF-YB1 ChIP replicate 1 and 2

For GSE72186 OsNF-YB1 ChIP, genes that found in either replicate were kept.
I think this is similar to what the paper did:

Overlapping peaks (having a stronger but narrower peak in at least one replicate) between two biological replicates were used for further analysis.

The final file is the GSE72186_rep12.MACS2_summits_gene_list_2404

OsTF1L(Rock 10) few peaks

It's not clear how they call peaks for this TF in the paper. I have many issues with the dataset.

  1. There are no control or input for Roc10-myc α-myc.
  2. The GEO supplementary tables are very confusing. What's the meaning for 46, H3K4me3, H3K27me3? 46 probably means transgenic plants?
  3. How were the binding sites in Table 1 calculated? I thought they were anti-myc. But in the GEO tables, many of them were labeled something else, like chr03-7132344-7133928-NT-H3K4me3-7455. Besides, the peak range is too long, probably from --broad in MACS? But should TF use this setting?

In summary, I re-did the analysis using the same pipeline as PE.
Only SRR5164450 Roc10-myc α-myc ChIPseq was used for peak calling.
I have to use --nomodel --extsize 147, otherwise, no peaks could be found.

In the end I got 161 peaks correspondent to 85 genes (within 2kb region)

problems with MADS78/79

  1. The data was from this paper.

  2. In the paper TableS3, there are 14 libraries. However, only 12 in the GEO/SRA. Missing one WT-48HAF and one WT-72-HAF.

  3. There is only one replicate of genotype x time combination. For example, one replicate of OE78 x 48HAF. So there is no way to do good the statistical tests. See table below.
    image

  4. The conclusion: give up MADS78/79. Unless we want to combine 48HAF and 72HAF and use as replicates.

multiple DE gene list for one TF

Does one TF allow multiple DE gene list?
For example, there are two lines fo mutants bzip76-1 and bzip76-2 with wildtype.
Do I keep two lists as bzip76-1 vs wt and bzip76-2 vs wt?

Or combine them together? Is so, should I use a union or intersection?

OsNAC6 data set alone

This paper has a root specific OX line, whole body OX line and a KO mutant, but I could only find one replicate for each of the mutant lines - so we probably can't use this right? - I think it is fine to skip since we already have OsNAC6 from the paper with the 4 NACs.

problems with OsNTL3

  1. there is no treatment info from the SRP meta-data. So I used the GSM ID to find the treatment info from GSE.
    image

  2. I added the info to the treatment column in SraRunTable_20200430_11SRP.tsv file.

  3. There are two SRA correspondent to one GSM. At first, I thought they split the single-end files because of the file size. However, they all have the same number of reads. So I think they are actually paired-end reads! But which is read1 and which is read2 is not clear.

image

In conclusion, unless we really want this TF. I can try to reallign. Otherwise, I gave up this TF.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.