Coder Social home page Coder Social logo

cytodata-hackathon-2018's People

Contributors

annecarpenter avatar cells2numbers avatar erinweisbart avatar jccaicedo avatar shntnu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

cytodata-hackathon-2018's Issues

Data posted on AWS Open Data is inconsistent

Screen Shot 2020-03-12 at 7 45 48 PM

There are new folders called profiles_cp and they were presumably previously names profiles. Its possible this was done when we were reprocessing the data.

@jccaicedo Any clue what might have happened here?

Only Bioactives-BBBC022-Gustafsdottir has both profiles_cp and profiles. The others have only profiles_cp.

Run and add QC for all Cell Painting and Deep Learning profiles

  • check if all mean profiles are available
  • run QC
    • for CP features: run the standard pipeline including feature selection and replicate correlation
    • for DL features: 1. run sanity check (make sure that all features have non zero values) and run similar tests feature selection and calculate replicate correlation

BBBC022 on s3://cytodata cannot be downloaded

I'm trying to copy the BBBC022 profiles from cytodata s3 bucket to set up a benchmark pipeline for our cell painting method development. I can list the files in the bucket, but cannot copy or sync anything. Could you please change the permissions accordingly?
Thanks.

BBBC037 dataset: example code

Treatments in BBBC037 are in the column Metadata_pert_name instead of Metadata_gene_name.

Aggregation code should be changed in the Python and R code examples.

Data normalization in cell painting gallery

There are multiple versions of single cell features in cell painting gallery. Could you describe which processing tasks were done on them? I figured out the _augmented, which is features + metadata. Where could I find information about processing steps for the other files? Are they the same as in the source papers?

For example: _normalized - what kind of normalization was used?
_feature_select - what methodology was used?

Basically, any information that I could use to reproduce the processing steps used on the datasets.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.