Coder Social home page Coder Social logo

dimensional-reduction's Introduction

dimensional-reduction - UseR22

Biology is evermore dependent on high-dimensional data. In this context, it can be challenging to evaluate and explore how this dimension reduction is used to extract information, help with data visualization and provide better models. In this hands-on tutorial, we will use dimensional reduction techniques (PCA, ICA, Multidimensional Scaling, t-sne, and U-MAP) in data to see how they behave. Our goal is to provide students an overview of some frameworks that can be used in R to do, thus harvesting R's power. First, students will use the tidy verse and tidy model frameworks to learn PCA and ICA. Secondly, they use other packages to learn Multidimensional Scaling. Thirdly, we will see how this field is still growing rapidly and how we can use reticulate to execute programs and code that has been written in python in R. We will then, use reticulate together with scikit-learn to do t-sne and U-Map from R. Lastly, students will discuss what they have learned and their main takeaways from the tutorial.

Requirements

This tutorial is aimed at people who already have some familiarity with R (they should be able to import data, be familiar with some data frame manipulations) that come from a biological background or are interested in applying this in a biological context. However, familiarity with data analysis in OMICs data or ecology is not required. Neither is knowledge of the packages themselves nor python. Ideally, the person should just be motivated to learn how to compare and implement different types of dimensional reduction methods.

Outline

Section 1: -Introduction -Explaining motivation behind dimensionality reduction -Examples from MultiOmics Data and Ecology (Slides) -Theoretical overview of methodologies: -Principal component Analysis -Multidimensional scaling -Independent Component Analysis (ICA) -t-SNE -U-MAP

  • Pooling for questions regarding method
  • Importing data to R
  • Exploring data frame values
  • Opening for more questions and student’s progress

--- Fist break (10 minutes) ---

Section 2:

  • Introduction to tidy models and usage of recipes for PCA and Independent Component Analysis (Slides)
  • Building recipes for PCA
  • Building recipes - students
  • Check-in and discussion from questions

--- Second break (10 minutes) ---

Section three:

  • Multidimensional scaling in R using other packages
  • Writing a script for Multidimensional scaling with students
  • Students try Multidimensional scaling with other distances
  • Check-in and discussion from questions

-- Third break (10 minutes)--

Section four: --Introduction to new methodology: examples from recent research developments in the field

  • What if some method I am interested in was developed in python?
  • Introduction to reticulated
  • Introduction to how we can use scikit learn in python
  • Showing the docs and coding for the t-sne in python
  • Coding for U-MAP with students
  • Check-in with student's progress and question
  • Wrapping up - Collaborative takeaways from this tutorial

dimensional-reduction's People

Contributors

bellabf avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.