Advanced Methods in Data Analysis

Instructor: Alejandro Correa Bahnsen

The use of statistical models in computer algorithms allows computers to make decisions and predictions, and to perform tasks that traditionally require human cognitive abilities. Machine learning is the interdisciplinary field at the intersection of statistics and computer science that develops such algorithms and integrates them into computer systems. It underpins many modern technologies: speech recognition, internet search, bioinformatics, computer vision, Amazon’s recommender system, Google’s driverless car and the most recent imaging systems for cancer diagnosis are all based on machine learning.

This course on Time Series Analysis, Machine Learning and Natural Language Processing explains how to build systems that learn and adapt, using real-world applications. Topics include time series analysis, machine learning, data analysis in Python, natural language processing models and recurrent models. The course is project-oriented, with emphasis on writing software implementations of learning algorithms applied to real-world problems such as churn modeling, natural language processing and sentiment detection.

Requirements

  • Python version 3.7;
  • NumPy, the core numerical extensions for linear algebra and multidimensional arrays;
  • SciPy, additional libraries for scientific programming;
  • Matplotlib, excellent plotting and graphing libraries;
  • IPython, with the additional libraries required for the notebook interface;
  • Pandas, the Python counterpart of R's data frame;
  • Seaborn, used mainly for plot styling;
  • scikit-learn, a machine learning library.

A good, easy-to-install option that supports Mac, Windows, and Linux and includes all of these packages (and much more) is Anaconda.
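To confirm that an environment has the packages listed above, a short check along the following lines may help. This is only a sketch: the function name `check_requirements` and the `REQUIRED` list are illustrative and not part of the course material, and the version lookup assumes each package exposes a `__version__` attribute.

```python
import importlib

# Import names for the packages listed above; note that scikit-learn
# is imported as "sklearn". This list is an assumption for illustration.
REQUIRED = ["numpy", "scipy", "matplotlib", "IPython", "pandas", "seaborn", "sklearn"]


def check_requirements(modules):
    """Map each module name to its version, "unknown" if it imports but
    exposes no __version__, or None if it cannot be imported."""
    report = {}
    for name in modules:
        try:
            module = importlib.import_module(name)
        except ImportError:
            # Package is not installed in the current environment.
            report[name] = None
        else:
            report[name] = getattr(module, "__version__", "unknown")
    return report


if __name__ == "__main__":
    for name, version in check_requirements(REQUIRED).items():
        print(f"{name:12s} {version if version is not None else 'MISSING'}")
```

Running the script prints one line per package; any line marked `MISSING` points to a package that still needs to be installed.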

Git is unfortunately out of the scope of this class, but please take a look at these tutorials

Evaluation

  • 75% Projects (3 projects, 25% each)
  • 15% Exercises
  • 10% Class participation

Schedule

Time Series Analysis

Date | Session | Notebooks/Presentations | Exercises
June 16th | ARIMA Processes | |
June 18th | Working with TSA | |

Machine Learning Systems

Date | Session | Notebooks/Presentations | Exercises
June 23rd | Decision Trees & Ensembles | |
June 24th | Random Forest and XGBoost | |
June 25th | Machine Learning as a Service | |

Natural Language Processing

Date | Session | Notebooks/Presentations | Exercises
June 30th | Natural Language Processing | |
July 1st | Sentiment Analysis | |
July 2nd | NLP using Neural Networks | 16 - LSTM |
