Coder Social home page Coder Social logo

kaggle-trackml's Introduction

Solution #7 for competition trackML

Yuval Reina: [email protected] Trian Xylouris: [email protected]

Below you can find a outline of how to reproduce our solution for the trackML competition.

For any questions, please contact us.

ARCHIVE CONTENTS

  • files: Directory containing the competition's event files and the user prepared training files
    • df_test_v1.pkl : user prepared validation file for ML algorithm
    • df_train_v2_reduced.pkl :user prepared training file for ML algorithm
    • event*-*.csv :competition's event files
  • functions: Directory with python code
    • cluster.py :the clustering functions
    • expand.py :expanding functions
    • ml_model.py :functions related to the Machine Learning algorithm
    • other.py :utility functions
  • trackml-library-master: Directory with competition utility files (https://www.kaggle.com/c/trackml-particle-identification/discussion/55708)
  • conda_python-dependencies.yml :conda environment file
  • create clustering.ipynb :jupyter notebook, used to create solutions for training
  • Create training.ipynb :jupyter notebook, used to create training files
  • trackML_solution.ipynb :jupyter notebook, our main solution notebook

HARDWARE:

We used verious hardware to train and run our solution. Any modern computer which can run ipython and jupyter notebooks will be ok. The software was tested on Windows 10 and Ubuntu 16.04 LTS.

SOFTWARE (python packages are detailed separately in requirements.txt):

Conda - 4.6.11

Python 3.6

IPython 6.2.1

On linux machine you can build your conda environment like this:

conda env create -f conda_python-dependencies.yml

Preparing training files

The notebooks used to prepare the training files are:

  1. create clustering.ipynb - used to create solutions, which are used to select false tracks for training.

Change path to point to the path where you put the training events from kaggle

Change out_path to point to the path where you want to store the clustring results

If you want to try the ML algorithm with another solution algorithm, you can still use clustering to build false tracks, or use your algorithm to do it.

  1. Create training.ipynb - used to create the training files.

Change train_path to point to the path where you put the training events from kaggle

Change clustered_path to point to the path where you storeed the clustring results

The training results would be stored if the directory 'files'

Running

Follow trackML_solution.ipynb and run the full solution.

You can also see and run most parts of this solution on kaggle's kernel

kaggle-trackml's People

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.