Coder Social home page Coder Social logo

bartekpog / modelcreator Goto Github PK

View Code? Open in Web Editor NEW
6.0 1.0 0.0 366 KB

Simple python package for creating predictive models

Home Page: https://pypi.org/project/modelcreator/

License: MIT License

Python 100.00%
automl machine-learning sklearn estimator package python predictive-modeling model-selection

modelcreator's People

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

modelcreator's Issues

Add learning progress bar

Add a progress bar showing the status of learning.

This task requires writing custom GridSearchCV function.

I found that this may be achieved this way. I also found this lbrary which seems to be an easy to use progress bar module.

Add expected computation intensity to interface

We shall give the user a possibility to choose whether to make a quick best model comparison or a robust, time-consuming analysis. It should influence the number of parameters and models analyzed in a Grid Search. It should be modifiable as an initial machine state, or later as a machine object parameter.

Add dask[dataframe] dependency

Problem

Dask[dataframe] is now an optional dependency in dask- so it does not install automatically. Now after installing the modelcreator package user has to install it by himself.

Solution

Add dask[dataframe] dependency with the same version as dask to the setup.py.

Add rare classes multiplication feature

We have to handle class imbalance. One of the methods is multiplying the rows with rare classes to equalize the class frequency in train dataset.

Add the proper method and expand the interface to handle this feature.

Visualization/report

Plan first version of the visualization/report that will be generated after running the program. Suggested format is HTML. Can be divided into two separated tasks if necessary - visualization and report.

Find imbalanced datasets

We need more imbalanced datasets for testing purposes. Find those and send links to those here.

How to handle imbalanced data?

What are the various ways to handle the imbalanced datasets?

One of them is multiplying rare classes, but are there other ways?

Translate issues to English

Translate all current issues to English.
Issues content shall not differ from the repository language. Let's keep things clean.

Expand models set

Add other and more various models to learning set.

The crucial ones are SVM models - both for regression and classification with multiple kernels and BalancedRandomForestClassifier from imblearn library.

Add model comparison metrics to interface

Currently, all classification models are compared by their prediction accuracy. It is acceptable in some cases, but in some imbalanced datasets it may lead to poor grid search decisions and not choosing the best model.

Give user a chance to override the default accuracy metrics by others - for example, those already supported by Grid Search in sklearn library.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.