Coder Social home page Coder Social logo

vmware-samples / efficient-multiclass-classification Goto Github PK

View Code? Open in Web Editor NEW
0.0 3.0 0.0 26 KB

Duet is a scikit-learn classifier for resource-efficient multiclass classification that incorporates the advantages of bagging and boosting decision-tree-based ensemble methods (DTEMs) by using two classifiers instead of a monolithic one. A simple bagging model is trained using the entire training dataset and is responsible for capturing the easier concepts. Then, a boosting model is trained using only a fraction of the dataset representing the concepts the bagging model finds hard.

License: Other

Python 100.00%

efficient-multiclass-classification's Introduction

Duet scikit classifier (v1.0)

Overview

Duet is a decision tree ensemble method based multiclass classification framework that offers a more efficient resource usage while preserving and even improving the classification accuracy in comparison to standard monolithic models.

Duet is based on a small bagging ensemble model and a booting model.
The current implementation of Duet is based on Random Forest and XGBoost.

Documentation

More details about the Duet can be found in the following paper:
"Efficient Multiclass Classification with Duet" [EuroMLSys '22]
https://dl.acm.org/doi/abs/10.1145/3517207.3526970
https://euromlsys.eu/pdf/euromlsys22-final4.pdf

Files:

duet_classifier.py

Duet scikit classifier

classification_example.py

Basic classification example by Duet

grid_search_example.py

Basic grid search example with Duet

Prerequisities:

numpy
pandas
skleran
xgboost

or alternatively, run:
$ pip3 install -r requirements.txt

Contributing

The efficient-multiclass-classification project team welcomes contributions from the community. Before you start working with efficient-multiclass-classification, please read our Developer Certificate of Origin. All contributions to this repository must be signed as described on that page. Your signature certifies that you wrote the patch or have the right to pass it on as an open-source patch. For more detailed information, refer to CONTRIBUTING.md.

License

BSD-3 License

Contact us

For more information, support and advanced examples contact:
Yaniv Ben-Itzhak, [email protected]
Shay Vargaftik, [email protected]

efficient-multiclass-classification's People

Contributors

fmanning07 avatar shayvar avatar vmwghbot avatar yanivbi avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.