Comments (6)
This is a good question. I might argue that:
- this could be a setting
- the base behaviour should probably be that you don't use the predictions from a previous model, rather that you the same features and that you had before (depending on other transformations)
from scikit-lego.
Not sure this is what you're looking for but I wrote a generic gradient library called starboost from which you might want to inspire yourself!
from scikit-lego.
The idea here is to implement a pipeline like;
from sklego.transformers import RandomAdder
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForest
...
mod = BoosterPipeline([
("scale", StandardScaler()),
("random_noise", RandomAdder()),
("global_model", LogisticRegression(solver='lbfgs')),
("residual_model", RandomForest())
])
...
It's more of a UI thing. The global_model
will first run and then the BoosterPipeline
should recognise that the residuals need to be picked up.
from scikit-lego.
So it is still like some form of stacking? Are you going to reuse the features that the global used, or are you going to base yourself on the predictions?
from scikit-lego.
I am wondering if this feature might break up the workflow too much. Instead it might also work if we just have a Booster
-model that accepts many different models.
from scikit-lego.
I tried to build something like this quite a while back and remember it took a lot of changes in the normal pipeline behaviour. A Booster
meta-model might be simpler
from scikit-lego.
Related Issues (20)
- [DOCS] Separate page for each meta feature HOT 2
- [DOCS] Document KlusterFoldValidation HOT 3
- [DOCS] Broken links on Home page to installation and user guide sections
- [DOCS] Remove netlify docs HOT 2
- [DOCS] Proposed addition: Adding a Quickstart or Overall User Guide Landing Page
- [DOCS] Latex markdown mixup HOT 1
- [DOCS] Missing explanation on how to run the documentation locally HOT 1
- [BUG] Rename `transform_train` to `resample`. HOT 8
- `linear_model.LowessRegression`
- `decomposition.pca_reconstruction.PCAOutlierDetection` HOT 1
- `decomposition.umap_reconstruction.UMAPOutlierDetection` HOT 5
- Delegate Missing Values and Categorical Handling in `GrouperTransformer` and `GrouperPredictor` HOT 6
- [FEATURE] Narwhals migration for dataframe-agnostic codebase HOT 23
- [BUG] zero_inflated_regressor.py HOT 1
- [FEATURE] equivalent to sklearn discovery module HOT 7
- [BUG] Fairness regularization HOT 1
- ModuleNotFoundError: No module named 'narwhals' when using RepeatingBasisFunction HOT 3
- [FEATURE] Ability to stratify with cols that contain some Nans values, this way people can hyperparameter tune best imputation methods HOT 1
- [BUG] CI/CD Failing
- [DOCS] linear_model missing docstrings HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scikit-lego.