Comments (3)
Data is split sequentially based on the indexing at the start. So in general, they will share the same split (unless the number of folds is varied, but there would still be an overlap). One way around this would be to shuffle the input data between layers.
Currently, if shuffle=True
, the input is shuffled at the ensemble level before any calls to the ParallelManager
. To prevent sharing splits, shuffling inputs should be performed between layers, so the it needs to be moved in to the ParallelManager.process
method. Shouldn't be too hard though.
from mlens.
Thanks for your prompt reply!
from mlens.
Not so fast :)
I think it makes sense to shuffle the samples between layers. Let's keep the issue open until implemented.
from mlens.
Related Issues (20)
- OSError: [Errno 24] Too many open files HOT 1
- Serialize mlens superlearner with KerasRegressor inside HOT 1
- mlen superlearner for MIMO multi-input multi-output HOT 2
- Error when using sklearn StratifiedKFold in Evaluator CV HOT 1
- getting zero score accuracy on test data
- If I already have trained models, how can I use mlens HOT 3
- confirmation
- Save / Restore model HOT 5
- How do I know the weight of the base model assigned by the meta model?
- Adding custom models in the superlearner
- Apply preprocessing to target variable as well
- Monotonic constraints
- Error when using preprocessing per case in model selection HOT 2
- Error involving Collections Module
- Getting error when executing the ensemble.fit(X_train, y_train) command HOT 1
- Prediction failing with 1 row of test data
- why the predict_proba() function do not return the probabilities?
- Error while running ensemble.fit(X_train, y_train)
- Error in index/base.py when using NumPy 1.24 or higher - Replace `np.int` with `np.int_`
- Superlearnerl on google colab (python 3.10 or 3.7) HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mlens.