Hi. I am trying to get a CI for a new x value that was not in the training set. <d

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

V_IJ_unbiased is zero, when x_test = 1,about scikit-learn-contrib/forest-confidence-interval

Comments (3)

agrawalraj commented on June 7, 2024

I think the issue is a bug on line 239 of forestci.py. I think this line should be replaced by

pred_mean = np.mean(pred, 1)
pred_centered = (pred.T - pred_mean).T

because we want to average over the bootstrap dimension, not the test dimension.

from forest-confidence-interval.

cewaphi commented on June 7, 2024

@agrawalraj I have also faced this issue some time ago.

The solution I found back then is similar to yours.
While I was debugging the function from which you were copying lines I found out:
'pred' had the following dimensions:
0: the samples
1: the prediction for each tree
It is the result of this line:
pred = np.array([tree.predict(X_test) for tree in forest]).T

Due to the dimension of the prediction array for one sample the mean calculation might not return the result that is expected.
Either way, it does not make sense to average the prediction of different samples for the same tree instead of averaging the predictions of all trees of the forest for one sample.

This did the fix for me:
pred_mean = np.mean(pred, 1).reshape(X_test.shape[0], 1)
Nothing else had to be changed.

Maybe it would be benefitial to include the single (test) sample case (as in LOOCV) in the code tests.

from forest-confidence-interval.

arokem commented on June 7, 2024

We'd welcome a pull request

from forest-confidence-interval.

Recommend Projects

V_IJ_unbiased is zero, when x_test = 1 about forest-confidence-interval HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent