Coder Social home page Coder Social logo

Comments (8)

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
i think you could consider the voting of trees in the ensemble as the 
confidence measure. like if most of the trees in the ensemble agree on the 
label, it can be considered high confidence of the ensemble for the label.

look at the ode to obtain prediction per tree.
http://code.google.com/p/randomforest-matlab/source/browse/trunk/RF_Class_C/tuto
rial_ClassRF.m#230

this variable is explained here
http://code.google.com/p/randomforest-matlab/source/browse/trunk/RF_Class_C/clas
sRF_predict.m#22

If there are 500 trees and 10 examples you will have a 10x500 matrix. You can 
find the number of labels for each class (per example) and just divide by the 
number of trees to get a probability or confidence measure.

do tell me if you have more questions.

Original comment by abhirana on 8 Oct 2010 at 3:03

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
That was very clear and what I exactly wanted. Thanks.

Original comment by gayumahalingam on 8 Oct 2010 at 1:11

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
Hi, before anything, many thanks for the code. Came across with it a couple of 
days ago and it looks great.
May I ask you how can I calculate the level of confidence (let's say the 
standard deviation) of my test set when applying RF regression? (For example, 
the simplest problem: I have vectors xtrain and ytrain (1 feature only), and 
predict ytest given xtest)

Thanks

Original comment by [email protected] on 26 Jan 2012 at 12:53

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
hi

one way to look at confidence interval is to look at the output of each tree 
for each test example and that might serve as a confidence interval for that 
particular test example (maybe a histogram to examine the exact distribution). 
note that when you pass a test example to a tree, it will end up at one of the 
leaf node. the output of each tree for a given test example is the avg of the  
y from the training examples for regression (for classification its more like 
that majority vote). 



a more involved manner would be considering quantile regression forest 
http://jmlr.csail.mit.edu/papers/volume7/meinshausen06a/meinshausen06a.pdf
in a nutshell: you can decompose the output of the trees as a linear weighted 
model of the training examples. note that for a regression tree you end up at a 
leaf node and the avg of the y values (from training) at that leaf node is the 
output of the tree. taking that same with all the trees in the forest you can 
represent each test example as a weighted avg of the training examples and in a 
way describe any test example in terms of the training examples. i dont have 
this implemented, it requires getting what training examples land at each leaf 
node in the tree.



Original comment by abhirana on 26 Jan 2012 at 2:31

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
Thank you for this. I'll have a look in quantile regression forests. 

Original comment by [email protected] on 26 Jan 2012 at 10:45

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024

Original comment by abhirana on 8 Apr 2012 at 11:57

  • Changed state: Done

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
Hi.. may i know how to calculate the probability of my test data.

x_trn=[A B];
y_trn=a vector of zeros and ones.

output that i get form classifer is ones and zeros,but what i need is just 
probabilites . any help here  ??

Original comment by [email protected] on 4 Nov 2014 at 10:46

from randomforest-matlab.

GoogleCodeExporter avatar GoogleCodeExporter commented on May 30, 2024
and iam also missing some of the fields in 'model'..i dont have errtr and 
xbestsplit,so iam not able to perform the operation.why it is so?

Original comment by [email protected] on 4 Nov 2014 at 12:21

from randomforest-matlab.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.