Wednesday, March 30, 2011

Fully Automated Prediction with Random Forests

To classify a new input vector, put it down each of the trees in the forest. Each tree gives a classification, and we say the tree "votes" for that class. The forest chooses the class having the most votes over all the trees in the forest.
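The voting step can be sketched in a few lines of Python; the lambda "trees" below are hypothetical stand-ins for real fitted decision trees:

```python
from collections import Counter

def forest_predict(trees, x):
    # Each tree casts one vote; the class with the most votes wins.
    votes = [tree(x) for tree in trees]
    return Counter(votes).most_common(1)[0][0]

# Three toy "trees" (hypothetical stand-ins for fitted trees):
trees = [lambda x: "A", lambda x: "B", lambda x: "A"]
print(forest_predict(trees, x=None))  # prints "A": two votes beat one
```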
Basically, Random Forests automatically generate many decision trees, each with only modest predictive power on its own, and gain strong accuracy by aggregating them. The algorithm can be sketched like this:
  1. for each tree, repeat:
    1. draw a bootstrap sample of the training data (rows sampled with replacement)
    2. grow a decision tree on it, at each node:
      1. considering only a small random subset of the features
      2. splitting on the feature and threshold that best separate the classes (e.g. by Gini impurity)
  2. combine all the trees' predictions (majority vote for classification, averaging for regression), voila!
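As a concrete illustration, the steps above can be written as a minimal from-scratch sketch in Python. This is toy code, not Breiman's reference implementation: bootstrap sampling, a random feature subset at each split, Gini impurity as the split criterion, and a final majority vote.

```python
import random
from collections import Counter

def gini(labels):
    # Gini impurity: 0 for a pure node, higher for mixed classes.
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels, feat_idxs):
    # Among a random subset of features, find the (feature, threshold)
    # pair that minimizes the weighted Gini impurity of the two children.
    best = None
    for f in feat_idxs:
        for t in set(r[f] for r in rows):
            left = [l for r, l in zip(rows, labels) if r[f] < t]
            right = [l for r, l in zip(rows, labels) if r[f] >= t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(rows, labels, n_feats, depth=0, max_depth=5):
    if len(set(labels)) == 1 or depth == max_depth:
        return Counter(labels).most_common(1)[0][0]  # leaf: majority class
    feat_idxs = random.sample(range(len(rows[0])), n_feats)
    split = best_split(rows, labels, feat_idxs)
    if split is None:
        return Counter(labels).most_common(1)[0][0]
    _, f, t = split
    li = [i for i, r in enumerate(rows) if r[f] < t]
    ri = [i for i, r in enumerate(rows) if r[f] >= t]
    return (f, t,
            build_tree([rows[i] for i in li], [labels[i] for i in li],
                       n_feats, depth + 1, max_depth),
            build_tree([rows[i] for i in ri], [labels[i] for i in ri],
                       n_feats, depth + 1, max_depth))

def tree_predict(node, x):
    # Internal nodes are (feature, threshold, left, right); leaves are labels.
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if x[f] < t else right
    return node

def grow_forest(rows, labels, n_trees=25, n_feats=1):
    trees = []
    for _ in range(n_trees):
        # Bootstrap: resample the rows with replacement for each tree.
        idx = [random.randrange(len(rows)) for _ in rows]
        trees.append(build_tree([rows[i] for i in idx],
                                [labels[i] for i in idx], n_feats))
    return trees

def forest_predict(trees, x):
    # Majority vote over all the trees.
    return Counter(tree_predict(t, x) for t in trees).most_common(1)[0][0]

# Toy data: label "A" below the line x + y = 1, "B" above it.
random.seed(0)
pts = [(random.random(), random.random()) for _ in range(100)]
lab = ["A" if x + y < 1 else "B" for x, y in pts]
trees = grow_forest(pts, lab)
print(forest_predict(trees, (0.1, 0.1)), forest_predict(trees, (0.9, 0.9)))
```

Because each tree sees a different bootstrap sample and a random feature at each split, the trees make different mistakes, and the vote averages those mistakes out.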

Result: strong predictive accuracy with essentially no parameter tuning!

http://stat-www.berkeley.edu/users/breiman/RandomForests/
