Thursday, March 31, 2011
Wednesday, March 30, 2011
Fully Automated Prediction with Random Forests
Put the input vector down each of the trees in the forest. Each tree gives a classification, and we say the tree "votes" for that class. The forest chooses the classification having the most votes (over all the trees in the forest).Basically Random Forests automatically generate many decision trees with mostly weak predictive goodness, and gain high predictive power by averaging them out. The algo can be sketched like this:
- randomly sample variables and predictors, repeat:
- identifying a predictor
- repeat down the tree
- seeking the most correlated variable
- make a binary decision out of it
- combine all predictions and average them out, voila!
Result: high predictive goodness sans parameters!
http://stat-www.berkeley.edu/users/breiman/RandomForests/
Tuesday, March 15, 2011
Switching of the Nuclear Powerplants in Germany now?
Today chancellor Merkel announced to switch of 7 nuclear power plants as a consequence of the fukushima catastrophe.
The decision means cutting 10% to 15% of Germanys power consumption, using the net power production median of the weakest nuclear powerplants (890 MW) and the median of the overall powerplants (1288 MW)
Update: they named the plants in question
de.wikipedia.org/wiki/Liste_der_Kernreaktoren_in_Deutschland
These have a net power of 771+785+878+890+1167+1240+1345= 7076 MW or 11.3% of annual consumption.
Update: the last data on power trade balance shows a surplus of 22 TWh
The decision means cutting 10% to 15% of Germanys power consumption, using the net power production median of the weakest nuclear powerplants (890 MW) and the median of the overall powerplants (1288 MW)
Update: they named the plants in question
de.wikipedia.org/wiki/Liste_der_Kernreaktoren_in_Deutschland
Unterweser | KKU | NI | E.ON | 1.410 | 1.345 |
Biblis B | KWB B | HE | RWE | 1.300 | 1.240 | |||||||||
Biblis A | KWB A | HE | RWE | 1.225 | 1.167 | |||||||||
Philippsburg 1 | KKP 1 | BW | EnBW | 926 | 890 | |||||||||
Isar/Ohu 1 | KKI 1 | BY | E.ON | 912 | 878 | |||||||||
Neckarwestheim 1 | GKN 1 | BW | EnBW | 840 | 785 | |||||||||
Brunsbüttel | KKB | SH | Vattenfall | 806 | 771 |
These have a net power of 771+785+878+890+1167+1240+1345= 7076 MW or 11.3% of annual consumption.
Update: the last data on power trade balance shows a surplus of 22 TWh
Monday, March 14, 2011
Hot Bozen
Since I study time series analysis, here a little exercise with data from Hydrographisches Amt South Tyrol: |
Its getting hotter: decomposing mean temperature in Bozen 1981-2010, the observed data is splitted into a seasonal component and a level The code for R: bz<-read.csv("data/bozen.csv.txt", header=FALSE) bz<-ts(bz[,1], start=c(1981,1), frequency = 12) |
Wednesday, March 9, 2011
R-Quote of the Day
"It is not meant to be the big data distributed crunching engine; it is more like the intellectual's answer to Excel." - such a post, and the day begins with a smile.
Subscribe to:
Posts (Atom)