Discussion:
[Scikit-learn-general] Random Forest Feature Importances Citation
Gavin Gray
2016-05-16 21:49:57 UTC
Permalink
In the Scikit-Learn documentation the feature importances are described as
coming from the relative depths features are used as decision nodes,
averaged across trees in the forest. Does anyone know which paper discusses
this method? Breiman's original paper seems to just talk about randomly
permuting the values of each variable and observing the change in the
objective function.

Thanks,
-Gavin
Sebastian Raschka
2016-05-16 23:08:37 UTC
Permalink
I’d say the probably best summary (and discussion) can be found
"Understanding variable importances in forests of randomized trees” by Gilles Louppe, Louis Wehenkel, Antonio Sutera and Pierre Geurts (with references to Breimans original proposed ideas)
http://papers.nips.cc/paper/4928-understanding-variable-importances-in-forests-of-randomized-trees.pdf

Best,
Sebastian
In the Scikit-Learn documentation the feature importances are described as coming from the relative depths features are used as decision nodes, averaged across trees in the forest. Does anyone know which paper discusses this method? Breiman's original paper seems to just talk about randomly permuting the values of each variable and observing the change in the objective function.
Thanks,
-Gavin
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
------------------------------------------------------------------------------
Mobile security can be enabling, not merely restricting. Employees who
bring their own devices (BYOD) to work are irked by the imposition of MDM
restrictions. Mobile Device Manager Plus allows you to control only the
apps on BYO-devices by containerizing them, leaving personal data untouched!
https://ad.doubleclick.net/ddm/clk/304595813;131938128;j_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
Loading...