Random Forest is an ensemble machine learning method developed by Leo Breiman in 2001. Since then, it has been considered a state-of-the-art solution in machine learning applications. Compared to other ensemble methods, random forests exhibit superior predictive performance. However, empirical and statistical studies show that the random forest algorithm generates an unnecessarily large number of base decision trees. This incurs high computational cost and prediction time, and can occasionally reduce effectiveness. In this paper, the authors survey existing random forest pruning techniques and compare their performance. The research covers both static and dynamic pruning techniques and analyses the scope for improving random forest performance through techniques including generating diverse and accurate decision trees, selecting high-performance subsets of decision trees, genetic algorithms, and other state-of-the-art methods, among others.
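As a hedged illustration of the subset-selection idea surveyed above, the sketch below greedily picks trees whose majority vote maximises accuracy on a validation set. The data layout (pre-computed 0/1 predictions per tree) and the greedy forward-selection rule are illustrative assumptions, not any specific paper's algorithm.

```python
# Static ensemble pruning sketch (hypothetical setup): each tree is
# represented by its pre-computed class predictions on a validation set,
# and trees are added greedily while the majority-vote accuracy improves.
from collections import Counter


def majority_vote(preds_per_tree, subset, n_samples):
    """Majority-vote prediction of the trees in `subset` for each sample."""
    votes = []
    for i in range(n_samples):
        counts = Counter(preds_per_tree[t][i] for t in subset)
        votes.append(counts.most_common(1)[0][0])
    return votes


def accuracy(pred, truth):
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)


def greedy_prune(preds_per_tree, y_val, max_trees):
    """Greedy forward selection: keep adding the tree that most improves
    validation accuracy; stop when no tree helps or max_trees is reached."""
    selected, best_acc = [], 0.0
    remaining = set(range(len(preds_per_tree)))
    while remaining and len(selected) < max_trees:
        best_t, best_gain = None, best_acc
        for t in sorted(remaining):
            acc = accuracy(
                majority_vote(preds_per_tree, selected + [t], len(y_val)), y_val
            )
            if acc > best_gain:
                best_t, best_gain = t, acc
        if best_t is None:  # no remaining tree improves the vote
            break
        selected.append(best_t)
        remaining.remove(best_t)
        best_acc = best_gain
    return selected, best_acc
```

With three toy trees, the selector keeps only the tree whose predictions match the validation labels, discarding the two redundant ones.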
Random Forest (RF) is an ensemble supervised machine learning technique that was developed by Breiman over a decade ago. Compared with other ensemble techniques, it has proved its accuracy and superiority. Many researchers, however, believe that there is still room for enhancing and improving its performance accuracy. This explains why, over the past decade, there have been many extensions of RF where each extension employed a variety of techniques and strategies to improve certain aspect(s) of RF. Since it has been proven empirically that ensembles tend to yield better results when there is significant diversity among the constituent models, the objective of this paper is twofold. First, it investigates how data clustering (a well-known diversity technique) can be applied to identify groups of similar decision trees in an RF in order to eliminate redundant trees by selecting a representative from each group (cluster). Second, these likely diverse representatives are then used to pr...
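A minimal sketch of the clustering idea described above: trees whose validation predictions mostly agree are grouped together, and one representative per group is kept. The disagreement measure, the similarity threshold, and the greedy single-pass grouping are illustrative choices, not the exact algorithm of the paper.

```python
# Clustering-based pruning sketch (assumed setup): each tree is described
# by its prediction vector on a validation set; trees within a small
# disagreement threshold of a cluster's first member join that cluster.


def disagreement(a, b):
    """Fraction of validation samples on which two trees disagree."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def cluster_trees(preds_per_tree, threshold=0.1):
    """Greedy single-pass grouping: a tree joins the first cluster whose
    representative it agrees with closely enough, else starts a new one."""
    clusters = []  # each cluster is a list of tree indices
    for t, preds in enumerate(preds_per_tree):
        for cluster in clusters:
            rep = preds_per_tree[cluster[0]]
            if disagreement(preds, rep) <= threshold:
                cluster.append(t)
                break
        else:
            clusters.append([t])
    return clusters


def representatives(clusters):
    """Keep one tree per cluster; the redundant near-duplicates are dropped."""
    return [c[0] for c in clusters]
```

Two identical trees collapse into one cluster, so the pruned forest keeps one of them plus the dissimilar tree, preserving diversity while shrinking the ensemble.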
Systems Science & Control Engineering, 2014
2016
Random Forest (RF) is an ensemble classification technique that was developed by Leo Breiman over a decade ago. Compared with other ensemble techniques, it has proved its accuracy and superiority. Many researchers, however, believe that there is still room for further optimizing RF by improving its predictive accuracy. This explains why there have been many extensions of RF where each extension employed a variety of techniques and strategies to improve certain aspect(s) of RF. The main focus of this dissertation is to develop new extensions of RF using new optimization techniques that, to the best of our knowledge, have never been used before to optimize RF. These techniques are clustering, the local outlier factor, diversified weighted subspaces, and replicator dynamics. Applying these techniques to RF produced four extensions which we have termed CLUB-DRF, LOFB-DRF, DSB-RF, and RDB-DR respectively. Experimental studies on 15 real datasets showed favorable results, d...
IEEE Transactions on Information Theory
ArXiv, 2021
This appendix accompanies the paper ‘Improving the Accuracy-Memory Trade-Off of Random Forests Via Leaf-Refinement’. It provides results for more experiments which are not given in the paper due to space reasons. 1. Transformation of the Many-Could-Be-Better-Than-All-Theorem
Lecture Notes in Computer Science, 1998
2001
The aim of this paper is to propose a simple procedure that a priori determines a minimum number of classifiers to combine in order to obtain a prediction accuracy level similar to the one obtained with the combination of larger ensembles. The procedure is based on the McNemar non-parametric test of significance. Knowing a priori the minimum size of the classifier ensemble giving the best prediction accuracy saves time and memory, especially for huge databases and real-time applications. Here we applied this procedure to four multiple classifier systems with the C4.5 decision tree (Breiman's Bagging, Ho's Random Subspaces, their combination which we labeled 'Bagfs', and Breiman's Random Forests) and five large benchmark databases. It is worth noticing that the proposed procedure may easily be extended to base learning algorithms other than a decision tree as well. The experimental results showed that it is possible to limit significantly the number...
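To make the McNemar-based stopping idea concrete, here is a hedged sketch of the test itself: it compares the predictions of a smaller ensemble against a larger one on the same samples and asks whether their accuracies differ significantly. The chi-squared statistic with continuity correction and the 0.05 critical value are standard; the decision rule around them is an illustrative assumption, not the paper's exact procedure.

```python
# McNemar's test sketch: b counts samples classifier A gets right while B
# gets wrong, c counts the reverse; correct-on-both and wrong-on-both
# samples carry no information about the accuracy difference.


def mcnemar_statistic(pred_a, pred_b, truth):
    """Chi-squared statistic with continuity correction, df = 1."""
    b = sum(pa == t and pb != t for pa, pb, t in zip(pred_a, pred_b, truth))
    c = sum(pa != t and pb == t for pa, pb, t in zip(pred_a, pred_b, truth))
    if b + c == 0:  # the two classifiers never disagree in outcome
        return 0.0
    return (abs(b - c) - 1) ** 2 / (b + c)


def same_accuracy(pred_a, pred_b, truth, critical=3.841):
    """True if the accuracy difference is not significant at alpha = 0.05
    (3.841 is the chi-squared critical value for one degree of freedom)."""
    return mcnemar_statistic(pred_a, pred_b, truth) < critical
```

In the stopping procedure, one would grow the ensemble until `same_accuracy` holds between the current ensemble and a much larger reference one, and take that size as the minimum.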
Lecture Notes in Computer Science, 2010
International Journal for Scientific Research and Development, 2015
Int. J. Comput. Syst. Signals, 2000
Knowledge-Based Systems, 2019
Lecture Notes in Computer Science, 2005
WSEAS TRANSACTIONS ON SYSTEMS AND CONTROL, 2021
Applied Sciences
Statistics Surveys, 2009
International Journal of Emerging Trends in Engineering Research, 2022
Statistical Analysis and Data Mining: The ASA Data Science Journal, 2020
Random Forests and Decision Trees , 2012
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
The Scientific World Journal, 2015