Random Rotation Random Forests, as proposed in this paper, are implemented using sklearn's Random Forest classes.
Below is a demonstration of decision boundaries for vanilla Random Forests and Random Rotation Random Forests. Both models were trained with parameters max_depth=4, n_estimators=200 and max_features=1.
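The idea is that each base learner sees the feature space under its own random orthogonal transformation. As a minimal sketch (not rrsklearn's actual internals), a uniformly distributed random rotation matrix can be drawn with the standard QR-based construction:

```python
import numpy as np

def random_rotation(dim, seed=None):
    """Draw a random rotation matrix (uniform over SO(dim)) via the QR
    decomposition of a Gaussian matrix, with sign correction."""
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((dim, dim))
    Q, R = np.linalg.qr(A)
    # Fix the signs of R's diagonal so Q is uniform over orthogonal matrices
    Q *= np.sign(np.diag(R))
    # Force a proper rotation (determinant +1, not a reflection)
    if np.linalg.det(Q) < 0:
        Q[:, 0] = -Q[:, 0]
    return Q
```

Applying X @ random_rotation(X.shape[1]) before fitting a tree ensemble lets the axis-aligned splits cut the data along directions other than the original feature axes.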
Gradient Boosting Machines can also benefit from random rotations. Here vanilla and Random Rotation GBM (columns 2 and 4) were trained with parameters max_depth=4, n_estimators=200 and max_features=1. The rotated ensemble version consists of 40 different GBMs, each with a unique random rotation, trained with hyperparameters max_depth=4, n_estimators=5 and max_features=1. The resulting ensemble thus consists of 200 (40x5) unique trees.
Example of using RRRandomForestClassifier:
from rrsklearn import RRRandomForestClassifier
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
# Generate sample data
X, y = make_moons(n_samples=250, noise=0.1)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3)
# Standardize data (fit the scaler on the training set only to avoid leakage)
scaler = StandardScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)
# Fit RRRF
rrrf = RRRandomForestClassifier(n_estimators=200, max_depth=2)
rrrf.fit(X_train, y_train)
# Score on testing set
rrrf.score(X_test, y_test)
Example of using the RandomRotation transformer for GBM ensembles:
from rrsklearn import RandomRotation
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.ensemble import VotingClassifier
from sklearn.pipeline import make_pipeline
rrgbm = VotingClassifier(
    [(f'gbm{i}', make_pipeline(
        RandomRotation(),
        GradientBoostingClassifier(max_depth=4, n_estimators=5, max_features=1)
    )) for i in range(40)],
    voting='soft'
)
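If rrsklearn is not installed, a transformer with the same shape can be sketched with plain numpy. This is a hypothetical stand-in, not rrsklearn's implementation: it fits one fixed random rotation matrix and applies it to every sample, which is all the pipeline above relies on.

```python
import numpy as np

class SimpleRandomRotation:
    """Hypothetical stand-in for rrsklearn's RandomRotation: fit() draws a
    fixed random rotation matrix, transform() applies it to the samples."""

    def __init__(self, random_state=None):
        self.random_state = random_state

    def fit(self, X, y=None):
        rng = np.random.default_rng(self.random_state)
        dim = np.asarray(X).shape[1]
        A = rng.standard_normal((dim, dim))
        Q, R = np.linalg.qr(A)
        Q *= np.sign(np.diag(R))   # sign correction for uniformity
        if np.linalg.det(Q) < 0:   # ensure a proper rotation, det = +1
            Q[:, 0] = -Q[:, 0]
        self.rotation_ = Q
        return self

    def transform(self, X):
        # Rotating preserves distances, so tree splits see the same geometry
        # from a different set of axes
        return np.asarray(X) @ self.rotation_

    def fit_transform(self, X, y=None):
        return self.fit(X, y).transform(X)
```

Because it only duck-types fit/transform/fit_transform, it can be dropped into make_pipeline in place of RandomRotation in the VotingClassifier example above.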