Advanced mixture of experts#

from __future__ import annotations

from gemseo import create_benchmark_dataset
from gemseo.mlearning import create_regression_model
from gemseo.mlearning.classification.quality.f1_measure import F1Measure
from gemseo.mlearning.clustering.quality.silhouette_measure import SilhouetteMeasure
from gemseo.mlearning.regression.quality.mse_measure import MSEMeasure

In this example, we seek to estimate the Rosenbrock function from the RosenbrockDataset.

dataset = create_benchmark_dataset("RosenbrockDataset", opt_naming=False)

For that purpose, we will use an MOERegressor in an advanced way: we will not set the clustering, classification and regression algorithms but select them according to their performance from several candidates that we will provide. Moreover, for a given candidate, we will propose several settings, compare their performances and select the best one.

Initialization#

First, we initialize an MOERegressor with soft classification by means of the high-level machine learning function create_regression_model().

model = create_regression_model("MOERegressor", dataset, hard=False)

Clustering#

Then, we add two clustering algorithms with different numbers of clusters (called components for the Gaussian Mixture) and set the SilhouetteMeasure as clustering measure to be evaluated from the training dataset. During the learning stage, the mixture of experts will select the clustering algorithm and the number of clusters minimizing this measure.

model.set_clustering_measure(SilhouetteMeasure)
model.add_clusterer_candidate("KMeans", n_clusters=[2, 3, 4])
model.add_clusterer_candidate("GaussianMixture", n_clusters=[3, 4, 5])

Classification#

We also add classification algorithms with different settings and set the F1Measure as classification measure to be evaluated from the training dataset. During the learning stage, the mixture of experts will select the classification algorithm and the settings minimizing this measure.

model.set_classification_measure(F1Measure)
model.add_classifier_candidate("KNNClassifier", n_neighbors=[3, 4, 5])
model.add_classifier_candidate("RandomForestClassifier", n_estimators=[100])

Regression#

We also add regression algorithms and set the MSEMeasure as regression measure to be evaluated from the training dataset. During the learning stage, for each cluster, the mixture of experts will select the regression algorithm minimizing this measure.

model.set_regression_measure(MSEMeasure)
model.add_regressor_candidate("LinearRegressor")
model.add_regressor_candidate("RBFRegressor")

Note

We could also add candidates for some learning stages, e.g. clustering and regression, and set the machine learning algorithms for the remaining ones, e.g. classification.

Training#

Lastly, we learn the data and select the best machine learning algorithm for both clustering, classification and regression steps.

model.learn()

INFO - 16:21:38: Selected clusterer:
INFO - 16:21:38:    KMeans(n_clusters=4, parameters={}, random_state=0, transformer={}, var_names=())
INFO - 16:21:38:       based on the scikit-learn library
INFO - 16:21:38:       built from 100 learning samples
INFO - 16:21:38: Selected classifier:
INFO - 16:21:38:    RandomForestClassifier(input_names=(), n_estimators=100, output_names=['labels'], parameters={}, random_state=0, transformer={})
INFO - 16:21:38:       based on the scikit-learn library
INFO - 16:21:38:       built from 100 learning samples
INFO - 16:21:38: Selected regressor for cluster 0:
INFO - 16:21:38:    RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
INFO - 16:21:38:       based on the SciPy library
INFO - 16:21:38:       built from 28 learning samples
INFO - 16:21:38: Selected regressor for cluster 1:
INFO - 16:21:38:    RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
INFO - 16:21:38:       based on the SciPy library
INFO - 16:21:38:       built from 15 learning samples
INFO - 16:21:38: Selected regressor for cluster 2:
INFO - 16:21:38:    RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
INFO - 16:21:38:       based on the SciPy library
INFO - 16:21:38:       built from 29 learning samples
INFO - 16:21:38: Selected regressor for cluster 3:
INFO - 16:21:38:    RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
INFO - 16:21:38:       based on the SciPy library
INFO - 16:21:38:       built from 28 learning samples

Result#

We can get information on this model, on the sub-machine learning models selected among the candidates and on their selected settings. We can see that a KMeans with four clusters has been selected for the clustering stage, as well as a RandomForestClassifier for the classification stage and a RBFRegressor for each cluster.

model

MOERegressor(hard=False, input_names=(), output_names=(), parameters={}, transformer={'inputs': <gemseo.mlearning.transformers.scaler.min_max_scaler.MinMaxScaler object at 0x7c2f95294ad0>, 'outputs': <gemseo.mlearning.transformers.scaler.min_max_scaler.MinMaxScaler object at 0x7c2f952c13d0>})

built from 100 learning samples
Clustering
- KMeans(n_clusters=4, parameters={}, random_state=0, transformer={}, var_names=())
Classification
- RandomForestClassifier(input_names=(), n_estimators=100, output_names=['labels'], parameters={}, random_state=0, transformer={})
Regression
- Local model 0
  - RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
- Local model 1
  - RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
- Local model 2
  - RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})
- Local model 3
  - RBFRegressor(der_function=None, epsilon=None, function=multiquadric, input_names=(), norm=euclidean, output_names=(), parameters={}, smooth=0.0, transformer={})

Note

By adding candidates, and depending on the complexity of the function to be approximated, one could obtain different regression models according to the clusters. For example, one could use a PolynomialRegressor with order 2 on a sub-part of the input space and a GaussianProcessRegressor on another sub-part of the input space.

Once built, this mixture of experts can be used as any BaseRegressor.