Transform data to improve the ML algorithm quality

Introduction

A transformer to apply operations on NumPy arrays.

The abstract Transformer class implements the concept of a data transformer. Inheriting classes shall implement the Transformer.fit(), Transformer.transform() and possibly Transformer.inverse_transform() methods.
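The contract described above can be sketched as follows. This is a minimal illustration, not the GEMSEO implementation: the names `SketchTransformer` and `Centering` are hypothetical, and only the split between required and optional methods is taken from the text.

```python
from abc import ABC, abstractmethod

import numpy as np


class SketchTransformer(ABC):
    """Hypothetical stand-in for an abstract transformer (not the GEMSEO class)."""

    def __init__(self, name="SketchTransformer"):
        self.name = name

    @abstractmethod
    def fit(self, data):
        """Learn the parameters of the transformation from the data."""

    @abstractmethod
    def transform(self, data):
        """Apply the transformation to the data."""

    def inverse_transform(self, data):
        """Optional: undo the transformation when it is invertible."""
        raise NotImplementedError


class Centering(SketchTransformer):
    """Hypothetical subclass: subtract the column-wise mean."""

    def fit(self, data):
        self.mean_ = data.mean(axis=0)

    def transform(self, data):
        return data - self.mean_

    def inverse_transform(self, data):
        return data + self.mean_


data = np.array([[1.0, 10.0], [3.0, 30.0]])
centering = Centering(name="centering")
centering.fit(data)
centered = centering.transform(data)
restored = centering.inverse_transform(centered)
```

Here `inverse_transform` raises `NotImplementedError` in the base class so that subclasses without a meaningful inverse can simply leave it out.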

Scaling

Scaling a variable with a linear transformation.

The Scaler class implements the default scaling method, applied to a parameter \(z\):

\[\bar{z} := \text{offset} + \text{coefficient}\times z\]

where \(\bar{z}\) is the scaled version of \(z\). This scaling method is a linear transformation parameterized by an offset and a coefficient.

In this default scaling method, the offset is equal to 0 and the coefficient is equal to 1. Consequently, the scaling operation is the identity: \(\bar{z}=z\). Subclasses are expected to override this method to provide a non-trivial offset and coefficient.
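The linear scaling formula can be sketched with NumPy. The class `LinearScaler` below is a hypothetical illustration rather than the GEMSEO Scaler. Its constructor arguments and the inverse formula \(z = (\bar{z} - \text{offset}) / \text{coefficient}\) are assumptions derived from the equation above.

```python
import numpy as np


class LinearScaler:
    """Hypothetical sketch of z_bar = offset + coefficient * z."""

    def __init__(self, offset=0.0, coefficient=1.0):
        self.offset = offset
        self.coefficient = coefficient

    def transform(self, z):
        return self.offset + self.coefficient * z

    def inverse_transform(self, z_bar):
        # Invert the linear map; this requires a non-zero coefficient.
        return (z_bar - self.offset) / self.coefficient


z = np.array([0.0, 5.0, 10.0])

# Default parameters (offset 0, coefficient 1): the scaling is the identity.
identity = LinearScaler()
same = identity.transform(z)

# A non-trivial choice mapping the interval [0, 10] onto [-1, 1].
scaler = LinearScaler(offset=-1.0, coefficient=0.2)
scaled = scaler.transform(z)
recovered = scaler.inverse_transform(scaled)
```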

Dimension reduction

Dimension reduction as a generic transformer.

The DimensionReduction class implements the concept of dimension reduction.

Dependence

This dimension reduction algorithm relies on the PCA class of the scikit-learn library.

class gemseo.mlearning.transform.dimension_reduction.pca.PCA(name='PCA', n_components=None, **parameters)

Principal component dimension reduction algorithm.

Parameters:

name (str) – A name for this transformer. By default it is set to “PCA”.
n_components (int | None) – The number of components of the latent space. If None, use the maximum number allowed by the technique, typically min(n_samples, n_features).
**parameters (str) – The optional parameters for the scikit-learn PCA constructor.
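The default behaviour of n_components can be illustrated with a small helper. The function `resolve_n_components` is hypothetical and is not part of GEMSEO or scikit-learn. The range check is an added assumption. Only the fallback to min(n_samples, n_features) comes from the description above.

```python
import numpy as np


def resolve_n_components(n_components, data):
    """Hypothetical helper returning the latent dimension to use."""
    n_samples, n_features = data.shape
    upper_bound = min(n_samples, n_features)
    if n_components is None:
        # No value given: fall back to the largest admissible dimension.
        return upper_bound
    if not 1 <= n_components <= upper_bound:
        raise ValueError(
            f"n_components must lie in [1, {upper_bound}], got {n_components}."
        )
    return n_components


data = np.zeros((5, 8))  # 5 samples, 8 features
default_dimension = resolve_n_components(None, data)
chosen_dimension = resolve_n_components(3, data)
```

With 5 samples and 8 features, the default latent dimension is bounded by the number of samples.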

compute_jacobian(data)

Compute the Jacobian of transformer.transform().

Parameters: data (ndarray) – The data where the Jacobian is to be computed.
Returns: The Jacobian matrix.
Return type: ndarray

compute_jacobian_inverse(data)

Compute the Jacobian of transformer.inverse_transform().

Parameters: data (ndarray) – The data where the Jacobian is to be computed.
Returns: The Jacobian matrix.
Return type: ndarray
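For a PCA-like projection, both maps are affine, so their Jacobians are constant matrices. The sketch below uses a NumPy SVD as a stand-in for a fitted PCA; it does not call GEMSEO or scikit-learn. It assumes transform is \(x \mapsto (x - \mu) W^\top\), where the rows of \(W\) are the principal components. The shape of the array returned by the actual methods (for instance, one matrix per sample) may differ.

```python
import numpy as np

rng = np.random.default_rng(0)
samples = rng.normal(size=(20, 3))

# Stand-in for a fitted PCA: centre the data and keep 2 right-singular vectors.
mean = samples.mean(axis=0)
_, _, vt = np.linalg.svd(samples - mean, full_matrices=False)
components = vt[:2]  # shape (n_components, n_features)


def transform(x):
    return (x - mean) @ components.T


def inverse_transform(z):
    return z @ components + mean


# Both maps are affine, so the Jacobians do not depend on the evaluation point.
jacobian = components            # d transform / d x, shape (2, 3)
jacobian_inverse = components.T  # d inverse_transform / d z, shape (3, 2)

# Check the first Jacobian against central finite differences at one point.
x0 = samples[0]
step = 1e-6
finite_diff = np.column_stack([
    (transform(x0 + step * e) - transform(x0 - step * e)) / (2 * step)
    for e in np.eye(3)
])
```

Because the retained components are orthonormal, the product of the two Jacobians is the identity on the latent space.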

duplicate()

Duplicate the current object.

Returns: A deep copy of the current instance.
Return type: Transformer
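What a deep copy implies for a fitted transformer can be sketched with the standard `copy` module. The class `MeanShift` is hypothetical. The only behaviour taken from the description above is that duplicate() returns a deep copy.

```python
import copy

import numpy as np


class MeanShift:
    """Hypothetical transformer holding a fitted state (not a GEMSEO class)."""

    def __init__(self):
        self.mean = None

    def fit(self, data):
        self.mean = data.mean(axis=0)

    def duplicate(self):
        # Deep copy: the clone owns its own copy of the fitted arrays.
        return copy.deepcopy(self)


original = MeanShift()
original.fit(np.array([[0.0, 2.0], [2.0, 4.0]]))

clone = original.duplicate()
clone.mean[0] = 100.0  # in-place change on the clone only
```

With a shallow copy, the in-place change would also be visible through `original.mean`, since both objects would share the same array.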

fit(data, *args)

Fit the transformer to the data.

Parameters:

data (ndarray) – The data to be fitted.
args (Union[float, int, str]) –

Return type:

NoReturn

fit_transform(data, *args)

Fit the transformer to the data and transform the data.

Parameters:

data (ndarray) – The data to be transformed.
args (Union[float, int, str]) –

Returns:

The transformed data.

Return type:

ndarray
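The relation between fit(), transform() and fit_transform() can be sketched as follows. The class `Standardizer` is a hypothetical example. It assumes that fit_transform() is equivalent to calling fit() and then transform() on the same data, as the description suggests.

```python
import numpy as np


class Standardizer:
    """Hypothetical transformer: zero mean and unit standard deviation per column."""

    def fit(self, data):
        self.mean = data.mean(axis=0)
        self.std = data.std(axis=0)

    def transform(self, data):
        return (data - self.mean) / self.std

    def fit_transform(self, data):
        self.fit(data)
        return self.transform(data)


training = np.array([[1.0, 100.0], [3.0, 300.0]])

# One call ...
one_step = Standardizer().fit_transform(training)

# ... or two calls giving the same result.
two_steps = Standardizer()
two_steps.fit(training)
same_result = two_steps.transform(training)

# New data are transformed with the parameters learned from the training data.
new_point = two_steps.transform(np.array([[2.0, 200.0]]))
```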

inverse_transform(data)

Perform an inverse transform on the data.

Parameters: data (ndarray) – The data to be inverse transformed.
Returns: The inverse transformed data.
Return type: ndarray

transform(data)

Transform the data.

Parameters: data (ndarray) – The data to be transformed.
Returns: The transformed data.
Return type: ndarray
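A round trip through transform() and inverse_transform() is exact only when the retained components capture all the variability of the data. The sketch below illustrates this with a NumPy SVD used as a stand-in for the PCA. The helper `round_trip` and the synthetic data are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(1)

# 3-D points lying in a 2-D plane: the third feature is the sum of the first two.
latent = rng.normal(size=(50, 2))
data = np.column_stack([latent[:, 0], latent[:, 1], latent[:, 0] + latent[:, 1]])

mean = data.mean(axis=0)
_, _, vt = np.linalg.svd(data - mean, full_matrices=False)


def round_trip(n_components):
    """Project onto the leading components, then map back to the original space."""
    components = vt[:n_components]
    reduced = (data - mean) @ components.T  # analogue of transform
    return reduced @ components + mean      # analogue of inverse_transform


# Two components span the plane, so the reconstruction is exact up to round-off.
error_two = np.abs(round_trip(2) - data).max()

# A single component cannot represent the plane, so information is lost.
error_one = np.abs(round_trip(1) - data).max()
```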

property components: ndarray: The principal components.

property n_components: int: The number of components.

name: str: The name of the transformer.

parameters: str: The parameters of the transformer.

Examples

See the examples about: