Polynomial chaos expansion¶

The polynomial chaos expansion (PCE) discipline expresses the model output as a weighted sum of polynomial functions which are orthonormal in the stochastic input space spanned by the random input variables:

\[Y = w_0 + w_1\phi_1(X) + w_2\phi_2(X) + ... + w_K\phi_K(X)\]

where \(\phi_i(x)=\phi_{\tau_1(i),1}(x_1)\times\ldots\times \phi_{\tau_d(i),d}(x_d)\).

Enumerating strategy¶

The choice of the function \(\tau=(\tau_1,\ldots,\tau_d)\) is an enumerating strategy and \(\tau_j(i)\) is the polynomial degree of \(\psi_{\tau_j(i),j}\).

Distributions¶

PCE are stochastic models whose inputs are random variables and are often used to deal with uncertainty quantification problems.

If \(X_j\) is a Gaussian random variable, \((\psi_{ij})_{i\geq 0}\) is the Legendre basis. If \(X_j\) is an uniform random variable, \((\psi_{ij})_{i\geq 0}\) is the Hermite basis.

When the problem is deterministic, we can still use PCE under the assumptions that the random variables are independent uniform random variables. Then, the orthonormal basis function is the Hermite basis.

Degree¶

The degree \(P\) of a PCE is defined in such a way that \(\text{degree}(\psi_i)=\sum_{j=1}^d\tau_j(i)\leq P\).

Estimation¶

The coefficients \((w_1, w_2, ..., w_K)\) and the intercept \(w_0\) are estimated either by least square regression, sparse least square regression or quadrature.

Dependence¶

The PCE model relies on the FunctionalChaosAlgorithm class of the openturns library.

class gemseo.mlearning.regression.pce.PCERegression(data, probability_space, discipline=None, transformer=None, input_names=None, output_names=None, strategy='LS', degree=2, n_quad=None, stieltjes=True, sparse_param=None)[source]

Polynomial chaos expansion.

Constructor.

Parameters

data (Dataset) – learning dataset
probability_space (ParameterSpace) – probability space.
discipline (MDODiscipline) – discipline to evaluate if strategy=’Quad’ and data is empty.
transformer (dict(str)) – transformation strategy for data groups. If None, do not transform data. Default: None.
input_names (list(str)) – names of the input variables.
output_names (list(str)) – names of the output variables.
strategy (str) – strategy to compute the parameters of the PCE, either ‘LS’, ‘Quad’ or ‘SparseLS’. Default: ‘LS’.
degree (int) – polynomial degree of the PCE
n_quad (int) – number of quadrature points
stieltjes (bool) – stieltjes
sparse_param –
Parameters for the Sparse Cleaning Truncation Strategy and/or hyperbolic truncation of the initial basis:
- max_considered_terms (int) – Maximum Considered Terms,
- most_significant (int), Most Significant number to retain,
- significance_factor (float), Significance Factor,
- hyper_factor (float), factor for hyperbolic truncation strategy.

Example