Dataset

Basics

A generic data structure with entries and variables.

The concept of dataset is a key element for machine learning, post-processing, data analysis, …

A Dataset is a pandas MultiIndex DataFrame storing series of data representing the values of multidimensional features belonging to different groups of features.

A Dataset can be set either from a file (from_csv() and from_txt()) or from a NumPy array (from_array()), and can be enriched from a group of variables (add_group()) or from a single variable (add_variable()).

An AbstractFullCache or an OptimizationProblem can also be exported to a Dataset using the methods AbstractFullCache.to_dataset() and OptimizationProblem.to_dataset().

Visualization

Datasets post-processing.

A DatasetPlot is a generic graphical representation of a Dataset that can be displayed on screen or saved to a file. The different visualization tools (OptPostProcessor, SensitivityAnalysis.plot(), …) proposed by GEMSEO rely more and more on it to factorize the code, separate data and graph generation and standardize the visualizations.

As an example, RadarChart implements a radar chart and is used by both RadarChart to visualize the constraints stored in a Database and SensitivityAnalysis.plot_radar() to visualize the sensitivity indices generated by a SensitivityAnalysis.

Problems

Examples of dataset.

GEMSEO proposes several datasets containing academic data to illustrate its capabilities:

Examples

Creation

Convert a cache to a dataset

Convert a cache to a dataset

Convert a database to a dataset

Convert a database to a dataset

Dataset

Dataset

Dataset from a NumPy array

Dataset from a NumPy array

The input-output dataset

The input-output dataset

The optimisation dataset

The optimisation dataset

Use cases

Burgers dataset

Burgers dataset

Iris dataset

Iris dataset

Rosenbrock dataset

Rosenbrock dataset

Visualization

Andrews curves

Andrews curves

Bars

Bars

Boxplot

Boxplot

Customize with matplotlib

Customize with matplotlib

Lines

Lines

Parallel coordinates

Parallel coordinates

Radar chart

Radar chart

Scatter

Scatter

Scatter matrix

Scatter matrix

YvsX

YvsX

ZvsXY

ZvsXY

Gallery generated by Sphinx-Gallery