This script generates scatterplot matrices for an arbitrary number of data columns, given as a pandas dataframe. The user can choose which columns are used for color-coding the scatterplots, separately for the upper and lower triangle of the matrix. Moreover, the user can chose a data transformation (Percentiles, standardization, column-wise (0,1) scaling)
Exemplary results using the Boston House Prices Dataset (https://scikit-learn.org/stable/datasets/index.html#boston-dataset)
use_ranks=True:
transform_to_01=True:
standardize=True:
no transformation:
This script generates combined scatterplot-crosscorrelation matrices for an arbitrary number of data columns, given as a pandas dataframe. The user can choose which columns are used for color-coding the scatterplots, in the upper triangle of the matrix, the lower triangle contains the crosscorrelation matrix using Pearson's correlation coefficient. Moreover, the user can chose a data transformation (Percentiles, standardization, column-wise (0,1) scaling) for the scatterplots.
Exemplary results using the Boston House Prices Dataset (https://scikit-learn.org/stable/datasets/index.html#boston-dataset)
use_ranks=True:
transform_to_01=True:
standardize=True:
no transformation: