measures of fit and their standard names #44

corybrunson · 2022-08-13T14:36:36Z

background

Gower &al (2011) detail several measures of fit for biplots, most prominently

the quality of the $r$-dimensional biplot, measured as the proportion of variance in the plot, calculated as the quotient of the traces of $\Lambda_r = {D_r}^2$ and of $\Lambda = D^2$.
the adequacy of the representation of the $j$-th row (respectively, column) in the $r$-dimensional biplot, calculated as the $j$-th diagonal element of $U_r\ {U_r}^\top$ (respectively, $V_r\ {V_r}^\top$), understood as the fidelity of the projections of the standard coordinates.
the predictivity of the $j$-th row (respectively, column) in the $r$-dimensional biplot, measured as the quotient of the $j$-th diagonal elements of $U_r\ \Lambda_r\ {U_r}^\top$ and of $U\ \Lambda\ U^\top$ (respectively, of $V_r\ \Lambda_r\ {V_r}^\top$ and of $V\ \Lambda\ V^\top$), understood as the fidelity of the projections of the principal coordinates.

These can be calculated directly from any SVD or EVD and interpreted for any technique based on them. In some cases they may also be calculated for supplementary elements.

suggestions

A new .quality column, calculated as cumsum(.prop_var), could be added to the output of tidy.tbl_ord().
The 1- or 2-dimensional adequacy and predictivity could be computed for all wrapped classes by augment_ord(), possibly via an option measures_of_fit = TRUE. (It would not be appropriate to annotate a tbl_ord with all $n \times k$ or $p \times k$ adequacies or predictivities.)
Adequacy and predictivity for a specific value of $r$ could be computed in a mutate_*() call, e.g. mutate_rows(ord, fit_std = adequacy(dimension = 2L)), where adequacy() knows and is able to recover the necessary model components (cf. computing node and edge properties in tidygraph).

The value of (1) is, i think, self-evident. Probably only one of (2) and (3) would be appropriate, and i lean toward (3). Either would be valuable both (a) for downstream analysis of rows and columns and (b) as aesthetic mappings in biplots (e.g. to increase marker/vector opacity with predictivity/adequacy).

implementation

(2) and (3) would be supported by new recovery generics, possibly for the matrices of standard and of principal coordinates. (3) would probably require registration of the underlying model object within the wrapper, as in tidygraph.

The text was updated successfully, but these errors were encountered:

corybrunson · 2022-08-13T16:35:18Z

Item (1) is done in fd71caf.

corybrunson added the enhancement New feature or request label Aug 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

measures of fit and their standard names #44

measures of fit and their standard names #44

corybrunson commented Aug 13, 2022 •

edited

Loading

corybrunson commented Aug 13, 2022

measures of fit and their standard names #44

measures of fit and their standard names #44

Comments

corybrunson commented Aug 13, 2022 • edited Loading

background

suggestions

implementation

corybrunson commented Aug 13, 2022

corybrunson commented Aug 13, 2022 •

edited

Loading