Baptiste Nusaibah validation #11

BaptisteArchambaud · 2024-09-23T14:32:13Z

Add functions to validate the ae_table_grade function

into baptiste-validation

DanChaltiel

Very good start, this is great!

We need to discuss the output.
Eventually, it needs to have the structure of a package test, so it should use the testthat syntax I showed you.
https://github.com/Oncostat/grstat/blob/main/tests/testthat/test-ae-tables.R
This syntax has no nuance: either the test passes or it fails.

Proposal:

use expect_xxx() to test differences in N and pct (major difference)
use message() to flag style differences (minor difference), such as information missing from one table but not the other
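
As an illustration of the proposal above, here is a minimal sketch. The helper name `compare_tables()` and the toy tables are hypothetical and are not taken from the PR; the sketch only assumes that both tables expose `N` and `pct` columns.

```r
library(testthat)

# Hypothetical helper: hard expectations on N and pct, soft message for style gaps.
compare_tables <- function(tab_r, tab_sas) {
  # Minor difference: a column present in the R table but not in the SAS table.
  only_r <- setdiff(names(tab_r), names(tab_sas))
  if (length(only_r) > 0) {
    message("Columns only in the R table: ", paste(only_r, collapse = ", "))
  }
  # Major difference: counts and percentages must agree
  # (within the default tolerance of expect_equal()).
  expect_equal(tab_r$N, tab_sas$N)
  expect_equal(tab_r$pct, tab_sas$pct)
}

# Toy data standing in for the private outputs (assumption).
tab_r   <- data.frame(grade = 1:3, N = c(5, 3, 2), pct = c(50, 30, 20), label = "AE")
tab_sas <- data.frame(grade = 1:3, N = c(5, 3, 2), pct = c(50, 30, 20))

test_that("R and SAS tables agree on N and pct", {
  compare_tables(tab_r, tab_sas)
})
```

With this split, a mismatch in `N` or `pct` fails the test, while the extra `label` column is only reported.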

However, it is hard to read the code without seeing it applied, since the data are private.
Would it be complicated to do what is in #9 so that everything can be done on GitHub?

DanChaltiel · 2024-09-23T14:37:14Z

R/validation fonctions AE_table_grade/compair_grade.R

These will be testing functions, so they must not go in R/.
Use usethis::use_test_helper()
You can also look at the testthat documentation: https://cran.r-project.org/web/packages/testthat/vignettes/special-files.html


DanChaltiel · 2024-09-23T14:37:49Z

R/validation fonctions AE_table_grade/compair_grade.R

+
+  if (ncol(tabR)!=ncol(tabSAS)){stop("Different number of arm")}
+  if (all(dim(tabR)==dim(tabSAS))){
+    print("Check: same dimension of tables")


Defensive programming: we don't print when everything is fine, we warn when there is a problem.
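
A minimal sketch of that principle, using a hypothetical `check_dims()` helper (not a function from the PR): it stays silent when the dimensions match and raises a warning otherwise.

```r
# Hypothetical helper: silent on success, warns on mismatch.
check_dims <- function(tabR, tabSAS) {
  same <- identical(dim(tabR), dim(tabSAS))
  if (!same) {
    warning(
      "Tables have different dimensions: ",
      paste(dim(tabR), collapse = " x "), " (R) vs ",
      paste(dim(tabSAS), collapse = " x "), " (SAS)"
    )
  }
  # Return the result invisibly so that nothing is printed when all is well.
  invisible(same)
}

# Toy tables (assumption): same columns, different number of rows.
tab_a <- data.frame(grade = 1:2, N1 = c(5, 3))
tab_b <- data.frame(grade = 1:3, N1 = c(5, 3, 2))

check_dims(tab_a, tab_a)                    # no output
suppressWarnings(check_dims(tab_a, tab_b))  # would warn if not suppressed
```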

DanChaltiel · 2024-09-23T14:38:05Z

R/validation fonctions AE_table_grade/compair_grade.R

+  if (all(dim(tabR)==dim(tabSAS))){
+    print("Check: same dimension of tables")
+    df=tabR%>%arrange(grade)%>%full_join(tabSAS,by="grade",suffix = c(".r",".sas"))
+    indice=which(df[,paste0(tabR%>%select(-grade)%>%colnames(),paste=".r")]!=df[,paste0(tabSAS%>%select(-grade)%>%colnames(),paste=".sas")],


I really don't like indices; I find them error-prone.
See my comment on Teams.
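
The Teams comment is not available here, so the following is only one possible index-free alternative, not necessarily what was suggested there. It reshapes both tables to long format and joins them by name, so each difference is identified by its grade and column rather than by position. The toy tables and the `to_long()` helper are assumptions.

```r
library(dplyr)
library(tidyr)

# Toy tables (assumption): one count differs for grade 3 in arm 1.
tabR   <- tibble(grade = c(1, 2, 3), N1 = c(5, 3, 2), N2 = c(4, 4, 2))
tabSAS <- tibble(grade = c(1, 2, 3), N1 = c(5, 3, 1), N2 = c(4, 4, 2))

# Hypothetical helper: one row per (grade, column) pair.
to_long <- function(tab) {
  pivot_longer(tab, -grade, names_to = "column", values_to = "value")
}

differences <- full_join(
  to_long(tabR), to_long(tabSAS),
  by = c("grade", "column"), suffix = c(".r", ".sas")
) %>%
  # Keep cells that differ, or that are missing in exactly one table.
  filter(xor(is.na(value.r), is.na(value.sas)) | (value.r != value.sas) %in% TRUE)

differences
```

Each remaining row names the grade and the column where the two outputs disagree.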

DanChaltiel · 2024-09-23T14:38:30Z

R/validation fonctions AE_table_grade/group_grades_zeroNA.R

+    mutate(grade = replace_na(grade, 0)) %>% 
+    group_by(grade) %>% 
+    mutate(across(starts_with("N"), ~sum(., na.rm = T))) %>% 
+    distinct(grade, .keep_all = T) %>% 


Can't we replace mutate+distinct with summarise?

I think it's because I couldn't manage to keep all the variables in the dataset with summarize(.by=) when there were several arms.
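
One way to keep the other columns with `summarise(.by = )` is to summarise them explicitly, for instance with `first()`, which mirrors what `distinct(.keep_all = TRUE)` retains. This is a sketch on toy data (assumption), with two arms named `N1`/`pct1` and `N2`/`pct2`.

```r
library(dplyr)
library(tidyr)

# Toy data (assumption): the NA grade is merged with grade 0.
data <- tibble(
  grade = c(NA, 0, 1, 2),
  N1 = c(2, 3, 3, 2), pct1 = c(20, 30, 30, 20),
  N2 = c(1, 3, 4, 2), pct2 = c(10, 30, 40, 20)
)

# Approach from the diff: mutate() within groups, then keep the first row per grade.
via_mutate <- data %>%
  mutate(grade = replace_na(grade, 0)) %>%
  group_by(grade) %>%
  mutate(across(starts_with("N"), ~sum(.x, na.rm = TRUE))) %>%
  distinct(grade, .keep_all = TRUE) %>%
  ungroup()

# Possible replacement: sum the N columns, keep the first value of the others.
via_summarise <- data %>%
  mutate(grade = replace_na(grade, 0)) %>%
  summarise(
    across(starts_with("N"), ~sum(.x, na.rm = TRUE)),
    across(!starts_with("N"), first),
    .by = grade
  )
```

Both results hold the same values; only the column order differs, since `summarise()` lists the columns in the order they are computed.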

DanChaltiel · 2024-09-23T14:49:39Z

R/validation fonctions AE_table_grade/separate_n_pct.R

+  data <- colnames(data) %>% 
+    imap(
+      ~data %>% select(all_of(.x)) %>% 
+        separate(.x, into = c(paste0("N", .y), paste0("pct", .y)), sep = "\\(")
+    ) %>% 
+    bind_cols()


https://tidyr.tidyverse.org/reference/separate_wider_delim.html
We should be able to manage without a loop using separate_wider_regex(); it is not obvious, but the example helps a lot.

DanChaltiel · 2024-09-23T14:50:01Z

R/validation fonctions AE_table_grade/group_grades_zeroNA.R

+  ngroups <- (ncol(data) - 1) / 2
+  for(i in 1:ngroups){
+    npatients <- sum(data[, paste0("N", i)])
+    data[data$grade == 0, paste0("pct", i)] <- round(data[data$grade == 0, paste0("N", i)] * 100 / npatients, round)
+  }


I will need to run the code to figure out how to apply purrr here.
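
Pending that, here is one loop-free possibility. It is not a purrr solution; it swaps in a reshape to long format so that the percentage is computed once per arm. The toy data and the `digits` variable (standing in for the `round` argument of the original function) are assumptions.

```r
library(dplyr)
library(tidyr)

# Toy data (assumption): the grade 0 percentages are still missing.
data <- tibble(
  grade = c(0, 1, 2),
  N1 = c(5, 3, 2), pct1 = c(NA, 30, 20),
  N2 = c(4, 4, 2), pct2 = c(NA, 40, 20)
)
digits <- 1  # stands in for the `round` argument of the original function

result <- data %>%
  # One row per grade and arm, with columns N and pct.
  pivot_longer(-grade, names_to = c(".value", "arm"), names_pattern = "(N|pct)(\\d+)") %>%
  # Recompute the grade 0 percentage from the arm total.
  mutate(
    pct = if_else(grade == 0, round(N * 100 / sum(N), digits), pct),
    .by = arm
  ) %>%
  # Back to one column per arm: N1, N2, pct1, pct2.
  pivot_wider(names_from = arm, values_from = c(N, pct), names_sep = "")
```

The number of arms is never hard-coded, so `ngroups` is no longer needed.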

DanChaltiel · 2024-09-23T14:50:20Z

R/validation fonctions AE_table_grade/group_grades_zeroNA.R

+    mutate(grade = replace_na(grade, 0)) %>% 
+    group_by(grade) %>% 


mutate(.by=grade) is more concise and does not require ungroup()
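
For comparison, a short sketch of both forms on toy data (assumption). The `.by` argument requires dplyr 1.1.0 or later.

```r
library(dplyr)

# Toy data (assumption): two rows share grade 0.
data <- tibble(grade = c(0, 0, 1), N1 = c(2, 3, 4))

# Persistent grouping: needs an explicit ungroup() afterwards.
grouped <- data %>%
  group_by(grade) %>%
  mutate(N1 = sum(N1)) %>%
  ungroup()

# Per-operation grouping: the result is never grouped.
by_arg <- data %>%
  mutate(N1 = sum(N1), .by = grade)
```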

DanChaltiel · 2024-09-23T14:53:39Z

R/validation fonctions AE_table_grade/compair_grade.R

+  if (nrow(tabR)!=nrow(tabSAS)){stop("Different number of grade levels")
+  }


either on one line, without the {},
or on 3 lines,
never on 2
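
The two accepted layouts, shown side by side inside a hypothetical `check_shape()` wrapper (the wrapper and toy tables are assumptions; the conditions come from the diff):

```r
# Hypothetical wrapper used only to show the two accepted layouts.
check_shape <- function(tabR, tabSAS) {
  # Layout 1: one line, no braces.
  if (nrow(tabR) != nrow(tabSAS)) stop("Different number of grade levels")

  # Layout 2: three lines, with braces.
  if (ncol(tabR) != ncol(tabSAS)) {
    stop("Different number of arm")
  }

  invisible(TRUE)
}

# Toy tables (assumption).
tab_a <- data.frame(grade = 1:2, N1 = c(5, 3))
tab_b <- data.frame(grade = 1:3, N1 = c(5, 3, 2))
check_shape(tab_a, tab_a)
```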

DanChaltiel · 2024-11-12T14:55:41Z

R/utils_outputs_ae_table_grade.R

+  data <- colnames(data) %>% 
+    imap(
+      ~data %>% select(all_of(.x)) %>% 
+        separate(.x, into = c(paste0("N", .y), paste0("pct", .y)), sep = "\\(")
+    ) %>% 
+    bind_cols()
+
+  #extraction of figures into numeric columns
+  data <- data %>% 
+    mutate(
+      across(everything(), ~as.numeric(str_extract(.x, "\\d+\\.?\\d*")))
+    )


Nice use of imap() :-)
There is a new tidyr function that would also do the job (to be adapted to several arms, this is just an example):

data %>% separate_wider_regex(cols=-c(.id, label, variable, grade), patterns=c(N="\\d+", " \\(", pct="\\d+", "%\\)"))
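
A possible adaptation to several arms, as a sketch on toy data. The column names `arm1`/`arm2` and the cell format `"N (pct%)"` are assumptions. When several columns are split at once, `names_sep` is needed so that each output column is prefixed by its source column.

```r
library(dplyr)
library(tidyr)

# Toy data (assumption): one "N (pct%)" column per arm.
data <- tibble(
  grade = c(1, 2),
  arm1 = c("5 (50%)", "3 (30%)"),
  arm2 = c("4 (40%)", "4 (40.5%)")
)

out <- data %>%
  separate_wider_regex(
    cols = -grade,
    # Named patterns become columns; unnamed ones are matched and dropped.
    patterns = c(N = "\\d+", " \\(", pct = "\\d+\\.?\\d*", "%\\)"),
    names_sep = "_"
  ) %>%
  # The extracted pieces are character strings; convert them to numbers.
  mutate(across(-grade, as.numeric))
```

The result has columns `arm1_N`, `arm1_pct`, `arm2_N` and `arm2_pct`, which replaces both the `imap()` split and the later `str_extract()` step.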

github-actions bot and others added 4 commits September 23, 2024 14:05

Update dev version (Github Actions)

63521d9

validation functions added for ae_table_grade

15f314a

Merge branch 'baptiste-validation' of https://github.com/Oncostat/grstat

b3c2fb8

into baptiste-validation

Update dev version (Github Actions)

2af87f8

DanChaltiel added this to the v0.2 milestone Sep 23, 2024

DanChaltiel requested changes Sep 23, 2024


NusaibahIbr changed the title ~~Baptiste nusaibah validation~~ Baptiste Nusaibah validation Oct 24, 2024

BaptisteArchambaud added 4 commits October 29, 2024 12:10

updating structure of AE_table_grade validation files

6a4b9e7

Merge branch 'main' into baptiste-nusaibah-validation

c39b6fd

Update outputs_ae_table_grade.R

7d743c5

ae_table_soc validation - formatting SAS and R outputs

e5d6c1a

DanChaltiel reviewed Nov 12, 2024


NusaibahIbr added 3 commits November 19, 2024 14:20

compare_grade output table

99e08ff

output of the 2 concordant tables

dcb1467

create compare_soc (copy compare_grade)

e5447f8
