Added group_by parameter to util_corr_fit() #69

emcfalls · 2024-01-16T15:08:30Z

No description provided.

…r_triangle function

…nces where the group_by variable(s) are different for the synthetic and actual data

awunderground

What happens if a group exists in one data set but not the other data set?
I like how you restructured the output of util_corr(), but it broke ALL of the tests. We need to rewrite the tests to reference the new output structure.

awunderground · 2024-05-10T17:44:46Z

R/util_corr_fit.R

-
-  # reorder data names
+  # reorder data names (this appears to check if the variables are the same)
+  # issue when the groups in the synthetic data do not match the groups in the og data, and vice versa


"og data" may be a little casual for our roxygen headers...

awunderground · 2024-05-10T17:45:29Z

R/util_corr_fit.R

+    dplyr::group_split(dplyr::across({{ group_by }}))
+
+  groups <- lapply(data, function(x) dplyr::select(x, {{ group_by }}) |>
+                     slice(1))


dplyr::slice() instead of just slice().

Can you replace this with count(data, groups)?

This code is to add the group by variables to the final datasets. I can add additional code to add the Ns to the metric data. I need to think more about how to add it to the corr_data dataset.

count(data, {{ group_by }}) will return a data frame with the groups and the frequency of the groups that you can plug into bind_cols() below.

awunderground · 2024-05-10T17:47:23Z

R/util_corr_fit.R

+    return(list(
+      corr_data,
+      metrics
+    ))


return( list( corr_data, metrics ) )

awunderground · 2024-05-10T17:48:39Z

R/util_corr_fit.R

  data <- dplyr::select(data, names(synthetic_data))

+  synthetic_data <- dplyr::select(synthetic_data, dplyr::where(is.numeric), {{ group_by }})  |>


We're still using %>% instead of |> now to make sure the code is backwards compatible with R < 4.0.0.

awunderground · 2024-05-10T17:49:58Z

R/util_corr_fit.R

+                  difference = .data$original - .data$synthetic,
+                  proportion_difference = .data$difference / .data$original)
+
+  correlation_data <- bind_cols(correlation_data, groups)


dplyr::binds_cols()

awunderground · 2024-05-10T17:52:09Z

R/util_corr_fit.R

+      correlation_fit = map_dbl(results, "correlation_fit"),
+      correlation_difference_mae = map_dbl(results, "correlation_difference_mae"),
+      correlation_difference_rmse = map_dbl(results, "correlation_difference_rmse"),


purrr::map_dbl()

awunderground · 2024-05-10T17:52:47Z

R/util_corr_fit.R

+      correlation_fit = map_dbl(results, "correlation_fit"),
+      correlation_difference_mae = map_dbl(results, "correlation_difference_mae"),
+      correlation_difference_rmse = map_dbl(results, "correlation_difference_rmse"),
+      bind_rows(groups)


dplyr::bind_rows()

awunderground · 2024-05-10T17:53:19Z

R/util_corr_fit.R

+      bind_rows(groups)
+    )
+
+    corr_data <- dplyr::bind_rows(map_dfr(results, "correlation_data"))


purrr::map_dfr()

emcfalls added 10 commits December 7, 2023 14:48

updated util_corr_fit outcome

ac10a71

added group_by param

ecf5cd2

added way to group by a variable for the data table

ea65fe4

extending group_by function to include all outputs

39cf538

fixed is.null() issue

543d063

changed param to group_by

731cb64

allowing the function to group by multiple variables, issue with lowe…

e8a292f

…r_triangle function

finished adding group_by param

6423b3a

cleaned up function slightly, added tests for util_corr_fit()

b2cccd6

finished formatting code and testing util_corr_fit

7a57123

emcfalls requested a review from awunderground January 16, 2024 15:08

updated code for util_corr_fit -> new code does not account for insta…

8309fc4

…nces where the group_by variable(s) are different for the synthetic and actual data

awunderground changed the base branch from version0.0.2 to version0.0.4 May 10, 2024 17:37

awunderground requested changes May 10, 2024

View reviewed changes

updated util_corr_fit code and tests

e42d840

awunderground mentioned this pull request Jul 16, 2024

General multiple replicate support for pointwise statistic distributions #86

Open

8 tasks

Base automatically changed from version0.0.4 to main October 30, 2024 19:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added group_by parameter to util_corr_fit() #69

Added group_by parameter to util_corr_fit() #69

emcfalls commented Jan 16, 2024

awunderground left a comment

awunderground May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

emcfalls May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

awunderground May 10, 2024

		data <- dplyr::select(data, names(synthetic_data))

		synthetic_data <- dplyr::select(synthetic_data, dplyr::where(is.numeric), {{ group_by }}) \|>

Added group_by parameter to util_corr_fit() #69

Are you sure you want to change the base?

Added group_by parameter to util_corr_fit() #69

Conversation

emcfalls commented Jan 16, 2024

awunderground left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment