fix: update parameters and callsets #34

FelixMoelder · 2023-08-04T08:51:16Z

As we use panel data having high coverage for the mtb the preconfigured max read depth for varlociraptor preprocess was to low. To get a correct estimation the max-depth has been set to 30000 in the config.yaml.

Also considering the read position bias lead to missing variants in the past and therefore will be omitted.

…a-seq-mtb into fix/update_config

johanneskoester · 2023-09-23T11:21:20Z

workflow/resources/config/scenario.yaml

@@ -11,14 +11,20 @@ __definitions__:
    samples = params.samples.set_index("alias")
    if "ffpe" not in samples.columns:
      samples["ffpe"] = pd.NA
-  - sex = samples.loc["tumor", "sex"]
+  - sex = samples.loc[["tumor"], "sex"]


Why the brackets?

In case of groups with just one entry sample.loc["tumor", "sex"] will just return sex as a string.
But if there are multiple entries for a group sex will become a series.
In the previous implementation rendering the scenario only worked for groups with a single entry.
Changing sex to sample.loc[["tumor"], "sex"] will always return a series allowing to render single and multiple entries correctly.

Edit: In your other comment you mentioned that each alias should only occur once. So if we handle multiple panels by prefix this change probably also becomes unnecessary.

johanneskoester · 2023-09-23T13:11:43Z

workflow/resources/config/scenario.yaml

+    if len(samples.loc[["tumor"], "ffpe"].unique()) != 1:
+      raise ValueError(f"All samples within a group must to be either ffpe or not.")
+  - |
+    if len(samples.loc[["tumor"], "purity"].unique()) != 1:


Each alias should occur only once in a group. We should also check for that when validating the sample sheet. If there are two panels for a patient we could name the two tumors tumor_panelname1 and 2. the scenario could support that by looking for the prefix tumor.

FelixMoelder added 4 commits August 4, 2023 10:49

fix: increase max read depth

34a54e1

update release

8ec4865

update github action

dbece4e

Update Snakefile

1a0a08e

FelixMoelder requested a review from johanneskoester August 28, 2023 06:41

FelixMoelder added 6 commits September 21, 2023 07:40

minor changes

c447e75

Merge branch 'fix/update_config' of github.com:snakemake-workflows/dn…

6c48ff1

…a-seq-mtb into fix/update_config

update varlociraptor params

2266cc0

clean

1a4d9c8

Fix scenario

64d771f

update pathogenic callset

6830c58

FelixMoelder changed the title ~~fix: increase max read depth~~ fix: update parameters and callsets Nov 13, 2023

johanneskoester requested changes Nov 13, 2023

View reviewed changes

FelixMoelder added 2 commits November 14, 2023 08:51

update callset

69d1374

Merge branch 'main' into fix/update_config

2e84e7d

FelixMoelder closed this Feb 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: update parameters and callsets #34

fix: update parameters and callsets #34

FelixMoelder commented Aug 4, 2023 •

edited

Loading

johanneskoester Sep 23, 2023

FelixMoelder Nov 13, 2023 •

edited

Loading

johanneskoester Sep 23, 2023

fix: update parameters and callsets #34

fix: update parameters and callsets #34

Conversation

FelixMoelder commented Aug 4, 2023 • edited Loading

johanneskoester Sep 23, 2023

Choose a reason for hiding this comment

FelixMoelder Nov 13, 2023 • edited Loading

Choose a reason for hiding this comment

johanneskoester Sep 23, 2023

Choose a reason for hiding this comment

FelixMoelder commented Aug 4, 2023 •

edited

Loading

FelixMoelder Nov 13, 2023 •

edited

Loading