Jobs for microsalt analyses not tracked correctly #2896

seallard · 2024-02-05T08:41:41Z

Description

For some microsalt analyses, the slurm jobs are not tracked. When checking the path to the job ids file for those analyses, it does not exist, which explains why no jobs show up.

It turns out for the analyses where jobs were being reported (over the past month), the jobs displayed were actually from the previous analysis of the case.

The underlying issue is that microsalt outputs a directory with a timestamp, and it is only created once the analysis is completed. So the pending analysis in trailblazer cannot be provided with the correct path.

Decided fix

Changing the output dir in microsalt is not viable given lack of tests and that the entire pipeline is being replaced. The least error prone path is to revert the old logic and just create the missing directory on Hasta for the slurm job id files.

This is the relevant logic creating the slurm job ids file:

        try:
            #Generates file with all slurm ids
            slurmname = "{}_slurm_ids.yaml".format(self.name)
            slurmreport_storedir = Path(self.config["folders"]["reports"],
                "trailblazer", slurmname)
            slurmreport_workdir = Path(self.finishdir, slurmname)
            yaml.safe_dump(
                data={"jobs": [str(job) for job in joblist]},
                      stream=open(slurmreport_workdir, "w"))
            shutil.copyfile(slurmreport_workdir, slurmreport_storedir)
            self.logger.info(
                "Saved Trailblazer slurm report file to %s and %s",
                slurmreport_storedir,
                slurmreport_workdir,
            )
        except Exception as e:
            self.logger.info("Unable to generate Trailblazer slurm report file")

Create directory on Hasta: /home/proj/production/microbial/results/reports/trailblazer and /home/proj/stage/microbial/results/reports/trailblazer
Update the logic in cg adding the pending microsalt analysis in trailblazer to pass paths like /home/proj/production/microbial/results/reports/trailblazer/<ticket_id>_slurm_ids.yaml. The out directory should be /home/proj/production/microbial/results/reports/deliverables (?).

seallard · 2024-02-05T10:11:33Z

If a microsalt analysis is re-run, will the old slurm ids be overwritten in the trailblazer directory?
Yes, the dir is open in write mode and any filecontents will be overwritten.

seallard · 2024-02-05T10:25:34Z

The name used for the job ids file seems to differ depending on the number of samples in the case 🤢 😭

        if isinstance(self.sampleinfo, list) and len(self.sampleinfo) > 1:
            self.name = self.sampleinfo[0].get("CG_ID_project")
            self.sample = self.sampleinfo[0]
            for entry in self.sampleinfo:
                if entry.get("CG_ID_sample") == self.name:
                    raise Exception(
                        "Mixed projects in samples_info file. Do not know how to proceed"
                    )
        else:
            if isinstance(self.sampleinfo, list):
                self.sampleinfo = self.sampleinfo[0]
            self.name = self.sampleinfo.get("CG_ID_sample")
            self.sample = self.sampleinfo

I'm going to disregard this since cases with one sample are rare in microsalt. And why would you even use different paths? 🤦 Added to backlog in microsalt Clinical-Genomics/microSALT#170

seallard mentioned this issue Feb 5, 2024

Microsalt jobs not tracked correctly #2812

Closed

seallard added the Bug label Feb 5, 2024

seallard mentioned this issue Feb 5, 2024

Patch microsalt jobs tracking #2897

Merged

1 task

seallard changed the title ~~Jobs for some microsalt analyses are not tracked~~ Jobs for microsalt analyses not tracked correctly Feb 5, 2024

seallard closed this as completed in #2897 Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jobs for microsalt analyses not tracked correctly #2896

Jobs for microsalt analyses not tracked correctly #2896

seallard commented Feb 5, 2024 •

edited

Loading

seallard commented Feb 5, 2024 •

edited

Loading

seallard commented Feb 5, 2024 •

edited

Loading

seallard commented Feb 5, 2024

seallard commented Feb 5, 2024 •

edited

Loading

Jobs for microsalt analyses not tracked correctly #2896

Jobs for microsalt analyses not tracked correctly #2896

Comments

seallard commented Feb 5, 2024 • edited Loading

Description

Suggested solution

seallard commented Feb 5, 2024 • edited Loading

seallard commented Feb 5, 2024 • edited Loading

Decided fix

seallard commented Feb 5, 2024

seallard commented Feb 5, 2024 • edited Loading

seallard commented Feb 5, 2024 •

edited

Loading

seallard commented Feb 5, 2024 •

edited

Loading

seallard commented Feb 5, 2024 •

edited

Loading

seallard commented Feb 5, 2024 •

edited

Loading