TARGET-GDC Initial Data Release #2105

Status: Open
Wants to merge 4 commits into base: master


jamesqo (Contributor) commented Nov 29, 2024

What?

This PR adds the initial batch of TARGET studies sourced from GDC using ISB-CGC BigQuery.

List of studies included (added to CRDC readme too):

Checks

For all pull requests:

  • Passes validation

For a new study (in addition to above):

  • Do the study name and study ID follow our convention? e.g. Tumor_Type (Institute, Journal Year); brca_mskcc_2015
  • Is the study metadata complete? e.g. pmid, citation
  • Were all samples profiled with WES/WGS? If not, is a gene panel file curated?
  • Are the Oncotree codes of all samples curated? Cancer Type and Cancer Type Detailed need to be added in addition to Oncotree Code.
  • Clinical sample and patient data with meta files.
  • Mutations data with meta file.
  • Is the study based on hg38? If so, is the reference_genome: hg38 option included in the meta study file?
  • CNA data with meta files
  • CNA segment data with meta files
  • Expression data including z-scores with meta files
  • Other genomic profiles with meta files
  • Case-lists for all profiles.
  • Perform sanity checks based on the items in the checklist
  • Manual checking (Niki or JJ): Triage or private Portal link here
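
A few of the checklist items above (study ID convention, pmid and citation present, reference_genome: hg38) can be spot-checked mechanically. The following is only a rough sketch under stated assumptions, not the project's validator: the regex is inferred solely from the single example ID brca_mskcc_2015, the field name cancer_study_identifier and the "key: value" line layout are assumptions about the meta study file, and check_study_meta is a hypothetical helper name.

```python
import re

# Assumption: IDs look like "brca_mskcc_2015", i.e. lowercase tokens joined by
# underscores and ending in a four-digit year. Inferred from one example only;
# loosen the pattern if the convention allows other shapes.
STUDY_ID_PATTERN = re.compile(r"^[a-z0-9]+(?:_[a-z0-9]+)+_\d{4}$")


def parse_meta(text):
    """Parse 'key: value' lines into a dict, skipping blank and comment lines."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def check_study_meta(text, expect_hg38=False):
    """Return a list of problems found; an empty list means none were found."""
    fields = parse_meta(text)
    problems = []

    # "cancer_study_identifier" is an assumed field name for the study ID.
    study_id = fields.get("cancer_study_identifier", "")
    if not STUDY_ID_PATTERN.match(study_id):
        problems.append(f"study ID {study_id!r} does not match the assumed convention")

    # The checklist names pmid and citation as examples of required metadata.
    for key in ("pmid", "citation"):
        if not fields.get(key):
            problems.append(f"missing {key}")

    # Only hg38-based studies need the reference_genome option.
    if expect_hg38 and fields.get("reference_genome") != "hg38":
        problems.append("reference_genome: hg38 not set")

    return problems
```

Calling check_study_meta(open(path).read(), expect_hg38=True) on a meta study file returns the list of problems for that study. It does not replace the validation run or the manual review listed above.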
