This repository has been archived by the owner on Sep 20, 2024. It is now read-only.

UC - 5. LINE-1 Retrotransposon Expression #6

Open
NoopDog opened this issue Jun 29, 2021 · 5 comments
Assignees
Labels
completion phase SYS INTEROP System interoperability use case

Comments

@NoopDog
Collaborator

NoopDog commented Jun 29, 2021

Interop Contact: Jack DiGiovanna
Active in 2021: Active
Researcher: Wilson McKerrow [PDC (CRDC), GDC (CRDC), and GTEx (AnVIL) data]

Analysis Question

The Fenyo lab is studying how retrotransposons work, which is fundamentally a multi-omic question. Specifically, the insertion occurs in the genome; this insertion can change the transcriptome, resulting in altered protein expression. This research project tests the hypothesis that the activity of a specific retrotransposon, LINE-1, differs between tumors and normal cells. To test this hypothesis, the researcher requires datasets with matching DNA, RNA, and protein data for the same samples. To date, work has focused on the tumor samples in the TCGA data and proteomic data from the CPTAC datasets. However, the number of normal samples in the TCGA dataset is fairly small. The GTEx dataset has many more normal samples for the same tissue types as the tumor samples in TCGA, and the lab would like to expand its analysis to GTEx to better understand LINE-1 activity in normal tissue and compare it to the tumor data.

The genomic and proteomic workflows are wrapped in CWL and functional on the CRDC. The lab's analysis of the TCGA data is already complete, and the results are available on the CRDC (highlighted at the prior Interop Meeting). The GTEx data is only accessible from the AnVIL platform, which currently only supports workflows wrapped in WDL. The interoperability project aims to find a path to connect the GTEx data on the AnVIL platform to further processing on the CRDC, and to combine the results with the prior analysis there. This “normals” use case is a frequent request from our users, so finding a solution would be extremely valuable for a large number of cancer researchers.

Analysis Plan

  1. Obtain confirmation from appropriate NIH Data Access Committees that these datasets and data uses are allowable and can be used/combined in this manner.
  2. (Ideally) through a single sign-on event, authenticate the user and authorize appropriate access through RAS integration
  3. Find proteomic cohort in PDC Data Portal
  4. Export manifest describing cohort
  5. Pull this data into a CRDC analysis ecosystem
  6. Perform proteomics analysis within CRDC
  7. Perform genomics analysis within CRDC
  8. Combined analysis of the results from 6 and 7
  9. Find GTEx data cohort within AnVIL
  10. Copy this dataset to CRDC
  11. Perform GTEx analysis on CRDC
  12. Combine derived results from 8 and 11 as necessary
    Interop Requirements: Interop between GDC and PDC within CRDC; interop between AnVIL and CRDC
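
For the final step of the plan (combining the derived tumor and normal results), here is a minimal sketch of what that comparison could look like once both result sets are in one place. Everything in it is an assumption for illustration: the record layout, the `tissue` and `line1_score` field names, the helper function names, and the numeric values are hypothetical and do not come from the lab's CWL workflows or any platform API.

```python
from collections import defaultdict
from statistics import mean


def mean_by_tissue(records):
    """Average a per-sample LINE-1 score within each tissue type."""
    grouped = defaultdict(list)
    for rec in records:
        grouped[rec["tissue"]].append(rec["line1_score"])
    return {tissue: mean(scores) for tissue, scores in grouped.items()}


def compare_tumor_to_normal(tumor_records, normal_records):
    """Return tumor-minus-normal mean score for tissues present in both cohorts."""
    tumor = mean_by_tissue(tumor_records)
    normal = mean_by_tissue(normal_records)
    shared = sorted(tumor.keys() & normal.keys())
    return {tissue: tumor[tissue] - normal[tissue] for tissue in shared}


# Made-up values, standing in for derived results from the two analyses.
tumor_records = [
    {"sample": "T1", "tissue": "colon", "line1_score": 4.0},
    {"sample": "T2", "tissue": "colon", "line1_score": 6.0},
    {"sample": "T3", "tissue": "lung", "line1_score": 3.0},
]
normal_records = [
    {"sample": "N1", "tissue": "colon", "line1_score": 1.0},
    {"sample": "N2", "tissue": "colon", "line1_score": 2.0},
    {"sample": "N3", "tissue": "breast", "line1_score": 1.5},
]

differences = compare_tumor_to_normal(tumor_records, normal_records)
print(differences)
```

Only tissues present in both cohorts are compared, which mirrors the motivation above: GTEx supplies normal samples for the same tissue types as the TCGA tumor samples.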
@NoopDog NoopDog added the Epic label Jun 29, 2021
@jackDiGi
Collaborator

This has also been captured in a webinar, which details the scientific story.

@jackDiGi
Collaborator

jackDiGi commented Sep 3, 2021

This use case has finished successfully.

@linikujp
Member

Is the training material being developed?

@linikujp linikujp reopened this Oct 26, 2021
@NoopDog NoopDog self-assigned this Nov 8, 2021
@jackDiGi jackDiGi added the SYS INTEROP System interoperability use case label Nov 16, 2021
@NoopDog NoopDog moved this to Proposed in NCPI Use Case Tracker Dec 3, 2021
@NoopDog NoopDog changed the title UC - 5. NCI CRDC + NHGRI AnVIL UC - 5. LINE-1 Retrotransposon Expression Feb 4, 2022
@NoopDog NoopDog moved this from Proposed to Training Material Dev in NCPI Use Case Tracker Feb 4, 2022
@NoopDog
Collaborator Author

NoopDog commented Feb 9, 2022

@jackDiGi @linikujp This is on the NCPI staging server here:

https://staging.anvilproject.org/ncpi/demonstration-projects/line-1-retrotransposon-expression-mckerrow

Your review would be appreciated!

Thanks,
Dave

@jackDiGi
Collaborator

jackDiGi commented Mar 7, 2022

@jackDiGi jackDiGi moved this from Training Material Dev to Complete in NCPI Use Case Tracker Mar 6, 2023
Projects
Status: Complete

3 participants