diff --git a/www/voyAGEr-WebAppTutorial.html b/www/voyAGEr-WebAppTutorial.html new file mode 100644 index 0000000..6610545 --- /dev/null +++ b/www/voyAGEr-WebAppTutorial.html @@ -0,0 +1,562 @@ + + + + +
+ + + + + + + + +voyAGEr is freely available at https://compbio.imm.medicina.ulisboa.pt/voyAGEr
+voyAGEr is composed of four main sections (the tabs in the navigation bar at the top):
+Home
(depicted by the home icon and no literal titling): to visually explain the used methodand its associated findings featured in the application.
Gene
: to lead a gene-centric investigation, namely to assess how the expression of a specificgene changes with age and sex in a specific tissue.
Tissue
: to analyse how tissue-specific transcriptomes change with age and sex.
Module
: to further examine sets of co-expressed genes whose expression is altered with age namely through their enrichment in specific cell types, biological pathways and association with diseases.
voyAGEr leverages RNA-seq datasets from the GTEx project (Lonsdale et al., 2013), encompassing tissue samples from hundreds of donors aged from 20 to 70 years.
+Cellular senescence is a stress-induced cell cycle arrest limiting proliferation of potentially oncogenic cells but progressively creating an inflammatory environment in tissues as they age and therefore an example of a process whose molecular mechanisms are of particular interest to ageing researchers (Gorgoulis et al., 2019; Van Deursen, 2014).
+Senescence markers, such as CDKN2A, encoding cell cycle regulatory protein p16INK4A thataccumulates in senescent cells (Erickson et al., 1998; Gil & Peters, 2006), can thus be studied as putative markers of ageing of certain tissues.
+To examine CDKN2A expression changes across age:
+Go to the Gene section
Type CDKN2A in the Gene field
++Note that gene names in voyAGEr are HGNC (HUGO Gene Nomenclature Committee) symbols. +For each gene, the respective NCBI and GeneCards webpages can be accessed by clicking on their logos next to its name on plot’s title.
+
Enter/select Lung in the Tissue field to investigate CDKN2A expression changes in that specific tissue.
+Plots of CDKN2A expression (top panel,identical to that in the Profile sub-tab) and the significance of its alterations over age (bottom panel) are then featured (Figure 2.3). +Significant CDKN2A expression changes are observed in around 30 years-old, late forties and mid fifties.
+The user can also check the the overall changes of CDKN2A with age, represented as the subtitle of this figure. +These are the results of fitting the ShARP-LM model on the entire age range, providing both p-value and t-value. +A positive t-value represents an increase of expression with age.
++GTEx transcriptomic data are from “healthy” tissue samples from donors that had, nonetheless, reported medical conditions (Lonsdale et al., 2013). +
+
Click on Sex in the Coloured by field,leaving All in the Shaped by field.
+CDKN2A lung expression progression with age appears to be influenced by the donors’ sex, particularly in the mid-thirties (Figure 2.4). +This observation can be statistically tested in the Alteration sub-tab by clicking on Sex in the Alterations associated with field.
+Back in the Profile sub-tab, click on All in the Coloured by fieldand on Condition inthe Shaped by field. Enter/select MHCOPD in the Select field.
+The CDKN2A lung expression profile is herein associated with medical conditions(positive if the donor suffered from the condition, negative if not and unknown if theassociation is uncharted). +Moreover, the median gene expression values for positiveand negative conditions are displayed. +The significance of Kruskal-Wallis tests for thedifference in gene expression medians between positive and negative donors is usedto rank conditions. +In this case, the condition selected (Chronic Respiratory Disease) is amongst those displaying a significant difference in median (adjusted p-value below 0.05). +On the scatter plot with CDKN2A lung expression over age, the curvesfitted independently for positive and negative conditions show that such difference ingene expression occurs mostly after the age of 50 (Figure 2.5).
++Limitations: In the GTEx dataset, there are conditions for which very few donors are positive and others for which very few donors have their condition state annotated. +The significance of the Kruskal-Wallis tests must therefore be regarded with caution and as providing limited information. +In this case, for example, even though significant differences in median were found for the Chronic Respiratory Disease, the low number of positive samples and their concentration in limited age ranges hamper any solid conclusion.
+
Go to the Tissue section.
+The landscape of Age-, Sex-and Age&Sex-associated global gene expression alterations along age for all tissues can be profiled using the significance of proportions of differentially expressed genes. +Three periods stand out with significant transcriptional changes associated with Age (keeping the default All tissues in the Tissue field and Age in the Alterations associated with field), after 55 years old (Figure 3.1). +Moreover, most of the significant transcriptional differences between sexes appear to occur in the fifth and sixth decades of life (All tissues in the Tissue field and Sex in the Alterations associated with field) (Figure 3.2).
Enter Adipose – Subcutaneous in the Tissue field and click on Age in the Alterations associated with field.
+The progression of the percentage of Age-associated altered genes over age is now featured (Figure 3.3). +The statistical significance of each proportion is also illustrated with a colour scale.Two periods of major transcriptional changes appear to occur, at late 20’s (13.6% altered genes) and late 40’s (4.7% altered genes).
Click on the dot at 29.57 years old (hovering over each point in the plot will show its details). +The list of differentially expressed genes, ordered by their significance, appears on the sidebar on the left. +Although visually not exatcly the same as in the web app, you can also explore the table below.
Click on the LMO3 row in the table.
+Plots of LMO3 expression and the significance of its alterations over age (like in Figure 2.3) appear.
Browse the expression alterations’ significance over age of the most altered genes by selecting them in the table.
+Some (e.g., PRELID1, RUNX1T1, FGFRL1) have their expression significantly modified only in the aforementioned first peak at around 28 years old.
Click on the dot at 46.43 y.o. +and similarly browse the expression alterations’ significance of the most differentially expressed genes at this age.
+Some (e.g. MT-CYB, MT-ND4, MT-ATP6, MT-ND2) have their expression significantlyaltered only in this second peak.
Different sets of genes may drive the different age periods of major transcriptional changes, which begs assessing if they reflect the activation of distinct biological processes. +For this purpose, the user can profile the biological functions of the genes underlying each peak of transcriptomic changes byassessing their enrichment in manually curated pathways from the Reactome database (Croft et al., 2014) or in user-provided gene sets.
+Go to the Enrichment sub-tab.
+A heatmap showing the normalised enrichment score (NES) of Reactome pathways (columns) along age (row) is displayed (Figure 3.4). +The percentage of altered genes over age can be found on the right side of the heatmap. +Reactome pathways are gathered in families of biologicalfunctions, based on shared genes, that can be found at the top of the heatmap.
++Note that, for visualisation ease, only the most significantly associated pathways are featured.
+The user can click on Select: in the Pathway field to examine results for a given Reactome pathway.
+
Below the heatmap, click on the red family (family 3) in the Families of pathways section.
+A word cloud provides a glimpse into the family’s biological functions.
+By clicking on the Pathways sub-tab in the Families of pathways section, the user hasaccess to the list of specific pathways from the Reactome, Gene Ontology (Gene OntologyConsortium, 2004) and KEGG (Kanehisa, 2000) databases that are associated with the family.
Click on User-specified in the Geneset field on the left.
+Let’s examine the enrichment of the three peaks of transcriptional changes in senescent-associated genes.
Enter the 230 senescent-associated genes (retrieved from Senequest (Gorgoulis et al., 2019) whose link with senescence is supported by at least 4 sources) from this document’s appendixin the List of genes field, leave a Significance threshold p-value of 0.05 and Run.
+Although we have two peaks, if you hover over the tip you’ll notice neither of them are significantly enriched in senescence-associated genes (Figure 3.5).
++Gene symbols can be in upper or lower case but must still follow the HGNC naming. +If a gene symbol is notrecognised as such, the gene is not included in the analysis.
+
Genes with highly correlated expression are likely to be coregulated and share biological functions or associations with phenotypical or pathological traits (van Dam et al., 2017). +Clusters of these genes, called modules, are identified in 4 tissues using voyAGEr and their enrichment in cell types, Reactome pathways, and disease markers can be analysed.
+Go to the Results sub-section of the Module section.
+++The About sub-section graphically summarises the methods employed to obtain the modules.
+
Each module is made of a set of genes and characterised by an eigengene representing their averageexpression profile.
+Modules’ eigengene expression and enrichment in Reactome pathways, cell types, and diseasemarkers can be respectively found in the 4 sub-tabs: Expression
,Cell types
, Pathways
, Diseases
.
Choose Brain - Cortex in the Tissue field.
+8 modules were identified in this tissue. +Each module is named based on the colour used to depict it.
+In the Expression
tab, the user can explore the scaled expression of the eigengene of each module (Figure 4.1).
Go to the Cell types sub-tab.
+++For each tissue, cell types and their markers were retrieved from the literature and then differ from a paper to another. +Regarding the Brain - Cortex analysis, we can see that the annotation of cell types from Fan (Fan et al., 2018) is more comprehensive than Descartes (Cao et al., 2020).
+
Select the Descartes signature. +At least four of the 5 modules appear to be particularlyenriched for certain cell types markers (Figure 4.2): the yellow module for Microglia, turquoise for Endothelial cells, brown for oligodendrocytes, and blue for astrocytes.
Choose MEblue in the Module field.
+The four layers of information captured in the four Module sub-tabs are now specifically displayed for the chosen module. +Besides, the module’s 241 genes are identified on the left. +The expression of the module’s eigengene appears to increase with age (Figure 4.3).
Click on Sex in the Colored by field.
+The module’s eigengene expression suggests differences between sexes throughout age (Figure 4.4). +However, the number of Female samples is decreased in young ages when compared to Male samples, and as such these results should be interpreted with a grain of salt.
+
+