You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: 15-workshop-wrapup.Rmd
+7Lines changed: 7 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -5,23 +5,30 @@ Over the last two days, as well as the webinar in October, we've explored the st
5
5
## Summary of key messages
6
6
7
7
**<spanstyle="color: green;">ORA and GSEA are different statistical analyses, and their inputs differ</span>**
8
+
8
9
GSEA: Kolmogorov-Smirnov test, requires a ranked yet unfiltered gene list
9
10
ORA: Hypergeometric or Fisher’s Exact test, requires a filtered unranked gene list and experimental background gene list
10
11
11
12
**<spanstyle="color: green;">Always correct for multiple testing</span>**
13
+
12
14
Never use unadjusted P values, as this will introduce many false positives. Different tools offer different multiple testing correction such as FDR or the more stringent BH. Always report your chosen method and the significance threshold applied to terms.
13
15
14
16
**<spanstyle="color: green;">Different analysis methods will return different results</span>**
17
+
15
18
This is expected, due to underlying differences in database, algorithm, P value methods etc. As long as your methods are robust, sensible and reproducible, you can have confidence that your methods will stand up to scrutiny under peer review.
Lack of reproducibility through under-reporting methods is a common issue in this field (see Wijesooriya et al, linked below). Ensure to record all methodological details while you are working, including all the parameters and arguments applied, how the gene lists were generated, versions of databases and tools etc. If using R, specify a seed for constant random number generation in GSEA.
19
23
20
24
**<spanstyle="color: green;">Interpret your results in their biological context</span>**
25
+
21
26
Functional categories are often broad and redundant. Use the FEA results as a guide, not the end point. Use visualisations and explore term redundancy methods to help focus results. Validate through aditional means according to the nature of your experiment, with the gold standard being wet-lab rather than *in silico* validation methods.
22
27
23
28
**<spanstyle="color: green;">There are many databases and tool choices available</span>**
29
+
24
30
Suitability to your experiment depends on many factors, including:
31
+
25
32
- Your species, and what tools support it
26
33
- What databases and gene sets are relevant to your experiment, from the general (eg GO) to the specific (eg cancer pathways)
0 commit comments