You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: vignettes/solutions.Rmd
+6-6Lines changed: 6 additions & 6 deletions
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@ Questions:
17
17
18
18
Suggested answers are below. You might have some different code e.g. to customise the volcano plot as you like. Feel free to comment on any of these solutions in the workshop website as described [here](https://github.com/stemangiola/bioc_2020_tidytranscriptomics/blob/master/CONTRIBUTING.md).
Copy file name to clipboardExpand all lines: vignettes/supplementary.Rmd
+7-7Lines changed: 7 additions & 7 deletions
Original file line number
Diff line number
Diff line change
@@ -67,7 +67,7 @@ counts_tt %>%
67
67
68
68
We can also check how many counts we have for each sample by making a bar plot. This helps us see whether there are any major discrepancies between the samples more easily.
MA plots enable us to visualise amount of expression (logCPM) versus logFC. Highly expressed genes are towards the right of the plot. We can also colour significant genes (e.g. genes with FDR < 0.05)
128
128
129
-
```{r out.width = "40%"}
129
+
```{r out.width = "70%"}
130
130
# perform differential testing
131
131
counts_de <-
132
132
counts_tt %>%
@@ -147,7 +147,7 @@ counts_de %>%
147
147
148
148
A more informative MA plot, integrating some of the packages in tidyverse.
Copy file name to clipboardExpand all lines: vignettes/tidytranscriptomics.Rmd
+8-8Lines changed: 8 additions & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -94,7 +94,7 @@ Measuring gene expression on a genome-wide scale has become common practice over
94
94
95
95
There are many steps involved in analysing an RNA sequencing dataset. The main steps for a differential expression analysis are shown in the figure below. Sequenced reads are aligned to a reference genome, then the number of reads mapped to each gene can be counted. This results in a table of counts, which is what we perform statistical analyses on in R. While mapping and counting are important and necessary tasks, today we will be starting from the count data and showing how differential expression analysis can be performed in a friendly way using tidybulk.
@@ -202,7 +202,7 @@ After we run `scale_abundance` we should see some columns have been added at the
202
202
203
203
We can visualise the difference of abundance densities before and after scaling. As tidybulk output is compatible with tidyverse, we can simply pipe it into standard tidyverse functions such as `filter`, `pivot_longer` and `ggplot`. We can also take advantage of ggplot's `facet_wrap` to easily create multiple plots.
@@ -319,7 +319,7 @@ The samples separate by treatment on PC1 which is what we hope to see. PC2 separ
319
319
320
320
An alternative to principal component analysis for examining relationships between samples is using hierarchical clustering. Heatmaps are a nice visualisation to examine hierarchical clustering of your samples. tidybulk has a simple function we can use, `keep_variable`, to extract the most variable genes which we can then plot with tidyHeatmap.
321
321
322
-
```{r out.width = "40%"}
322
+
```{r out.width = "70%"}
323
323
counts_scaled %>%
324
324
325
325
# filter lowly abundant
@@ -461,7 +461,7 @@ topgenes_symbols
461
461
462
462
Volcano plots are a useful genome-wide plot for checking that the analysis looks good. Volcano plots enable us to visualise the significance of change (p-value) versus the fold change (logFC). Highly significant genes are towards the top of the plot. We can also colour significant genes (e.g. genes with false-discovery rate < 0.05)
463
463
464
-
```{r out.width = "40%"}
464
+
```{r out.width = "70%"}
465
465
# volcano plot, minimal
466
466
counts_de %>%
467
467
filter(!lowly_abundant) %>%
@@ -474,7 +474,7 @@ counts_de %>%
474
474
475
475
A more informative plot, integrating some of the packages in tidyverse.
476
476
477
-
```{r out.width = "40%", warning=FALSE}
477
+
```{r out.width = "70%", warning=FALSE}
478
478
counts_de %>%
479
479
pivot_transcript() %>%
480
480
@@ -501,7 +501,7 @@ Before following up on the differentially expressed genes with further lab work,
501
501
502
502
With stripcharts we can see if replicates tend to group together and how the expression compares to the other groups. We'll also add a box plot to show the distribution.
503
503
504
-
```{r out.width = "40%"}
504
+
```{r out.width = "70%"}
505
505
strip_chart <-
506
506
counts_scaled %>%
507
507
@@ -525,7 +525,7 @@ A really nice feature of using tidyverse and ggplot2 is that we can make interac
525
525
526
526
We can also specify which parameters from the `aes` we want to show up when we hover over the plot with `tooltip`.
0 commit comments