-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathmotif_finding_notes.txt
More file actions
37 lines (20 loc) · 1012 Bytes
/
motif_finding_notes.txt
File metadata and controls
37 lines (20 loc) · 1012 Bytes
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
This workshop originally used motifRG, however it has been removed from Bioconductor.
The only Bioconductor motif finder now seems to be rGADEM. It seems to have build problems, plus only runs in unstranded mode.
I've decided it's better to emphasize using an external program, from the MEME suite.
This is the old motifRG code:
```{r cache=TRUE, warning=FALSE, message=FALSE}
library(motifRG)
# Generate a background set of sequences by shuffling bases
# - calling sample() on a DNAString shuffles it
background_seqs <- endoapply(initiation_seqs, sample)
# Equivalent to:
# background_seqs <- DNAStringSet( lapply(initiation_seqs, sample) )
names(background_seqs) <- paste0(names(background_seqs), "-shuffled")
results <- findMotifFgBg(
initiation_seqs, background_seqs,
both.strand=FALSE, start.width=4)
summaryMotif(results$motif, results$category)
motifHtmlTable(results)
refined <- refinePWMMotif(results$motifs[[1]]@match$pattern, initiation_seqs)
seqLogo(refined$model$prob)
```