@@ -14,8 +14,8 @@ follows procedures already used in other publications. We first map
1414the SMURF-seq reads using BWA:
1515```
1616bwa mem -x ont2d -k 12 -W 12 \
17- -A 4 -B 10 -O 6 -E 3 -T 120 bwa-mem/index/hg19.fa smurf_reads.fa
18- > mapped_smurf_reads.sam
17+ -A 4 -B 10 -O 6 -E 3 -T 120 bwa-mem/index/hg19.fa \
18+ smurf_reads.fa > mapped_smurf_reads.sam
1919```
2020The parameters for the Smith-Waterman scoring ('A', 'B', 'O' and 'E')
2121were determined using the simulation approach outined below (see also
@@ -25,10 +25,17 @@ seeds. The 'W' indicates to discard a chain if seed bases are shorter
2525than this value. The 'k' and 'W' are set to be liberal to catch and
2626evaluate as many candidate mappings as possible.
2727
28- Then the mapped fragments are given to a script that obtains the
29- counts of reads in bins :
28+ The mapped fragments are given to a script that filters ambiguously
29+ mapped fragments :
3030```
31- ./getBinCounts.py -i mapped_smurf_reads.sam -c hg19.chrom.sizes \
31+ ./filterAlnScoreAndQual.py -i mapped_smurf_reads.sam \
32+ -o unambig_smurf_frags.sam -s 120 -q 1
33+ ```
34+
35+ Then the remaining fragments are given to a script that obtains
36+ the counts of reads in bins:
37+ ```
38+ ./getBinCounts.py -i unambig_smurf_frags.sam -c hg19.chrom.sizes \
3239 -b bins_5k_hg19.bed -o bin_counts.bed -s bin_stats.txt
3340```
3441The input file ` mapped_smurf_reads.sam ` is just the mapped reads
0 commit comments