Skip to content

Commit ef3d128

Browse files
committed
added filtering step before CNV processing
1 parent 0ef1881 commit ef3d128

File tree

1 file changed

+12
-5
lines changed

1 file changed

+12
-5
lines changed

README.md

Lines changed: 12 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -14,8 +14,8 @@ follows procedures already used in other publications. We first map
1414
the SMURF-seq reads using BWA:
1515
```
1616
bwa mem -x ont2d -k 12 -W 12 \
17-
-A 4 -B 10 -O 6 -E 3 -T 120 bwa-mem/index/hg19.fa smurf_reads.fa
18-
> mapped_smurf_reads.sam
17+
-A 4 -B 10 -O 6 -E 3 -T 120 bwa-mem/index/hg19.fa \
18+
smurf_reads.fa > mapped_smurf_reads.sam
1919
```
2020
The parameters for the Smith-Waterman scoring ('A', 'B', 'O' and 'E')
2121
were determined using the simulation approach outined below (see also
@@ -25,10 +25,17 @@ seeds. The 'W' indicates to discard a chain if seed bases are shorter
2525
than this value. The 'k' and 'W' are set to be liberal to catch and
2626
evaluate as many candidate mappings as possible.
2727

28-
Then the mapped fragments are given to a script that obtains the
29-
counts of reads in bins:
28+
The mapped fragments are given to a script that filters ambiguously
29+
mapped fragments:
3030
```
31-
./getBinCounts.py -i mapped_smurf_reads.sam -c hg19.chrom.sizes \
31+
./filterAlnScoreAndQual.py -i mapped_smurf_reads.sam \
32+
-o unambig_smurf_frags.sam -s 120 -q 1
33+
```
34+
35+
Then the remaining fragments are given to a script that obtains
36+
the counts of reads in bins:
37+
```
38+
./getBinCounts.py -i unambig_smurf_frags.sam -c hg19.chrom.sizes \
3239
-b bins_5k_hg19.bed -o bin_counts.bed -s bin_stats.txt
3340
```
3441
The input file `mapped_smurf_reads.sam` is just the mapped reads

0 commit comments

Comments
 (0)