@@ -29,17 +29,22 @@ The mapped fragments are given to a script that filters ambiguously
2929mapped fragments:
3030```
3131./filterAlnScoreAndQual.py -i mapped_smurf_reads.sam \
32- -o unambig_smurf_frags.sam -s 120 -q 1
32+ -o unambig_smurf_frags.sam -s 120 -q 1
3333```
3434
35+ The input file ` mapped_smurf_reads.sam ` is just the mapped reads
36+ (e.g. with BWA). The output file ` unambig_smurf_frags.sam `
37+ contains mapped fragments with mapping quality greater than or equal
38+ to 1.
39+
3540Then the remaining fragments are given to a script that obtains
3641the counts of reads in bins:
3742```
3843./getBinCounts.py -i unambig_smurf_frags.sam -c hg19.chrom.sizes \
3944 -b bins_5k_hg19.bed -o bin_counts.bed -s bin_stats.txt
4045```
41- The input file ` mapped_smurf_reads .sam` is just the mapped reads
42- (e.g. with BWA). The file ` hg19.chrom.size ` is the size of all chroms
46+ The input file ` unambig_smurf_frags .sam` is the same as described above.
47+ The file ` hg19.chrom.size ` is the size of all chroms
4348in the reference genome. This file for the hg19 reference is supplied
4449in the ` data ` directory in this repo, and was obtained from the UCSC
4550Genome Browser's database. The pre-defined bins file ` bins_5k_hg19.bed `
@@ -164,7 +169,7 @@ well it recovers the known mapping locations. The steps are as follows.
164169 ` bedtools ` (we used v2.26.0). We also require the ` deadzones `
165170 program from ` http://github.com/smithlabcode/utils ` but this could
166171 be substituted for any means of excluding unmappable regions.
167- In our CNV analysis, we used ` bwa ` (version... ).
172+ In our CNV analysis, we used ` bwa ` (0.7.17 ).
168173
169174## Contacts and Bug Reports
170175
0 commit comments