Skip to content

Commit 430d26c

Browse files
author
Samson Weiner
committed
instructions for generating real CNA profiles
1 parent a908bf9 commit 430d26c

File tree

1 file changed

+18
-0
lines changed

1 file changed

+18
-0
lines changed

data/README.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
# Generating CNA profiles
2+
Details of how the CNA profiles were generated for each dataset are provided below.
3+
4+
## T10 and T16
5+
Raw read counts in the form of fastq files were obtained from the Sequence Read Archive (SRA) under accession code SRA018951.105. We then ran bwa alignment, filtered out low quality reads, and indexed the resulting bam files following the raw data preprocessing steps provided in the README file at the root of the repository.
6+
7+
SCOPE profiles were generated following the instructions provided in the user manual with a bin length of 1Mbp.
8+
9+
To generate the SEACON profiles, we first ran `seacon prep_readcount` followed by `seacon pseudonormal` to identify normal cells and create a pseudonormal bam file. Next, we obtained a set of phased SNPs from the pseudonormal file following the instructions provided in the README file at the root of the repository. Next, we called the BAF estimation routine of CHISEL and `seacon prep_baf` to prepare the input BAFs. Finally, we ran the main call function of seacon using the following command:
10+
```
11+
seacon call -o SEACON_T16 --upper-filter 5 --tolerance 0.15 --max-wgd 1
12+
```
13+
14+
## 10x
15+
Processed CNA profiles of CHISEL as well as input RDRs and BAFs for all cells were acquired from the [chisel data repository](https://github.com/raphael-group/chisel-data). We then filtered the input RDRs and BAFs for cells from section E and fed these directly into SEACON. The main call function was executed as follows:
16+
```
17+
seacon call -o SEACON_10x_E --upper-filter 20 --tolerance 0.05
18+
```

0 commit comments

Comments
 (0)