Skip to content

Commit 0032def

Browse files
Update README.md
1 parent aa67759 commit 0032def

File tree

1 file changed

+142
-0
lines changed

1 file changed

+142
-0
lines changed

README.md

Lines changed: 142 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -166,3 +166,145 @@ export SAMPLE="TWJF-10146-0029"
166166
python3 /opt/scripts/setup_review.py -WB $WORKING_BASE -a ../itb-review-files/*.xlsx -c $WORKING_BASE/../generate_protein_fasta/candidates/annotated_filtered.vcf-pass-51mer.fa.manufacturability.tsv -samp $SAMPLE -classI $WORKING_BASE/final_results/pVACseq/mhc_i/*.all_epitopes.aggregated.tsv -classII $WORKING_BASE/final_results/pVACseq/mhc_ii/*.all_epitopes.aggregated.tsv
167167
```
168168
Open colored_peptides51mer.html and copy the table into an excel spreadsheet. The formatting should remain. Utilizing the Annotated.Neoantigen_Candidates and colored Peptides_51-mer for manual review.
169+
170+
# Description of Scripts includes
171+
172+
## Get Basic QC
173+
174+
```bash
175+
python3 /opt/scripts/get_neoantigen_qc.py --help
176+
usage: get_neoantigen_qc.py [-h] [-WB WB] [-f FIN_RESULTS] [--n_dna N_DNA] [--t_dna T_DNA] [--t_rna T_RNA] [--concordance CONCORDANCE] [--contam_n CONTAM_N] [--contam_t CONTAM_T]
177+
[--rna_metrics RNA_METRICS] [--strand_check STRAND_CHECK] --yaml YAML [--fin_variants FIN_VARIANTS]
178+
179+
Get the stats for the basic data QC review in the neoantigen final report.
180+
181+
optional arguments:
182+
-h, --help show this help message and exit
183+
-WB WB the path to the gcp_immuno folder of the trial you wish to tun script on, defined as WORKING_BASE in envs.txt
184+
-f FIN_RESULTS, --fin_results FIN_RESULTS
185+
Name of the final results folder in gcp immuno
186+
--n_dna N_DNA file path for aligned normal dna FDA report table
187+
--t_dna T_DNA file path for aligned tumor dna FDA report table
188+
--t_rna T_RNA file path for aligned tumor rna FDA report table
189+
--concordance CONCORDANCE
190+
file path for Somalier results for sample tumor/normal sample relatedness
191+
--contam_n CONTAM_N file path for VerifyBamID results for contamination the normal sample
192+
--contam_t CONTAM_T file path for VerifyBamID results for contamination the tumor sample
193+
--rna_metrics RNA_METRICS
194+
--strand_check STRAND_CHECK
195+
--yaml YAML
196+
--fin_variants FIN_VARIANTS
197+
```
198+
199+
## GET FDA metrics
200+
python3 /opt/scripts/get_FDA_thresholds.py --help
201+
usage: get_FDA_thresholds.py [-h] [-WB WB] [-f FIN_RESULTS] [--n_dna N_DNA] [--t_dna T_DNA] [--t_rna T_RNA] [--una_n_dna UNA_N_DNA] [--una_t_dna UNA_T_DNA] [--una_t_rna UNA_T_RNA]
202+
[--somalier SOMALIER] [--contam_n CONTAM_N] [--contam_t CONTAM_T]
203+
204+
Get FDA qc stats from various files and determine if they pass or fail.
205+
206+
optional arguments:
207+
-h, --help show this help message and exit
208+
-WB WB the path to the gcp_immuno folder of the trial you wish to tun script on, defined as WORKING_BASE in envs.txt
209+
-f FIN_RESULTS, --fin_results FIN_RESULTS
210+
Name of the final results folder in gcp immuno
211+
--n_dna N_DNA file path for aligned normal dna FDA report table
212+
--t_dna T_DNA file path for aligned tumor dna FDA report table
213+
--t_rna T_RNA file path for aligned tumor rna FDA report table
214+
--una_n_dna UNA_N_DNA
215+
file path for unaligned normal dna FDA report table
216+
--una_t_dna UNA_T_DNA
217+
file path for unaligned tumor dna FDA report table
218+
--una_t_rna UNA_T_RNA
219+
file path for unaligned tumor rna FDA report table
220+
--somalier SOMALIER file path for Somalier results for sample tumor/normal sample relatedness (concordance.somalier.pairs.tsv)
221+
--contam_n CONTAM_N file path for VerifyBamID results for contamination the normal sample
222+
--contam_t CONTAM_T file path for VerifyBamID results for contamination the tumor dna sample
223+
224+
## Setup Review
225+
226+
The set up review script runs two other scripts: generate_reviews_files.py and color_peptides51mer.py. The first sets up the Annotated.Neoantige.Canidates spreadsheet and the Peptides 51mer spreadsheet. The second script colors the Peptides 51mer sequences and outputs an html table whihc can be copied into a Microsoft spreadsheet.
227+
228+
```bash
229+
python3 /opt/scripts/setup_review.py --help
230+
usage: setup_review.py [-h] [-WB WB] [-samp SAMP] [-a A] [-c C] -classI CLASSI -classII CLASSII
231+
232+
Sets up manuel review files
233+
234+
optional arguments:
235+
-h, --help show this help message and exit
236+
-WB WB the path to the gcp_immuno folder of the trial you wish to tun script on, defined as WORKING_BASE in envs.txt
237+
-samp SAMP Name of the sample
238+
-a A Path to ITB Reviewed Candidates
239+
-c C Path to annotated_filtered.vcf-pass-51mer.fa.manufacturability.tsv
240+
-classI CLASSI Path to classI all_epitopes.aggregated.tsv
241+
-classII CLASSII Path to classII all_epitopes.aggregated.tsv
242+
243+
```
244+
245+
## Generate Review Files
246+
247+
```bash
248+
python3 /opt/scripts/generate_reviews_files.py --help
249+
usage: generate_reviews_files.py [-h] [-a A] [-c C] [-samp SAMP] [-WB WB] [-f FIN_RESULTS]
250+
251+
Create the file needed for the neoantigen manuel review
252+
253+
optional arguments:
254+
-h, --help show this help message and exit
255+
-a A The path to the ITB Reviewed Candidates
256+
-c C The path to annotated_filtered.vcf-pass-51mer.fa.manufacturability.tsv from the generate_protein_fasta script
257+
-samp SAMP The name of the sample
258+
-WB WB the path to the gcp_immuno folder of the trial you wish to tun script on, defined as WORKING_BASE in envs.txt
259+
-f FIN_RESULTS, --fin_results FIN_RESULTS
260+
Name of the final results folder in gcp immuno
261+
```
262+
263+
## Color Peptides 51mer
264+
265+
```
266+
python3 /opt/scripts/generate_reviews_files.py --help
267+
usage: generate_reviews_files.py [-h] [-a A] [-c C] [-samp SAMP] [-WB WB] [-f FIN_RESULTS]
268+
269+
Create the file needed for the neoantigen manuel review
270+
271+
optional arguments:
272+
-h, --help show this help message and exit
273+
-a A The path to the ITB Reviewed Candidates
274+
-c C The path to annotated_filtered.vcf-pass-51mer.fa.manufacturability.tsv from the generate_protein_fasta script
275+
-samp SAMP The name of the sample
276+
-WB WB the path to the gcp_immuno folder of the trial you wish to tun script on, defined as WORKING_BASE in envs.txt
277+
-f FIN_RESULTS, --fin_results FIN_RESULTS
278+
Name of the final results folder in gcp immuno
279+
root@02d92932f0f8:/# python3 /opt/scripts/color_peptides51mer.py --help
280+
usage: color_peptides51mer.py [-h] -p P -classI CLASSI -classII CLASSII [-WB WB] [-samp SAMP]
281+
282+
Color the 51mer peptide
283+
284+
optional arguments:
285+
-h, --help show this help message and exit
286+
-p P The path to the Peptides 51 mer
287+
-classI CLASSI The path to the classI all_epitopes.aggregated.tsv used in pVACseq
288+
-classII CLASSII The path to the classII all_epitopes.aggregated.tsv used in pVACseq
289+
-WB WB the path to the gcp_immuno folder of the trial you wish to tun script on, defined as WORKING_BASE in envs.txt
290+
-samp SAMP Name of the sample
291+
```
292+
293+
## Bold Class II
294+
295+
Bold Class II is not utilized in the current workflow of setting up the manual review. However, it is included as an example of how adding stylization (in this case bold) to certain characters within individual cells of spreadsheets can be accomplished using BeautifulSoup. If you wanted to only bold certain characters (or do any stylzing such as coloring), you can insert a style tag directly into the HTML. However, if you wanted to do formatting inside formatting such as in these review files where there needs to some characters which are red, bold, underlined, or any combination if thise mentioned, BeuaitfulSoup cannot accomplish this.
296+
297+
```bash
298+
python3 /opt/scripts/bold_classII.py --help
299+
usage: bold_classII.py [-h] -p P -classI CLASSI -classII CLASSII -o O
300+
301+
Bold the class II pepetides
302+
303+
optional arguments:
304+
-h, --help show this help message and exit
305+
-p P The path to the Peptides 51 mer
306+
-classI CLASSI The path to the classI all_epitopes.aggregated.tsv used in pVACseq
307+
-classII CLASSII The path to the classII all_epitopes.aggregated.tsv used in pVACseq
308+
-o O Output location
309+
310+
```

0 commit comments

Comments
 (0)