|
1 | 1 | # snakemake-crispr-guides |
2 | 2 |
|
3 | 3 | [](https://snakemake.github.io) |
4 | | -[](https://github.com/MPUSP/snakemake-crispr-guides/actions/workflows/main.yml) |
| 4 | +[](https://github.com/MPUSP/snakemake-crispr-guides/actions/workflows/snakemake-tests.yml) |
5 | 5 | [](https://docs.conda.io/en/latest/) |
6 | 6 | [](https://sylabs.io/docs/) |
7 | 7 | [](https://snakemake.github.io/snakemake-workflow-catalog/docs/workflows/MPUSP/snakemake-crispr-guides.html) |
@@ -31,9 +31,9 @@ A Snakemake workflow for the design of small guide RNAs (sgRNAs) for CRISPR appl |
31 | 31 |
|
32 | 32 | The usage of this workflow is described in the [Snakemake Workflow Catalog](https://snakemake.github.io/snakemake-workflow-catalog/docs/workflows/MPUSP/snakemake-crispr-guides.html). |
33 | 33 |
|
34 | | -Detailed information about input data and workflow configuration can also be found in the [`config/README.md`](config/README.md). |
| 34 | +Detailed information about input data and workflow configuration can be found in the [`config/README.md`](config/README.md). |
35 | 35 |
|
36 | | -If you use this workflow in a paper, don't forget to give credits to the author(s) by citing the URL of this (original) repository and its DOI (see above). |
| 36 | +If you use this workflow in a paper, don't forget to give credits to the author(s) by citing the URL of this repository, the release, and its DOI if available. |
37 | 37 |
|
38 | 38 | ## Workflow overview |
39 | 39 |
|
@@ -108,47 +108,7 @@ Important requirements when using custom `*.fasta` and `*.gff` files: |
108 | 108 |
|
109 | 109 | ### Parameters |
110 | 110 |
|
111 | | -This table lists all parameters that can be used to run the workflow. |
112 | | - |
113 | | -| parameter | type | details | default | |
114 | | -| ---------------------- | ------- | ---------------------------------------------- | --------------------------------- | |
115 | | -| GET_GENOME | | | | |
116 | | -| database | string | one of `ncbi`, `manual` | `ncbi` | |
117 | | -| assembly | string | RefSeq ID | `GCF_000006945.2` | |
118 | | -| fasta | path | optional input | `Null` | |
119 | | -| gff | path | optional input | `Null` | |
120 | | -| gff_source_type | list | allowed source types in GFF file | `'RefSeq': 'gene', ...` | |
121 | | -| DESIGN_GUIDES | | | | |
122 | | -| target_region | numeric | use subset of regions for testing | `["NC_003277.2"]` | |
123 | | -| target_type | string | specify targets for guide design (see below) | `["target", "intergenic", "ntc"]` | |
124 | | -| tss_window | numeric | upstream/downstream window around TSS | `[0, 500]` | |
125 | | -| tiling_window | numeric | window size for intergenic regions | `1000` | |
126 | | -| tiling_min_dist | numeric | min distance between TSS and intergenic region | `0` | |
127 | | -| circular | logical | is the genome circular? | `False` | |
128 | | -| canonical | logical | only canonical PAM sites are included | `True` | |
129 | | -| strands | string | target `coding`, `template` or `both` | `both` | |
130 | | -| spacer_length | numeric | desired length of guides | `20` | |
131 | | -| guide_aligner | string | one of `biostrings`, `bowtie` | `biostrings` | |
132 | | -| crispr_enzyme | string | CRISPR enzyme ID | `SpCas9` | |
133 | | -| score_methods | string | see _crisprScore_ package | default scores are listed below | |
134 | | -| score_weights | numeric | opt. weights when calculating mean score | `[1, 1, 1, 1, 1, 1]` | |
135 | | -| restriction_sites | string | sequences to omit in entire guide | `Null` | |
136 | | -| bad_seeds | string | sequences to omit in seed region | `["ACCCA", "ATACT", "TGGAA"]` | |
137 | | -| no_target_controls | numeric | number of non targeting guides (neg. controls) | 100 | |
138 | | -| FILTER_GUIDES | | | | |
139 | | -| filter_best_per_gene | numeric | max number of guides to return per gene | `10` | |
140 | | -| filter_best_per_tile | numeric | max number of guides to return per ig/tile | `10` | |
141 | | -| filter_score_threshold | numeric | mean score to use as lower limit | `Null` | |
142 | | -| filter_multi_targets | logical | remove guides that perfectly match >1 target | `True` | |
143 | | -| filter_rna | logical | remove guides that target e.g. rRNA or tRNA | `True` | |
144 | | -| gc_content_range | numeric | range of allowed GC content | `[30, 70]` | |
145 | | -| fiveprime_linker | string | optionally add 5' linker to each guide | `Null` | |
146 | | -| threeprime_linker | string | optionally add 3' linker to each guide | `Null` | |
147 | | -| export_as_gff | logical | export result table to `.gff` file | `True` | |
148 | | -| export_as_fasta | logical | export result table to `.fasta` file | `True` | |
149 | | -| REPORT | | | | |
150 | | -| show_examples | numeric | number of genes to show guide position | `10` | |
151 | | -| show_genomic_range | numeric | genome start and end pos to show tiling guides | `[0, 50000]` | |
| 111 | +Detailed information about input data and workflow configuration can be found in the [`config/README.md`](config/README.md). |
152 | 112 |
|
153 | 113 | ### Target type |
154 | 114 |
|
@@ -238,10 +198,10 @@ The workflow generates the following output from its modules: |
238 | 198 |
|
239 | 199 | ## Authors |
240 | 200 |
|
241 | | -- The custom `snakemake`, `R`, `R markdown`, and `python` scripts were written by Michael Jahn, PhD |
242 | | -- Affiliation: [Max-Planck-Unit for the Science of Pathogens](https://www.mpusp.mpg.de/) (MPUSP), Berlin, Germany |
243 | | -- Visit the MPUSP github page at https://github.com/MPUSP for info on this workflow and other projects |
244 | | -- Visit the author's github page at https://github.com/m-jahn for info on other projects |
| 201 | +- Dr. Michael Jahn |
| 202 | + - Affiliation: [Max-Planck-Unit for the Science of Pathogens](https://www.mpusp.mpg.de/) (MPUSP), Berlin, Germany |
| 203 | + - ORCID profile: https://orcid.org/0000-0002-3913-153X |
| 204 | + - github page: https://github.com/m-jahn |
245 | 205 |
|
246 | 206 | ## License |
247 | 207 |
|
|
0 commit comments