Skip to content

Commit 9ffe200

Browse files
committed
- Removing Klebsiella_k_locus_variant_reference.gbk.
- Updating docs to reflect this. - Correcting confidence criteria in docs.
1 parent b94e596 commit 9ffe200

File tree

5 files changed

+14
-19360
lines changed

5 files changed

+14
-19360
lines changed

docs/source/Databases.rst

Lines changed: 2 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -187,12 +187,6 @@ The *Klebsiella* K locus primary reference database (``Klebsiella_k_locus_primar
187187
Synthetic IS-free K locus sequences were generated for K loci for which no naturally occurring IS-free variants have
188188
been identified to date.
189189

190-
The variants database (``Klebsiella_k_locus_variant_reference.gbk``) comprises full-length annotated sequences for
191-
variants of the distinct loci:
192-
193-
* IS variants are named as KLN -1, -2 etc e.g. KL15-1 is an IS variant of KL15.
194-
* Deletion variants are named KLN-D1, -D2 etc e.g. KL15-D1 is a deletion variant of KL15.
195-
196190
.. note::
197191
KL156-D1 is included in the primary reference database since no full-length version of this locus has been
198192
identified to date.
@@ -201,7 +195,8 @@ We recommend screening your data with the primary reference database first to fi
201195
have poor matches or are particularly interested in detecting variant loci you should try the variant database.
202196

203197
.. warning::
204-
We do not currently recommend using the variant database with Kaptive 3. It is included in the repository for completeness.
198+
The variants database (``Klebsiella_k_locus_variant_reference.gbk``) has been retired as of ``v3.0.0b6`` as it's no
199+
longer actively maintained and results can be misleading without additional in depth analysis.
205200

206201
Database versions:
207202

docs/source/Interpreting-the-results.rst

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -26,16 +26,16 @@ Typeable loci
2626
The locus was found in a single piece in the query assembly with no genes below the minimum translated identity
2727
according to the :ref:`locus thresholds <Locus definition>` and:
2828

29-
* no missing genes (as determined by at least 50% nucleotide mapping coverage)
30-
* no unexpected genes (genes from other loci) inside the locus region of the assembly
29+
* no missing genes
30+
* no (non-truncated) unexpected genes (genes from other loci) inside the locus region of the assembly
3131

3232
**OR**
3333

3434
The locus was found in more than one piece in the query assembly with no genes below the minimum translated identity
3535
according to the :ref:`locus thresholds <Locus definition>` and:
3636

37-
* no more than 1 missing gene
38-
* no more than 1 unexpected gene (genes from other loci) inside the locus region of the assembly
37+
* no less than N% missing genes (default: 50%)
38+
* no more than N unexpected gene (genes from other loci) inside the locus region of the assembly (default: 1)
3939

4040
These criteria were designed in consideration of the locus definition rules (i.e. that each locus represents a unique set of genes defined at a given minimum translated identity threshold) and following systematic analysis of Kaptive outputs for draft genome assemblies compared against manually confirmed loci determined from matched completed genomes.
4141

@@ -57,7 +57,7 @@ Problems
5757
* ``-`` = genes expected in the locus were not found.
5858
* ``+`` = extra genes were found in the locus.
5959
* ``*`` = one or more expected genes was found but with translated identity below the minimum threshold.
60-
* ``!`` = one or more genes was found but truncated
60+
* ``!`` = one or more genes was found but truncated.
6161

6262

6363
Exploring the other columns

docs/source/Method.rst

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -60,6 +60,12 @@ After the best matching locus type has been identified, Kaptive will:
6060
and each alignment represents the best gene for that part of the contig.
6161
#. For each remaining gene alignment, the gene is then evaluated.
6262

63+
.. note::
64+
You may see multiple entries for the same gene in the gene details columns. This can happen when there are
65+
multiple significant alignments for a gene that has been split, either over contigs or by an artefact such as
66+
and insertion element. This will be evident by the low percent coverage of the entries, which should all add up
67+
to ~100% if they are indeed part of the same gene.
68+
6369
.. _Gene-evaluation:
6470

6571
Gene evaluation

docs/source/Usage.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -120,7 +120,7 @@ See database options :ref:`here <Database-options>` and other options::
120120
-V, --verbose Print debug messages to stderr
121121
-v , --version Show version number and exit
122122
-h , --help Show this help message and exit
123-
-t , --threads Number of threads for alignment (default: maximum available CPUs)
123+
-t , --threads Number of threads for alignment (default: maximum available CPUs / 32)
124124

125125
.. _kaptive-convert:
126126

0 commit comments

Comments
 (0)