2.0.0
Major changes:
- GTDB-TK now uses a divide-and-conquer approach where the bacterial reference tree is split into multiple order-level subtrees. This reduces the memory requirements of GTDB-Tk from 320 GB of RAM when using the full GTDB R07-RS207 reference tree to approximately 35 GB. A manuscript describing this approach is in preparation. If you wish to continue using the full GTDB reference tree use the
--full-treeflag. - Archaeal classification now uses a refined set of 53 archaeal-specific marker genes based on the recent publication by Dombrowski et al., 2020. This set of archaeal marker genes is now used by GTDB for curating the archaeal taxonomy.
- By default, all directories containing intermediate results are now removed by default at the end of the
classify_wfandde_novo_wfpipelines. If you wish to retain these intermediates files use the--keep-intermediatesflag. - All MSA files produced by the
alignstep are now compressed with gzip. - The classification summary and failed genomes files are now the only files linked in the root directory of
classify_wf.
Features:
convert_to_itolto convert trees into iTOL format (#373)- Output FASTA files are compressed by default (#369)
- Intermediate files will be removed by default when using classify/de-novo workflows unless specified by
--keep_intermediates(#369) - Add --genes flag for Error (#362)
- A warning will be displayed if pplacer fails to place a genome (#360 / #356)
Important
- This version is not backwards compatible with GTDB release 202.
- This version requires a new reference package