-
Notifications
You must be signed in to change notification settings - Fork 32
Description
Hello developers,
thank you for the great tool!
I have been using chewBBACA to create cgMLST schemes for three different species and then validated the schemes by typing a few thousand genomes and calculating the percentage of loci present in those genomes. In both cases >99% of genomes contained >95% of loci, so I consider these schemes stable.
I now wanted to create a genus-wide cgMLST. The genus is quite diverse, so I expected a lower number of loci in the cgMLST95. When I tested the stability of the scheme, I found that 137 of 723 loci are in less than 50% of genomes. For some genomes that can be explained by the fact, that I did not include a genome of that exact species for schema creation. But even for species, for which I included several genomes less than 80% of loci are present.
Have you experienced something similar, when creating schemes for more diverse groups?
Thank you for your help!