Skip to content

Description of the ntSynt algorithm

Lauren Coombe edited this page Apr 24, 2025 · 1 revision

Main steps in the algorithm:

  1. Generate ordered minimizer sketches for each of the input genome assemblies
  • By default, the minimizer sketches will be generated using a "common" Bloom filter, which contains all the k-mers that are in common between each assembly
  • Only minimizers from k-mers that are found in this common Bloom filter will be included in the output sketch
  1. Filter the minimizers to retain only those that are found in all input assemblies and found in single copy in each assembly
  2. Build an initial minimizer graph, where the nodes are minimizers and there are edges between adjacent minimizers
  3. Run graph simplification to remove simple tangles in the graph
  4. Find linear paths through the graph, and compute the initial synteny blocks
  5. Compute minimizer sketches for regions not covered by the initial synteny blocks using a lower window size, and augment the minimizer graph
  6. Repeat the graph simplification and filtering steps, and output refined synteny blocks. Repeat steps 6-7 for each desired lower window size.
  7. Merge collinear synteny blocks within the specified range
  8. Output the final synteny blocks

Clone this wiki locally