Skip to content

Polyploidy Polyploidy Polyploidy #1742

@conJUSTover

Description

@conJUSTover

This is an initial issue to start the discussion around the logistic of implementing polyploidy into stdpopsim. Here are a few discussion points to kick around:

engine:
The SLIM script from Booker and Schrider 2024 is probably the best place to start. Right now, that script starts with a tetraploid population at gen0, and can include selection, various dominance relationships between the 3 heterozygous states, and transitions between auto/allo via controlling whether pairing occurs at random between 4 genome, or between main/spare (p1 and p2) populations of genomes.

Outputting data:
SLIM does not support vcf files with tetraploid SNP calls, although it is trivial to output a diploid VCF where two 'individuals' contain diploid genotype calls for the same tetraploid individual. Could create a quick wrapper script to combine diploid SNPs together to create tetraploid individuals instead of 2 diploid individuals (but should also consider diffs between auto/allo here, and whether the user might want to 'phase' genotype calls into subgenomes, even if they aren't fully phased into haplotypes)
I don't have experience with the tree sequencing recording for the tetraploid model, but my assumption is that it is similar to VCF output.

Published demographic models:

  1. Capsella bursa-pastoris (allo) from Blischak et al 2022, but includes HEs as a 'migration-like' parameter in dadi, so it may not transition into new SLiM modeling correctly (but that the only way so far to model HEs, so 🤷)
  2. Brassica napus, including interploidy introgression from Wang et al 2023
  3. Arabidopsis arenosa (auto) from Monnahan et al 2018

DFE models:
For allos, there is a basic modeling question of whether the DFE should be subgenome-specific (i.e. a diploid DFE) or model co-segregating variants between subgenomes (i.e. a tetraploid DFE). Does standard popsim support joint DFE? Does SLiM?
Diploid DFEs:

  1. Arabidopsis kamchatica from Paape et al 2018

Some additional potential features to build onto here:

  1. Transition from diploid to tetraploid, instead of starting with tetraploid in gen0. Could also include multiple polyploid formation events or interploidy introgression. I think mixed cytotypes (e.g. triploid that could create diploid or haploid gametes) might be too ambitious for now, but could be a cool implementation
  2. Do we need different scripts for autos vs allos? For more sound burn-in for allos, we'd also need to simulate the divergence of progenitor diploids (to capture shared diversity and fixed differences between subgenomes), but burn-in for autos could be done without diploid progenitor pop
  3. Add in double reduction for autos, homoeologous exchanges for allos, and homoeologous gene conversion for allos. None of these are natively supported by SLiM, so would need to be hard-coded and carefully validated. But, this is not trivial, since these steps would also require tracking centromeres and possibly revisiting how SLiM handles recombination (to allow for some chromatids that do not undergo recombination)

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions