Impact of molecular signal and population model assumptions on the estimation of node heights in time-resolved phylogenetic trees.
Time-calibrated phylogenetic trees provide critical insights into pathogen evolution, transmission, and epidemiology, but their accuracy depends on both the availability of mutational signal and the choice of coalescent priors. In this study, we investigated how assumptions about population size models affect estimates of node heights in dated phylogenies.
We derive analytic posteriors for the two-sequence tMRCA under constant size, exponential growth, exponential decline, and bottleneck models, and quantify prior influence using summary-based metrics such as mean shift, median shift, and mode shift, together with the Wasserstein-1 distance and the coalescent information ratio
This notebook contains the code for the first part of this project. The plot_population_size_impact.ipynb notebook summarizes all results obtained in this part. Code can be found in the src folder.