This repository was archived by the owner on Dec 12, 2024. It is now read-only.
ARGs and SPRs #77
jeromekelleher
started this conversation in
General
Replies: 2 comments 2 replies
-
I'm not sure what we should do with this in terms of the paper, but I thought it was interesting and worth recording here. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Thinking about this a bit more, I think this corresponds to having to backtrack through an arbitrary number of nodes in the graph, if you're following the Griffiths traversal rules for generating trees. So it's surely an unavoidable property of working with this graph. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I was interested to see what the relationship between ARGs in the various states of simplification and SPRs is. The result is interesting I think.
Full/implicit ARG
Consider the "full" ARG first:
which gives us:
So, it takes us 15 edge insertions to build the first tree, and then we only insert and remove one edge for each tree transition (SPR). For example, in the first transition we remove the edge
(11, 12)
and insert(11, 13)
. This is straightforwardly the consequence of the recombination that occured on node 11, with left_parent 12 and right_parent 13.So, I think it's probably true that every tree transition requires exactly 1 edge change, but the first tree is very large because of all the non ancestral topology that we're maintaining.
Reduced/explicit ARG
Consider a different example in which we get rid of the non-ancestral topology:
Consider the first transition, in which node 8 switches from left parent 18 to right parent 19 (that is a child of 22). For this transition we need to remove five edges and to insert two. We need to remove 5 edges to get rid of the unary chain from 18 (which is no longer ancestral) up to 26.
I think this means that we can have arbitrarily many diffs between neighbouring trees, if we're working with this resolved graph.
Fully simplified
We know from Kelleher et al 2016 (Fig 5) that the number of edges changing (in and out) between two adjacent trees is 2 (modulo changing from the coalescence record to edge format).
Beta Was this translation helpful? Give feedback.
All reactions