Updates to recombinants_summary dataframe#508
Conversation
Used to be arbitrary, now is one of the samples that has be fewest mutations in its hmm copying path.
|
Two significant changes here @szhan and @hyanwong Changes the choice of representative sample for a recombinant to one of those that have the minimum number of mutations in the hmm match. Before this was done arbitrarily, and this may slightly change some results for the X lineages. We'll need to check Shing's hand-made figure carefully against the final data to make sure that no errors creeped in here. The second change is to be more systematic about the list of metadata fields included for each node of interest. We now have (e.g.): This is following most of the points raised in this issue: jeromekelleher/sc2ts-paper#468 Note: I've changed the "_time_tsdate" and "_date_tsdate" fields to just "_time" and "_date" as this more of a function of the tree sequence you run the function on than anything to do with tsdate. Hopefully this is all the metadata we need in the first pass, and we can then proceed with adding more columns later with results from HMM reruns and pangonet? |
No description provided.