-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Description
We seem to take an inordinately long time to read large newick files. If we are claiming to be able to be useful for phylogeneticists with big trees (like those produced from covid studies), I think we should be able to do this a lot more quickly. Here's an example on my laptop of a 30Mb tree with 2.3 million samples from https://files.opentreeoflife.org/synthesis/opentree13.4/output/labelled_supertree/index.html (see labelled_supertree.tre): it takes 16 minutes.
>>> start = datetime.datetime.now(); ts=tsconvert.newick.read("labelled_supertree.tre"); print(datetime.datetime.now() - start)
0:16:08.770515
It would be a good showcase to be able to do this in a few seconds (should be possible with disk IO at 30MB/s, right?)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels