Importing large newick files

We seem to take an inordinately long time to read large newick files. If we are claiming to be able to be useful for phylogeneticists with big trees (like those produced from covid studies), I think we should be able to do this a lot more quickly. Here's an example on my laptop of a 30Mb tree with 2.3 million samples from https://files.opentreeoflife.org/synthesis/opentree13.4/output/labelled_supertree/index.html (see [labelled_supertree.tre](https://files.opentreeoflife.org/synthesis/opentree13.4/output/labelled_supertree/labelled_supertree.tre)): it takes 16 minutes.

```
>>> start = datetime.datetime.now(); ts=tsconvert.newick.read("labelled_supertree.tre"); print(datetime.datetime.now() - start)
0:16:08.770515
```

It would be a good showcase to be able to do this in a few seconds (should be possible with disk IO at 30MB/s, right?)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Importing large newick files #36

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Importing large newick files #36

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions