Every uproot.dask call increases memory footprint by 30 MB (it's in dask.base.function_cache) #1093

@lgray

Reproducer:

import uproot

for _ in range(200):
    uproot.dask({
        "https://github.com/CoffeaTeam/coffea/raw/master/tests/samples/nano_dy.root": "Events"
    })

This particular case leaks ~30 MB per open. That adds up very quickly if you need to extract the form of hundreds of files in a remote process, as seen in scikit-hep/coffea#1007, where this bug manifested pretty nastily.
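For anyone who wants to quantify the growth, here is a rough sketch of a measurement helper. The names `traced_growth_per_call` and `function_cache_len` are made up for illustration and are not part of uproot or dask. It relies on `tracemalloc`, which only sees allocations made through Python's allocator, so the numbers may not match the resident-memory figure quoted above. It also assumes `dask.base.function_cache` is a plain container in the installed dask version, and returns `None` if it is not there.

```python
import gc
import tracemalloc


def traced_growth_per_call(fn, repeats=20):
    """Average growth in traced Python memory (bytes) per call of fn."""
    gc.collect()
    tracemalloc.start()
    try:
        fn()  # warm-up call so one-time setup is not counted as a leak
        gc.collect()
        before, _ = tracemalloc.get_traced_memory()
        for _ in range(repeats):
            fn()
        gc.collect()
        after, _ = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return (after - before) / repeats


def function_cache_len():
    """Entries in dask.base.function_cache, or None if dask or the attribute is missing."""
    try:
        import dask.base
    except ImportError:
        return None
    # Assumption: the cache named in the issue title exists in this dask version.
    cache = getattr(dask.base, "function_cache", None)
    return None if cache is None else len(cache)


# Possible usage with the reproducer (needs uproot, dask and network access):
#
#   import uproot
#   url = "https://github.com/CoffeaTeam/coffea/raw/master/tests/samples/nano_dy.root"
#   print(function_cache_len())
#   print(traced_growth_per_call(lambda: uproot.dask({url: "Events"})))
#   print(function_cache_len())
```

If the cache length climbs by a fixed amount per call while the traced memory grows with it, that would be consistent with the cache holding on to per-call objects, though it does not prove it on its own.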

Metadata

Labels: performance (Works, but not fast enough or uses too much memory)
