
Improved ExecutionPlan/TermGraph performance #289

Open
everling wants to merge 4 commits into stefan-jansen:main from everling:planning-perf

Conversation

@everling

Hi,
For sufficiently large pipelines, the execution-planning stage becomes a bottleneck. I've made three changes in zipline/pipeline/graph.py that reduce redundant operations and improve the time complexity of graph construction and extra-rows planning.

Changes:

  • Graph nodes are added only once (_add_to_graph): a term that is already present in the graph is skipped instead of being re-traversed. The cyclic dependency check should still work, since a fully inserted subgraph is already known to be acyclic.
  • _ensure_extra_rows no longer materializes dict(self.graph.nodes())[term] on every call; the dict is initialized once and reused.
  • set_extra_rows stops early if the term has already been ensured for at least as many rows as the current min_extra_rows parameter, so its dependencies are not re-visited. (A sketch of all three changes follows this list.)
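
For illustration, here is a minimal sketch of the three ideas, not the actual diff: the method names mirror zipline/pipeline/graph.py, but the class skeleton, the simplified signatures, and the _node_attrs cache name are stand-ins I introduce here.

import networkx as nx


class CyclicDependency(Exception):
    """Stand-in for the exception raised by the real module."""


class TermGraphSketch:
    def __init__(self, terms):
        self.graph = nx.DiGraph()
        for term in terms:
            self._add_to_graph(term, parents=set())
        # Change 2: materialize the term -> attributes mapping once.
        # dict(self.graph.nodes()) is O(V), so rebuilding it inside every
        # _ensure_extra_rows call made each lookup linear in the graph size.
        self._node_attrs = dict(self.graph.nodes(data=True))

    def _add_to_graph(self, term, parents):
        if term in self.graph:
            # Change 1: a term whose subtree was already inserted is skipped.
            # A fully inserted subtree is already known to be acyclic, so the
            # early return does not weaken the cycle check below.
            return
        if term in parents:
            raise CyclicDependency(term)
        parents.add(term)
        self.graph.add_node(term)
        for dependency in term.dependencies:
            self._add_to_graph(dependency, parents)
            self.graph.add_edge(dependency, term)
        parents.remove(term)

    def _ensure_extra_rows(self, term, n):
        # dict(...) above keeps references to the graph's live attribute
        # dicts, so mutating attrs here updates the graph itself.
        attrs = self._node_attrs[term]
        attrs["extra_rows"] = max(n, attrs.get("extra_rows", 0))

    def set_extra_rows(self, term, min_extra_rows):
        # Change 3: if this term already carries at least min_extra_rows, its
        # dependencies were already visited with an equal-or-larger
        # requirement, so the recursion can stop here.
        if self._node_attrs[term].get("extra_rows", -1) >= min_extra_rows:
            return
        self._ensure_extra_rows(term, min_extra_rows)
        for dependency, extra in term.dependencies.items():
            self.set_extra_rows(dependency, min_extra_rows + extra)

The early return in _add_to_graph is the key change: it turns the traversal from one visit per root-to-leaf path (exponential in depth for the benchmark DAG below) into one visit per node and edge.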

I'm pretty sure functionality remains intact, but it would be great if someone could have a look, as the performance gains are tremendous. I've compared pipeline execution times across commits and pipeline depths:

[Figure: pipeline execution time vs. pipeline depth, compared across commits]

The pipeline is a simple branching DAG:

import time

import numpy as np
import pandas as pd
import sqlalchemy as sa

from zipline.assets import AssetDBWriter, AssetFinder
from zipline.pipeline import Pipeline, SimplePipelineEngine
from zipline.pipeline.data import USEquityPricing
from zipline.pipeline.domain import US_EQUITIES
from zipline.pipeline.loaders.frame import DataFrameLoader

# make dummy asset metadata in an in-memory SQLite database
db_engine = sa.create_engine("sqlite:///:memory:")
writer = AssetDBWriter(db_engine)

asset_start = pd.Timestamp("2000-01-01", tz='UTC')
asset_end = pd.Timestamp("2050-12-31", tz='UTC')

equities_data = pd.DataFrame(
    {
        "sid": [1, 2],
        "symbol": ["AAPL", "GOOG"],
        "asset_name": ["Apple Inc.", "Alphabet Inc."],
        "start_date": [asset_start, asset_start],
        "end_date": [asset_end, asset_end],
        "first_traded": [asset_start, asset_start],
        "exchange": ["NYSE", "NASDAQ"],
        "security_end_date": [asset_end, asset_end],
    }
)
exchanges_data = pd.DataFrame(
    {
        "exchange": ["NYSE", "NASDAQ"],
        "country_code": ["US", "US"],
    }
)
writer.write(
    equities=equities_data,
    exchanges=exchanges_data,
)

dates = US_EQUITIES.calendar.sessions_in_range("2000", "2026")
# baseline columns must be the sids written above, not the frame's positional index
baseline = pd.DataFrame(
    index=dates,
    columns=equities_data["sid"],
    data=np.random.random((len(dates), len(equities_data))),
)
frame_loader = DataFrameLoader(column=USEquityPricing.close, baseline=baseline)

def get_dummy_loader(column):
    # every pipeline column resolves to the same in-memory loader
    return frame_loader

asset_finder = AssetFinder(db_engine)
engine = SimplePipelineEngine(get_loader=get_dummy_loader, asset_finder=asset_finder)
start_date = pd.Timestamp("2022-01-03")
end_date = pd.Timestamp("2024-01-03")
stats = {}

# make increasingly deep pipelines; each level adds a filter (out > r), a
# halved branch (out / 2), and an if_else over the shared previous output,
# so the number of distinct paths through the DAG grows exponentially with depth
for depth in range(1, 13):
    out = USEquityPricing.close.latest
    for i in range(depth):
        out = (out > np.random.random()).if_else(out, out / 2)
    p = Pipeline(columns={"out": out})
    start = time.time()
    df = engine.run_pipeline(p, start_date, end_date)
    stats[depth] = time.time() - start

print(stats)

@everling
Author

everling commented Aug 14, 2025

@stefan-jansen I have been running zipline-reloaded with these edits for quite some time and haven't noticed any issues.
