Using an approximation for the calculate_edit_distance for scalability #58

ai-symphony · 2025-06-06T04:39:03Z

Improving performance when calculating the edit distances of large programs (i.e. 3K+ characters).
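For illustration only, here is a minimal sketch of what such an approximation could look like. The approximation this PR actually uses is not shown in this thread. The helper names, the 3000-character threshold, and the fallback to a line-level `difflib` estimate are all assumptions made for this example.

```python
import difflib


def exact_edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance; O(len(a) * len(b)) time."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion
                    current[j - 1] + 1,      # insertion
                    previous[j - 1] + cost,  # substitution
                )
            )
        previous = current
    return previous[-1]


def approx_edit_distance(a: str, b: str, threshold: int = 3000) -> float:
    """Hypothetical helper: exact distance for short inputs, estimate for long ones.

    Above `threshold` characters the programs are compared line by line with
    difflib.SequenceMatcher, and the dissimilarity is scaled by the longer
    length. This is an estimate, not a true edit distance.
    """
    if max(len(a), len(b)) < threshold:
        return float(exact_edit_distance(a, b))
    matcher = difflib.SequenceMatcher(
        None, a.splitlines(), b.splitlines(), autojunk=False
    )
    return (1.0 - matcher.ratio()) * max(len(a), len(b))
```

The trade-off in this sketch is that long programs are compared at line granularity, so small intra-line edits count as a whole changed line.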

CLAassistant · 2025-06-06T04:39:09Z

All committers have signed the CLA.

benjaminy · 2025-06-16T21:39:50Z

Is this PR waiting on something? The edit distance calculation can be an annoying bottleneck.

ai-symphony · 2025-06-16T22:01:58Z

> Is this PR waiting on something? The edit distance calculation can be an annoying bottleneck.

I don't see an option to Merge this. Can someone with permissions please do this for me? @codelion, @jvm123, @DavyMorgan

codelion · 2025-06-17T03:39:20Z

This is fixed in the https://github.com/codelion/openevolve/tree/feat/MLX-kernel-optimization branch. I will merge that into main soon.

codelion · 2025-06-19T07:00:20Z

This should be fixed in main now.

ai-symphony · 2025-06-20T06:13:25Z

> This should be fixed in main now.

Looks like calculate_edit_distance is still used here: https://github.com/codelion/openevolve/blob/656e153b8ce1e74dbb0af80ca75fdc62cf545d70/openevolve/database.py#L560
Will this also be fixed?

codelion · 2025-06-20T06:34:59Z

> Looks like calculate_edit_distance is still used here:

Yes, but now we only sample up to 5 programs, so it doesn't take that long to calculate. https://github.com/codelion/openevolve/blob/656e153b8ce1e74dbb0af80ca75fdc62cf545d70/openevolve/database.py#L556

I am open to moving to a more efficient edit distance calculation, but in my experiments it was necessary to restrict the number of programs anyway, as it can get quite large.
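As a rough sketch of the sampling idea described above (not the actual code in database.py), the diversity estimate can be computed over a small random sample so the number of pairwise distance calls stays bounded. The function name, its parameters, and the choice to average over all pairs in the sample are assumptions for illustration.

```python
import random
from itertools import combinations
from typing import Callable, Optional, Sequence


def sampled_diversity(
    programs: Sequence[str],
    distance: Callable[[str, str], float],
    max_sample: int = 5,
    rng: Optional[random.Random] = None,
) -> float:
    """Mean pairwise distance over at most `max_sample` randomly chosen programs."""
    if len(programs) < 2:
        return 0.0  # diversity is undefined for fewer than two programs
    rng = rng or random.Random()
    sample = rng.sample(list(programs), min(max_sample, len(programs)))
    pairs = list(combinations(sample, 2))
    # With max_sample = 5 this is at most 10 distance evaluations,
    # regardless of how many programs are stored.
    return sum(distance(a, b) for a, b in pairs) / len(pairs)
```

With a cap of 5, the cost is bounded by 10 distance calls per estimate, so the per-call cost of the distance function matters much less than the population size would suggest.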

benjaminy · 2025-06-20T12:14:32Z

Sorry if this is too off-topic, but I'm curious what the thinking is regarding edit distance. I guess the point is to approximate the behavioral diversity of a group of programs. The edit distance of their code seems like at best a pretty rough estimate of that. Maybe a system with app-specific scores/traces/descriptions would be a better approximation?
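To make the suggestion concrete, here is a hedged sketch of a score-based distance. It assumes each program comes with a dictionary of app-specific numeric metrics; the function name and the metric keys are hypothetical and not part of the project.

```python
import math
from typing import Mapping


def behavioral_distance(
    scores_a: Mapping[str, float], scores_b: Mapping[str, float]
) -> float:
    """Euclidean distance over the metrics that both programs report."""
    shared = sorted(set(scores_a) & set(scores_b))
    if not shared:
        return math.inf  # nothing comparable, so treat the pair as maximally distant
    return math.sqrt(sum((scores_a[k] - scores_b[k]) ** 2 for k in shared))
```

Two programs with very different source text but identical scores would have distance 0 under this measure, which is the opposite failure mode from edit distance. In practice the metrics would likely need normalizing to comparable ranges.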

codelion · 2025-06-20T13:02:14Z

> The edit distance of their code seems like at best a pretty rough estimate of that. Maybe a system with app-specific scores/traces/descriptions would be a better approximation?

Sure, the edit distance comes from other evolutionary algorithms, like the ones that target strings such as DNA strands. There can be other choices for diversity, so maybe we can experiment and benchmark with some examples to see what works better? The goal would be to reach the same best_program.py from the given initial_program.py faster for some of the examples. That would show convergence, and we could see whether a particular choice of diversity measure for programs makes a difference.

Using an approximation for the calculate_edit_distance for scalability

30605bd

ai-symphony closed this Jun 20, 2025

ai-symphony reopened this Jun 20, 2025

ai-symphony closed this Jun 22, 2025
