Fast json reading for msgs and events #2228

ekin-aisi · 2025-08-06T16:38:19Z

This PR contains:

New features

What is the current behavior? (You can also link to an open issue here)

Reading large eval log files is slow, particularly when extracting the messages and events fields from samples. The current implementation runs Pydantic validation on every field, which adds significant overhead.

What is the new behavior?

Added a read_eval_log_as_json() function that:

Bypasses Pydantic validation entirely for faster field extraction

Uses multiprocessing to parallelize reading of sample files from eval-formatted log files

Provides a 10-20x speedup when extracting the messages and events fields from large eval logs
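The approach described above can be sketched roughly as follows. This is an illustration only, not the code in this PR: extract_sample_fields and _read_sample are hypothetical names, and the sketch assumes the eval log is a zip archive whose per-sample JSON files live under a samples/ prefix. It uses only the standard library and skips Pydantic entirely by parsing each sample with json.loads.

```python
import json
import zipfile
from concurrent.futures import ProcessPoolExecutor
from typing import Any, Optional, Sequence


def _read_sample(task: tuple) -> dict:
    """Parse one sample with plain json and keep only the requested fields."""
    log_path, member, fields = task
    # Each worker opens the archive itself: ZipFile handles cannot be pickled.
    with zipfile.ZipFile(log_path) as zf:
        sample = json.loads(zf.read(member))
    # No model validation happens here; values are raw dicts and lists.
    return {name: sample.get(name) for name in fields}


def extract_sample_fields(
    log_path: str,
    fields: Sequence[str] = ("messages", "events"),
    max_workers: Optional[int] = None,
) -> list:
    """Hypothetical helper: read selected fields from every sample in a log.

    Assumes sample files are stored as samples/*.json inside a zip archive.
    """
    with zipfile.ZipFile(log_path) as zf:
        members = sorted(
            name
            for name in zf.namelist()
            if name.startswith("samples/") and name.endswith(".json")
        )
    tasks = [(log_path, member, tuple(fields)) for member in members]

    if max_workers == 1:
        # Serial path: avoids process start-up cost for small logs.
        return [_read_sample(task) for task in tasks]

    # Parallel path: one task per sample file, results in member order.
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(_read_sample, tasks))
```

Because the results are plain dicts rather than validated models, malformed samples are not caught at read time. On platforms that start worker processes with spawn, the parallel path should be called from under an if __name__ == "__main__": guard.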

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

No breaking changes. This adds a new optional function alongside existing functionality. Users can continue using the validated read_eval_log() when full validation is needed, or opt into read_eval_log_as_json() when performance is critical and validation can be skipped.

ekin-aisi added 3 commits August 6, 2025 16:27

json reading for msgs and events

5281c3c

add TypeAdapter version

4272931

change types

109751c

2 participants