I'm working on a plugin (https://github.com/JaoMarcos/data_designer_lambda_column) and ran into an issue with the strict validation in the DatasetBatchManager.update_records function: it currently enforces that the number of incoming records matches the current buffer size.
DataDesigner/packages/data-designer-engine/src/data_designer/engine/dataset_builders/utils/dataset_batch_manager.py, line 194 (commit 184348a)
The Use Case
I need to support cases where a single input record produces multiple output records (1:N), essentially "exploding" the dataframe, as in the sketch below.
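For concreteness, a minimal pandas sketch of the 1:N shape I mean (the column names are made up for illustration):

```python
import pandas as pd

# One input record fans out into several output records (1:N).
df = pd.DataFrame({"topic": ["python decorators"], "context": ["<large shared context>"]})

# Suppose a generation step returns 5 variations for the single input row.
df["variation"] = [["v1", "v2", "v3", "v4", "v5"]]

# explode() turns the list column into one row per variation,
# so 1 input record becomes 5 output records.
exploded = df.explode("variation", ignore_index=True)
print(len(exploded))  # 5 -- no longer equal to the original buffer size of 1
```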
The main driver for this is cost and efficiency with LLMs. For complex prompts with large input contexts, if I need multiple variations (e.g., "Generate 5 variations of X"), it is significantly cheaper and faster to ask the model to generate all 5 in a single API call rather than making 5 separate calls with the same large input.
Generating them in a single pass also often improves quality/variance, as the model has "in-context" awareness of the other variations it is generating, preventing duplicates.
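As a sketch of the batching I'm describing, with a stubbed call_llm standing in for the plugin's real client (both names are hypothetical):

```python
import json

def call_llm(prompt: str) -> str:
    # Stub standing in for a real chat-completion call; in the plugin this
    # would be a single request that carries the large context only once.
    return json.dumps([f"variation {i + 1}" for i in range(5)])

def generate_variations(record: dict, n: int = 5) -> list[dict]:
    # One call asking for all n variations, instead of n calls that each
    # resend the same large input context.
    prompt = (
        f"Context:\n{record['context']}\n\n"
        f"Generate {n} distinct variations of: {record['topic']}\n"
        "Return a JSON array of strings."
    )
    return [{**record, "variation": v} for v in json.loads(call_llm(prompt))]

rows = generate_variations({"topic": "python decorators", "context": "<large shared context>"})
print(len(rows))  # 5 output records from 1 input record and 1 API call
```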
Question
What is the best way to handle this in DatasetBatchManager?
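To make the question concrete, this is the rough shape of the relaxation I'm imagining; everything here (the signature, the allow_explode flag) is invented for illustration and is not the actual DataDesigner code:

```python
# Purely hypothetical sketch, not the real update_records at line 194.
def update_records(buffer: list[dict], records: list[dict], allow_explode: bool = False) -> list[dict]:
    if not allow_explode and len(records) != len(buffer):
        raise ValueError(f"Expected {len(buffer)} records, got {len(records)}")
    # With allow_explode=True, a 1:N column may legitimately grow the batch.
    return records
```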
Replies: 1 comment

Looks like @andreatgretel has started looking into this as part of issue #265! Please feel free to continue the discussion here or as part of the issue if you have any other questions / feedback 🙌