feat(source-datagen): add wide schema flavor and fix bugs#75542
feat(source-datagen): add wide schema flavor and fix bugs#75542sophiecuiy wants to merge 2 commits intomasterfrom
Conversation
… count Add a new "wide" flavor that generates a configurable number of columns (1-1000, default 50) cycling through all 12 Airbyte data types. Also fix unsafe !! null assertions in DataGenPartitionReader and cache codec references in TypesDataGenerator to reduce per-record overhead. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
Note 📝 PR Converted to Draft More info...Thank you for creating this PR. As a policy to protect our engineers' time, Airbyte requires all PRs to be created first in draft status. Your PR has been automatically converted to draft status in respect for this policy. As soon as your PR is ready for formal review, you can proceed to convert the PR to "ready for review" status by clicking the "Ready for review" button at the bottom of the PR page. To skip draft status in future PRs, please include |
👋 Greetings, Airbyte Team Member!Here are some helpful tips and reminders for your convenience. 💡 Show Tips and TricksPR Slash CommandsAirbyte Maintainers (that's you!) can execute the following slash commands on your PR:
📚 Show Repo GuidanceHelpful Resources
|
|
…ndency Published CDK artifacts in the 0.x line are missing dependency metadata, so bulk-cdk-core-base (containing AirbyteSourceRunner, ConfigErrorException, OpaqueStateValue, etc.) was not resolved transitively via core-extract. Version 1.0.1 includes proper Gradle module metadata that declares core-base as a transitive dependency. Also fix field.name -> field.id in WideDataGenerator to match the Field data class API. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
Note Detected that there are differences in the Gradle dependencies. |
|
/format-fix
|
Summary
integer,string,boolean,number,big_integer,big_decimal,date,time_with_tz,time_without_tz,timestamp_with_tz,timestamp_without_tz,json). Column 0 is alwaysid(primary key).!!assertions: Replaced two!!non-null assertions inDataGenPartitionReaderwith safe alternatives —mapNotNullfor resource filtering and an?: throw IllegalStateExceptionwith a clear error message for the record acceptor lookup.ascodec casts inTypesDataGeneratorto class-level properties, reducing per-record overhead.Test plan
DataGenStreamState.kthas a pre-existing compilation error on master due to unresolvedOpaqueStateValue— unrelated to this PR){"flavor": {"data_type": "wide", "column_count": 20}, "max_records": 10}and verify 20 columns appear with correct type cyclingincrementandtypesflavors still work as expectedcolumn_count=1(id only),column_count=1000🤖 Generated with Claude Code