Changes from 32 commits
34 commits
83ebf36
feat: mysql chunking optimization
saksham-datazip Jan 27, 2026
f5766f8
mysql optimization comment resolve
saksham-datazip Jan 27, 2026
443cf94
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Jan 27, 2026
6fc574c
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 2, 2026
c09aee8
chore: formatting fix
saksham-datazip Feb 3, 2026
53520de
my-sql-chunking-formatting-resolved
saksham-datazip Feb 3, 2026
3b9fbe7
mysql-chunking-self-reviewed
saksham-datazip Feb 3, 2026
8e4ba6a
mysql-chunking-optimization-for-string-pk
saksham-datazip Feb 7, 2026
1707ae1
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 7, 2026
feca5a0
Merge branch 'staging' into feat/mysql-chunking-optimization
vaibhav-datazip Feb 9, 2026
ccfb371
feat: solved lint issue
saksham-datazip Feb 9, 2026
fe4b4b2
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 10, 2026
910246a
feat: mysql chunking optimization review resolved
saksham-datazip Feb 10, 2026
1eacf5a
feat: resolving-lint-extra-spaces
saksham-datazip Feb 10, 2026
964a2ee
feat: lint error resolved
saksham-datazip Feb 10, 2026
11a9f03
feat: self-reviewed
saksham-datazip Feb 10, 2026
348c21a
Merge branch 'staging' into feat/mysql-chunking-optimization
vaibhav-datazip Feb 12, 2026
94a6fd8
feat: recommiting the logical issue in mysql chunking calculation
saksham-datazip Feb 16, 2026
aa03463
Merge branch 'staging' into feat/mysql-chunking-optimization
ImDoubD-datazip Feb 20, 2026
bc1abf8
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 23, 2026
6e5e82f
feat: added splitviaprimarykey function
saksham-datazip Feb 24, 2026
d9189b9
chore: Merge branch 'feat/mysql-chunking-optimization' of https://git…
saksham-datazip Feb 24, 2026
69a1714
chore: improved formatting and resolved calculating validationcount q…
saksham-datazip Feb 28, 2026
96b5689
chore: merge conflict resolved
saksham-datazip Mar 1, 2026
66749dd
chore: constant size readjusted
saksham-datazip Mar 1, 2026
fa24a2c
chore: saperated buildChunkConditionMySQL function from mssql
saksham-datazip Mar 2, 2026
8411568
chore: fixed buildChunkConditionMySQL function for multiple colummns
saksham-datazip Mar 3, 2026
64f31c1
chore: resolved comment for final-testing
saksham-datazip Mar 4, 2026
debd4eb
chore: resolved lint error
saksham-datazip Mar 4, 2026
86a2d91
fix: changes pulled from staging
saksham-datazip Mar 7, 2026
8ead67e
chore: float and uint8 issue resolved
saksham-datazip Mar 9, 2026
0caf2aa
chore: converted float64 to int64
saksham-datazip Mar 9, 2026
8ccfdd6
chore: added uint8[] block and took datatype for numeric value from i…
saksham-datazip Mar 11, 2026
7754d72
chore: self reviewed
saksham-datazip Mar 11, 2026
9 changes: 9 additions & 0 deletions constants/constants.go
@@ -33,6 +33,15 @@ const (
EffectiveParquetSize = int64(256) * 1024 * 1024 * int64(8)
DB2StateTimestampFormat = "2006-01-02 15:04:05.000000"
DefaultStateTimestampFormat = "2006-01-02T15:04:05.000000000Z"
// DistributionLower and DistributionUpper define the acceptable range
// of the distribution factor for validating evenly distributed numeric PKs.
DistributionLower = 0.05
DistributionUpper = 100.0
Comment on lines +38 to +39
Collaborator


How did we choose these distribution factors?

Collaborator Author


It is an assumption, and Airbyte used it as well.

// UnicodeSize is the total number of valid Unicode code points (0 to 0x10FFFF)
UnicodeSize = 1114112
// MysqlChunkAcceptanceRatio defines the minimum ratio of expected chunks that must be generated
// for the split to be considered valid.
MysqlChunkAcceptanceRatio = float64(0.8)
)

type DriverType string
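
To make the review question about the distribution bounds concrete, here is a minimal sketch of how constants like these could gate a chunking strategy. It is not the PR's implementation: the helper names (`distributionFactor`, `isEvenlyDistributed`, `splitAccepted`) are hypothetical, and the formula `(max - min + 1) / rowCount` is an assumption about what "distribution factor" means, since the diff does not define it. Only the three constant values are copied from the diff.

```go
package main

import "fmt"

// Values copied from the diff; everything else in this file is illustrative.
const (
	DistributionLower         = 0.05
	DistributionUpper         = 100.0
	MysqlChunkAcceptanceRatio = float64(0.8)
)

// distributionFactor is a hypothetical helper. It ASSUMES the factor is the
// width of the numeric PK range divided by the row count, so a dense,
// gap-free key gives a value close to 1.
func distributionFactor(minPK, maxPK, rowCount int64) float64 {
	if rowCount <= 0 || maxPK < minPK {
		return 0
	}
	// Convert before subtracting to avoid int64 overflow on extreme ranges.
	return (float64(maxPK) - float64(minPK) + 1) / float64(rowCount)
}

// isEvenlyDistributed reports whether the factor falls inside the
// acceptable range defined by the two constants.
func isEvenlyDistributed(factor float64) bool {
	return factor >= DistributionLower && factor <= DistributionUpper
}

// splitAccepted is a hypothetical check that a split produced at least
// MysqlChunkAcceptanceRatio of the expected number of chunks.
func splitAccepted(generated, expected int) bool {
	if expected <= 0 {
		return false
	}
	return float64(generated) >= MysqlChunkAcceptanceRatio*float64(expected)
}

func main() {
	f := distributionFactor(1, 1000, 800)
	fmt.Println("factor:", f, "evenly distributed:", isEvenlyDistributed(f))
	fmt.Println("9 of 10 chunks accepted:", splitAccepted(9, 10))
	fmt.Println("7 of 10 chunks accepted:", splitAccepted(7, 10))
}
```

Under this assumed definition, a factor far above `DistributionUpper` would indicate a sparse key range where arithmetic splitting yields many empty chunks, which is one plausible reason to fall back to another strategy.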