feat: add decimal value representation #182

zhjwpku · 2025-08-17T15:21:06Z

No description provided.

zhjwpku · 2025-08-22T15:57:58Z

This is ready for review. I've extracted the Decimal128 implementation and tests from Arrow. Please help to review. @wgtmac @lidavidm @mapleFU

LICENSE

src/iceberg/util/port.h

src/iceberg/expression/decimal.h

src/iceberg/expression/decimal.cc

src/iceberg/expression/decimal.h

wgtmac · 2025-08-23T15:32:15Z

Could you explicitly mark which part is exactly ported from Arrow to make the current review and future sync process easier?

zhjwpku · 2025-08-24T07:07:39Z

Could you explicitly mark which part is exactly ported from Arrow to make the current review and future sync process easier?

That's not easy, Arrow's Decimal type uses a complex inheritance hierarchy and supports bit lengths in any 64-bit multiple. Since we only need Decimal128, l omitted the inheritance layer. But the algorithms adapted from Arrow is quite straightforward and should be easy for future sync.

src/iceberg/expression/decimal.h

mapleFU · 2025-08-24T10:10:13Z

What about a thirdparty/ vendor directory, and we can use it same as <arrow/xxx> but it's vendored by us? (Current solving is also ok to me)

wgtmac · 2025-08-24T13:16:18Z

I guess we cannot port arrow/util/decimal.h without significant change due to dependency on arrow utilities.

My concern is that we use std::array<uint64_t, 2> as the internal representation but use int128_t for testing. I'm not sure if we can directly use int128_t as the physical representation for the Decimal class. I did exactly this when I worked for my previous employer and it should work well.

zhjwpku · 2025-08-25T02:23:14Z

I guess we cannot port arrow/util/decimal.h without significant change due to dependency on arrow utilities.

My concern is that we use std::array<uint64_t, 2> as the internal representation but use int128_t for testing. I'm not sure if we can directly use int128_t as the physical representation for the Decimal class. I did exactly this when I worked for my previous employer and it should work well.

uint128_t is now used for Decimal multiplier, I think we can use int128_t for the division, let me try the int128_t physical representation idea for the next few days.

src/iceberg/util/decimal.cc

LICENSE

src/iceberg/util/decimal.h

test/CMakeLists.txt

src/iceberg/util/decimal.h

src/iceberg/util/decimal.cc

src/iceberg/util/decimal.h

src/iceberg/util/decimal.cc

wgtmac · 2025-09-06T14:49:40Z

src/iceberg/util/decimal.cc

+
+Decimal::Decimal(std::string_view str) {
+  auto result = Decimal::FromString(str);
+  if (!result) {


Then please throw IcebergError and add a comment with \throw by recommending Decimal::FromString to be used in production.

zhjwpku · 2025-09-07T23:42:16Z

@mapleFU Can you take a look at this again, thanks.

wgtmac · 2025-09-08T06:32:35Z

This is mainly ported from Arrow C++. Could you help take a quick pass? @lidavidm @zeroshade

Fokko

This looks good to me @zhjwpku Thanks for working on this, and thanks @wgtmac and @dongxiao1198 for the review 🙌

Fokko · 2025-09-16T11:38:03Z

src/iceberg/util/decimal.cc

+  uint128_t value = 0;
+  ShiftAndAdd(dec.while_digits, value);
+  ShiftAndAdd(dec.fractional_digits, value);
+  Decimal result(static_cast<int128_t>(value));


Keep in mind that Iceberg might also store the Decimal as an int32 or int64:

Thanks for the reminder, will keep in mind, thanks.

zhjwpku force-pushed the int128_and_decimal branch 2 times, most recently from 70068f9 to e48c9f8 Compare August 22, 2025 14:03

zhjwpku marked this pull request as ready for review August 22, 2025 15:43

wgtmac reviewed Aug 23, 2025

View reviewed changes

wgtmac reviewed Aug 24, 2025

View reviewed changes

src/iceberg/expression/decimal.h Outdated Show resolved Hide resolved

zhjwpku force-pushed the int128_and_decimal branch 4 times, most recently from 6ab1a93 to a4b5dde Compare August 30, 2025 03:41

zhjwpku commented Aug 30, 2025

View reviewed changes

src/iceberg/util/decimal.cc Outdated Show resolved Hide resolved

wgtmac reviewed Sep 2, 2025

View reviewed changes

zhjwpku force-pushed the int128_and_decimal branch 2 times, most recently from 2b23ca3 to bfe9fc1 Compare September 4, 2025 00:06

zhjwpku requested a review from wgtmac September 6, 2025 01:41

wgtmac approved these changes Sep 6, 2025

View reviewed changes

zhjwpku force-pushed the int128_and_decimal branch from 9cbc0df to 54b09e7 Compare September 7, 2025 12:59

zhjwpku added 8 commits September 8, 2025 07:20

feat: add decimal value representation

5a15ef2

feat: add ToIntegerString

f43023d

feat: add ToString

4ebb281

feat: add FromReal and ToReal

d93b650

feat: add more ToReal tests

640f8c2

feat: add FromBigEndian

51cf048

fix: include missing headers

83487db

feat: add more tests

aaafbfe

zhjwpku added 12 commits September 8, 2025 07:20

feat: add rescale

00dac13

fix: windows compiling error

74742e7

chore: add license

feff5bd

fix: review comments

026dfc9

fix: use int128_t as Decimal data

c272c55

chore: use int128_t calculation

2d5096a

fix: remove unrelated codes

a46b2f4

fix: review comments

922d930

chore: remove FromReal/ToReal/ToInteger conversions

e293fc9

chore: use uint128 for FromBigEndian

d1fe186

chore: fix minor issue

cb22a3c

fix: review comments

081ef1f

zhjwpku force-pushed the int128_and_decimal branch from 54b09e7 to 081ef1f Compare September 7, 2025 23:23

dongxiao1198 approved these changes Sep 12, 2025

View reviewed changes

Fokko approved these changes Sep 16, 2025

View reviewed changes

Fokko merged commit 0fc573a into apache:main Sep 16, 2025
7 checks passed

zhjwpku mentioned this pull request Sep 16, 2025

[DISCUSSION] How to represents decimal type? #116

Closed

zhjwpku deleted the int128_and_decimal branch September 17, 2025 10:48

feat: add decimal value representation #182

feat: add decimal value representation #182

Uh oh!

Conversation

zhjwpku commented Aug 17, 2025

Uh oh!

zhjwpku commented Aug 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wgtmac commented Aug 23, 2025

Uh oh!

zhjwpku commented Aug 24, 2025

Uh oh!

Uh oh!

mapleFU commented Aug 24, 2025

Uh oh!

wgtmac commented Aug 24, 2025

Uh oh!

zhjwpku commented Aug 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wgtmac Sep 6, 2025

Choose a reason for hiding this comment

Uh oh!

zhjwpku commented Sep 7, 2025

Uh oh!

wgtmac commented Sep 8, 2025

Uh oh!

Fokko left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Fokko Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

zhjwpku Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Fokko left a comment •

edited

Loading