@kunwp1 kunwp1 commented May 31, 2025

This PR introduces a new attribute type, LARGE_BINARY, to enable efficient handling of large binary fields by storing them externally in S3 rather than embedding them directly within tuples. It also includes the supporting changes this type requires across the system, including storage setup, Iceberg schema handling, reference counting, and result export.


Motivation

Storing entire tuples in external storage adds unnecessary complexity and hinders direct access to small fields. Instead, we now store only individual large fields externally, which simplifies implementation and improves flexibility. Users can still access smaller fields directly in memory while large binary data is managed separately and efficiently.


Design Overview

The new large_binary attribute type is distinct from the existing binary type:

  • binary: Stores raw byte arrays directly in the tuple.
  • large_binary: Stores a URI reference to an external binary object.
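
To make the distinction concrete, here is a minimal sketch in Scala. The names are purely illustrative; the actual PR extends the AttributeType enum in workflow-core rather than defining new value classes:

```scala
// Illustrative model of the two storage strategies, not the actual API.
sealed trait BinaryField
final case class InlineBinary(bytes: Array[Byte]) extends BinaryField // binary: raw bytes inside the tuple
final case class ExternalBinary(s3Uri: String) extends BinaryField    // large_binary: URI of an S3 object
```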

Lifecycle of large_binary fields

  • Creation: Before emitting a tuple, the operator uploads the binary object to external S3 storage.
  • Transfer: Tuples remain lightweight by storing only the URI.
  • Read: Downstream operators use a utility API to resolve the URI and fetch the binary content.
  • Deletion: Reference counting is used to manage deletion. When the count reaches zero, the binary object is deleted from S3.
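
Roughly, the lifecycle looks like the sketch below. This is a hedged illustration: LargeBinaryStore and its methods are hypothetical names standing in for the PR's utility API and reference-count bookkeeping.

```scala
import java.net.URI

// Hypothetical interface standing in for the PR's utility API.
trait LargeBinaryStore {
  def upload(bytes: Array[Byte]): URI  // Creation: put the object in S3, return its URI
  def download(uri: URI): Array[Byte]  // Read: resolve a URI back to the bytes
  def incRef(uri: URI): Unit           // bump the reference count for the URI
  def decRef(uri: URI): Unit           // decrement; the S3 object is deleted at zero
}

// Producer side: upload before emitting, so the tuple carries only the URI string.
def emitLargeBinaryField(store: LargeBinaryStore, payload: Array[Byte]): String = {
  val uri = store.upload(payload)
  store.incRef(uri)
  uri.toString
}

// Consumer side: fetch the content on demand from the URI stored in the tuple.
def readLargeBinaryField(store: LargeBinaryStore, field: String): Array[Byte] =
  store.download(URI.create(field))
```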

Implementation Details

  • S3 buckets for storing large binary objects are created when the computing unit master is launched. The bucket name is defined in storage.conf.
  • The S3StorageClient class was moved to core/workflow-core to be accessible from the computing unit master.
  • The LARGE_BINARY type is:
    • Stored as a string in Iceberg.
    • Distinguished from regular string attributes by a magic prefix, "TEXERA_LARGE_BINARY:", prepended to the attribute name (see the encoding sketch after this list).
    • Guarded during schema propagation: a plain string attribute whose name carries this prefix fails schema propagation, preventing accidental misinterpretation of types.
  • A new PostgreSQL table tracks the reference count for each URI.
  • Transactions provide concurrency control, keeping reference counts consistent under concurrent insertions and deletions (see the transaction sketch after this list).
  • The result export logic for cell data has been updated to resolve URIs back to actual binary content.
  • File uploads to S3 use multipart upload for speed and reliability; a 3GB test file uploaded in ~16 seconds, though timing varies with network conditions (a multipart sketch follows this list).
  • The FileScan operator throws an error if a BINARY type is used with files larger than 2GB.
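
The Iceberg name-prefix scheme can be sketched as follows; the prefix string comes from this PR, while the object and method names are assumptions:

```scala
object LargeBinaryNameCodec {
  // Magic prefix from this PR; everything else here is illustrative.
  private val Prefix = "TEXERA_LARGE_BINARY:"

  // Applied when writing the schema to Iceberg: tag the attribute name.
  def encode(attributeName: String): String = Prefix + attributeName

  // Applied when reading back: a prefixed string field is really LARGE_BINARY.
  def isLargeBinary(fieldName: String): Boolean = fieldName.startsWith(Prefix)

  def decode(fieldName: String): String = fieldName.stripPrefix(Prefix)
}
```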
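The transactional reference counting could look roughly like the JDBC sketch below. The table and column names (large_binary_ref_count, uri, ref_count) are hypothetical and not necessarily what 08.sql creates:

```scala
import java.sql.Connection

// Atomically add a reference for a URI (insert the row or bump the count).
def addReference(conn: Connection, uri: String): Unit = {
  conn.setAutoCommit(false)
  try {
    val stmt = conn.prepareStatement(
      """INSERT INTO large_binary_ref_count (uri, ref_count) VALUES (?, 1)
        |ON CONFLICT (uri) DO UPDATE
        |SET ref_count = large_binary_ref_count.ref_count + 1""".stripMargin)
    stmt.setString(1, uri)
    stmt.executeUpdate()
    conn.commit()
  } catch { case e: Exception => conn.rollback(); throw e }
}

// Atomically drop a reference; returns true when the count hit zero,
// meaning the caller should delete the S3 object.
def removeReference(conn: Connection, uri: String): Boolean = {
  conn.setAutoCommit(false)
  try {
    val stmt = conn.prepareStatement(
      """UPDATE large_binary_ref_count
        |SET ref_count = ref_count - 1
        |WHERE uri = ? RETURNING ref_count""".stripMargin)
    stmt.setString(1, uri)
    val rs = stmt.executeQuery()
    val shouldDelete = rs.next() && rs.getInt("ref_count") <= 0
    conn.commit()
    shouldDelete
  } catch { case e: Exception => conn.rollback(); throw e }
}
```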
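Finally, a condensed sketch of multipart upload with the AWS SDK v2 for Java; bucket, key, and part size are placeholders, and the real S3StorageClient implementation may differ (e.g., streaming from disk rather than holding a byte array in memory):

```scala
import software.amazon.awssdk.core.sync.RequestBody
import software.amazon.awssdk.services.s3.S3Client
import software.amazon.awssdk.services.s3.model._
import scala.jdk.CollectionConverters._

// Upload `data` in fixed-size parts; S3 requires parts of at least 5 MB (except the last).
def multipartUpload(s3: S3Client, bucket: String, key: String,
                    data: Array[Byte], partSize: Int = 8 * 1024 * 1024): Unit = {
  val uploadId = s3.createMultipartUpload(
    CreateMultipartUploadRequest.builder().bucket(bucket).key(key).build()).uploadId()

  val completedParts = data.grouped(partSize).zipWithIndex.map { case (part, idx) =>
    val partNumber = idx + 1 // part numbers are 1-based
    val response = s3.uploadPart(
      UploadPartRequest.builder()
        .bucket(bucket).key(key).uploadId(uploadId).partNumber(partNumber).build(),
      RequestBody.fromBytes(part))
    CompletedPart.builder().partNumber(partNumber).eTag(response.eTag()).build()
  }.toList

  s3.completeMultipartUpload(
    CompleteMultipartUploadRequest.builder()
      .bucket(bucket).key(key).uploadId(uploadId)
      .multipartUpload(CompletedMultipartUpload.builder().parts(completedParts.asJava).build())
      .build())
}
```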

Scope

This PR currently supports only the FileScan Java Native Operator.


Migration Notice

After merging this PR, you must run the following SQL script to apply the necessary schema changes for reference count tracking:

core/scripts/sql/updates/08.sql

This script creates the required PostgreSQL table for storing reference counts of large binary URIs. Failing to run this script will result in runtime errors when handling LARGE_BINARY attributes.


TODOs

  • Add support for UDF operators
  • Investigate and update other Java Native Operators as needed
[Three screenshots attached]

@kunwp1 kunwp1 requested a review from bobbai00 May 31, 2025 08:07
@kunwp1 kunwp1 self-assigned this May 31, 2025
@kunwp1 kunwp1 marked this pull request as ready for review May 31, 2025 08:57

@bobbai00 bobbai00 left a comment


Left some comments


kunwp1 commented Jun 6, 2025

@bobbai00 I've addressed your comments and also added support for concurrency control. Additionally, the UI has been updated to hide the URI from the user. The PR description has been updated accordingly. Here are the key highlights:

  • A new PostgreSQL table is used to track the reference count for each URI.
  • Transactions are employed to ensure concurrency control and maintain reference count consistency during concurrent insertions and deletions.

Please review the PR once more. Thanks.
