Skip to content

Support/2.16 deploy#1

Merged
xdelox merged 15 commits intosupport/2.16from
support/2.16-deploy
Dec 18, 2025
Merged

Support/2.16 deploy#1
xdelox merged 15 commits intosupport/2.16from
support/2.16-deploy

Conversation

@xdelox
Copy link

@xdelox xdelox commented Dec 18, 2025

Please add a meaningful description for your change here


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Run mvn clean install apache-rat:check to make sure basic checks pass. A more thorough check will be performed on your pull request automatically.
  • If you have a group of commits related to the same change, please squash your commits into one and force push your branch using git rebase -i.
  • Mention the appropriate issue in your description (for example: addresses #123), if applicable.

To make clear that you license your contribution under the Apache License Version 2.0, January 2004
you have to acknowledge this by using the following check-box.

sramazzina and others added 15 commits December 17, 2025 17:24
… Types, fixes apache#5947 (apache#6186)

This commit introduces significant improvements to the Parquet Input and Output transforms by implementing comprehensive support for Parquet's Logical Types.

Previously, the transforms relied primarily on primitive types, leading to conversions issues and errors with data when handling complex types, such as Timestamps..

Key Changes & Features:

1. Parquet Input:
* Logical Type Mapping: Refactors the field discovery to use `LogicalTypeAnnotation` (instead of only primitive type), enabling correct mapping for semantic types.
* Timestamp/Date Precision: Implements a conversion mechanism to map Parquet's timestamps units (MILLIS, MICROS..) to Hop's `TYPE_TIMESTAMP` and `TYPE_DATE`, preserving precision and handling UTC adjustments.
* JSON Support: Adds explicit support for the JSON Logical Type, converting the Parquet binary/string data into Hop's `TYPE_JSON` object.
* Decimal Handling: Uses precision and scale from `DecimalLogicalTypeAnnotation` to correctly convert binary/long Parquet decimals into Hop's `TYPE_BIGNUMBER`.

2. Parquet Output:
* Date/Timestamp Consistency: Ensures that Hop's `TYPE_DATE` and `TYPE_TIMESTAMP` are consistently converted to a `LONG` representation with the Parquet `timestampMillis` logical annotation, which is the most compatible format.
* Schema Mapping: Maps Hop's `TYPE_JSON` and `TYPE_UUID` to Parquet `STRING` types in the schema definition.

Testing and Validation:
* Test Data Enrichment: The test dataset (`golden-parquet-input.json`) was extended to include new fields: `isActive` (Boolean), `registrationTimestamp` (Timestamp), and `metadataJson` (JSON), ensuring the new types are covered end-to-end.
* Unit Test Update: The unit test configuration (`0029-parquet-input UNIT.json`) was updated to map and validate the new fields, confirming the correct functionality of the transform.

This resolves a major limitation regarding data fidelity when dealing with common modern Parquet schemas.

(cherry picked from commit 6583915)
* Fix apache#5164 - value type in "Formula" transform injection

* Fix apache#5225 - Reverted previous changes. Applied usage of InjectionTypeConverter to convet from type id to datatype name. Added integration tests

* Fix apache#5225 - Fixed missing license header

(cherry picked from commit a54ea4b)
…#6060 (apache#6065)

* fix show filenames button throws an error in Get Data From XML

Signed-off-by: lance <leehaut@gmail.com>

* fix show filenames button throws an error in Get Data From XML

Signed-off-by: lance <leehaut@gmail.com>

---------

Signed-off-by: lance <leehaut@gmail.com>
(cherry picked from commit ccf230a)
* Fix apache#5225 - Cannot wire Workflow Executor results rows hop to next transform

* Fix apache#5225 - Added integration test to check for execution results' rows

(cherry picked from commit 8ff5bd2)
@github-actions github-actions bot added documentation Improvements or additions to documentation Beam Hop Gui Integration Testing labels Dec 18, 2025
@xdelox xdelox merged commit 1162c40 into support/2.16 Dec 18, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Beam documentation Improvements or additions to documentation Hop Gui Integration Testing

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants