Commit 441c9ed

committed
add DataIntegration changelog entries
1 parent e5c3a07 commit 441c9ed

File tree

1 file changed

+196
-57
lines changed
  • docs/release-notes/corporate-memory-24-3

docs/release-notes/corporate-memory-24-3/index.md

@@ -14,11 +14,11 @@ Corporate Memory 24.3.0 is the third major release in 2024.
1414
The highlights of this release are:
1515

1616
- Explore and Author:
17-
-
17+
-
1818
- Build:
19-
-
19+
-
2020
- Automate:
21-
-
21+
-
2222

2323
!!! info inline end "Important info"
2424

@@ -28,91 +28,225 @@ The highlights of this release are:
2828
This release delivers the following component versions:
2929

3030
- eccenca DataIntegration v24.3.0
31-
- eccenca Explore v24.3.0 (formally separated into DataPlatform and DataManager)
31+
- eccenca Explore v24.3.0 (formerly DataPlatform and DataManager)
3232
- eccenca Corporate Memory Control (cmemc) v24.3.0
3333

3434
We tested this release with the following dependency components:
3535

36-
- Ontotext GraphDB v10.8.0
37-
- Keycloak v26.0.5
36+
- Ontotext GraphDB v10.8.2
37+
- Keycloak v25.0.6
3838

3939
More detailed information for this release is provided in the next sections.
4040

4141
## eccenca DataIntegration v24.3.0
4242

43-
...
43+
We're excited to bring you the latest update to DataIntegration v24.3, which introduces new features, improvements and bug fixes:
44+
45+
v24.3.0 of DataIntegration adds the following new features:
46+
47+
- Workspace search:
48+
- Support to filter workflows that contain replaceable datasets.
49+
- Display tags on workflow search items when they contain replaceable datasets.
50+
- Add file name and graph URIs to search items as searchable tags.
51+
- Workflow editor:
52+
- Support creating knowledge graph datasets from DataPlatform graphs matching the search query.
53+
- Added copy prefixes option in copy task dialog.
54+
- Integration of a Prometheus endpoint to expose many useful metrics.
55+
- Transform operators to retrieve attributes from input tasks:
56+
- _Input Task attributes_ retrieves individual attributes from the input task (such as the modified date) or the entire task as JSON.
57+
- _Input file attributes_ retrieves a metadata attribute from the input file (such as the file name).
58+
- JdbcDialect implementation for Trino: fixes STRING type mapping, adds an `isolationLevel` option to avoid connections resetting AutoCommit mode, and serves as an example for the dialect concept.
59+
- File hash transformer:
60+
- Calculates the hash sum of a given file
61+
- Works on either the input file dataset or a selected file from the project
62+
- JSON special paths:
63+
- `#propertyName` accesses the current object key
64+
- `*` selects all direct children of the current token
65+
- Add link from a task parameter description into the task's Markdown documentation for this parameter, if available.
66+
- Show sample (output) entities for workflow operators in the workflow reports.
67+
- Text dataset allows configuring the zip regex.
68+
- Support setting the locale for the 'Parse date pattern' and 'Parse date' transform operators.
70+
- More fine-grained access control:
71+
- In addition to a base action, it is possible to specify any number of specific actions that protect specific endpoints.
72+
- Endpoints are configured in a whitelist as URI prefixes per specific action.
73+
- All endpoints that are protected by any specific action cannot be accessed anymore via the base action.
74+
- Two new actions are configured by default and protect the Python plugin management and specific workspace API endpoints. See changes and migrations.
75+
- Global variables can be marked sensitive for storing passwords:
76+
- Sensitive variables can only be used in password fields.
77+
- Using sensitive variables in other fields or in variable templates fails and does not expose the value.
78+
- Example:
79+
80+
```conf
81+
config.variables = {
82+
global = {
83+
sensitiveVar = {
84+
value = "value 2"
85+
isSensitive = true
86+
}
87+
}
88+
}
89+
```
90+
91+
- Delete project files operator: Allows deleting project files in a workflow based on a regex.
92+
- Added Snowflake dataset type.
93+
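The fine-grained access control entry in the feature list above can be read as a prefix lookup: an endpoint covered by a specific action is no longer reachable with the base action alone. The following Python snippet is a minimal sketch of that rule, not the DataIntegration implementation. The action URIs are taken from this changelog, but the endpoint prefixes and the function names `required_actions` and `is_allowed` are hypothetical, and it assumes that any one matching specific action is sufficient.

```python
BASE_ACTION = "https://vocab.eccenca.com/auth/Action/Build"

# Whitelist of URI prefixes per specific action.
# The prefixes below are hypothetical placeholders, not the shipped defaults.
SPECIFIC_ACTIONS = {
    "https://vocab.eccenca.com/auth/Action/Build-AdminPython": [
        "/api/python/",
    ],
    "https://vocab.eccenca.com/auth/Action/Build-AdminWorkspace": [
        "/workspace/reload",
        "/workspace/import",
    ],
}


def required_actions(path):
    """Return the actions that grant access to `path`.

    If any specific action protects the path, only those actions count;
    otherwise the base action is sufficient.
    """
    matching = {
        action
        for action, prefixes in SPECIFIC_ACTIONS.items()
        if any(path.startswith(prefix) for prefix in prefixes)
    }
    return matching or {BASE_ACTION}


def is_allowed(path, granted_actions):
    """Check whether a user holding `granted_actions` may call `path`."""
    return bool(required_actions(path) & set(granted_actions))
```

In this sketch, a user holding only the base action can still call unprotected endpoints, but needs the corresponding specific action for any whitelisted prefix.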
94+
v24.3.0 of DataIntegration introduces the following changes:
95+
96+
- Optimized writing to Neo4j, resulting in a 25x speed improvement.
97+
- Upgraded Spark to 3.5.3.
98+
- Upgraded TypeScript to version 5.5.3.
99+
- After saving a workflow, the undo/redo queues are cleared, which is consistent with other editors in DI/DM.
100+
- Renamed DI action from `urn:eccenca:di` to `<https://vocab.eccenca.com/auth/Action/Build>`.
101+
- Line breaks are forced for evaluation preview tooltips.
102+
- If a project is copied to another project, all referenced project variables and their dependent variables are copied to the target project as well.
103+
- Docker image: switched to the `eclipse-temurin:17-ubi9-minimal` base image.
104+
- Prefix handling:
105+
- Only prefixes added to a specific project are serialized/exported, no prefixes loaded by the workspace (e.g. from DP).
106+
- Only load user prefixes and prefixes of installed vocabularies from DP into DI.
107+
- All datasets that support zips can be written now.
108+
- Increase visibility of breadcrumbs in application header.
109+
- Configurable Favicon in DataIntegration.
110+
111+
v24.3.0 of DataIntegration ships the following fixes:
112+
113+
- Jinja templates can lead to OutOfMemory issues.
114+
- Loading of JDBC Type 4 Drivers from Jar at runtime.
115+
- Add add-opens JDK option to sbt parameters to avoid Serialization errors in executors.
116+
- User defined function removed to prevent startup error in local dev mode.
117+
- After saving a workflow the workflow editor can be closed without warning of unsaved changes.
118+
- Race condition in Excel map transformer cache.
119+
- Remote Client-Side Code Execution through CSV Injection identified in penetration testing.
120+
- CSV datasets should not be cleared at the beginning of a workflow since they are overwritten anyway.
121+
- Ports of datasets are shown as required in workflow validation, but are not.
122+
- In workspace/project item search disable Enter behavior while a search is pending.
123+
- Use correct icons for copy/clone actions.
124+
- Workflow editor:
125+
- Workflow is not re-validated after undo/redo operations.
126+
- Re-configuring a workflow node to not having a data output is not immediately visible (only after reload).
127+
- When the `Create new dataset` operator is used it always creates a _dataset_ even though the item type was changed.
128+
- Caches of file-based datasets are not refreshed when updated via file download operator.
129+
- Dependency ports checkbox does not show checkmark in workflow tasks with unconnected output port.
130+
- Fix text on node menu options that have a checkbox. Always show the _enabled_ text.
131+
- REST task:
132+
- When paging is enabled and entities are output only the last request result is output.
133+
- Add TLSv1.3 support.
134+
- Hierarchical mapping editor: Entity relationship direction input does not show current selection.
135+
- Transform rule editor:
136+
- Validation errors are not shown when starting the evaluation.
137+
- Notifications are not correctly cleared and shown.
138+
- Transform execution report:
139+
- Type URI validation issues are not shown in the transform execution report.
140+
- Rule tree in transform execution report and evaluation tab has a broken collapse/expand state.
141+
- Password parameter templates are empty initially.
142+
- Fix issues in create/update dialog:
143+
- Depending input gets disabled if dependent input has an empty default value.
144+
- Data preview of dataset with nested parameters is not working.
145+
- Task config preview has a different parameter ordering than in the create/update dialog.
146+
- Evaluation of a text path of a text dataset in a rule editor fails.
147+
- Cannot execute SPARQL update queries with parameter templates.
148+
- `Evaluate template` operator: Changed project variable not updated without evaluating transform.
149+
- Jinja interpreter does not clear previous errors.
150+
- Process of opening and closing the handle tools menu.
151+
- Manually defined project prefixes are automatically copied to other projects after reload.
152+
- Removing a vocabulary does not remove the vocabulary prefix from the DI projects.
153+
- Cannot reconfigure parameter values with templates in workflows.
154+
- Workflow report shows multiple executions of some operators even though they were only executed once.
155+
- Python Workflow status incorrect.
156+
- Python Workflow operators could not be cancelled in some cases.
157+
- Alignment dataset should support the clear method so it can be used in workflows.
158+
- Drop zone in workflow editor freezes sometimes after dropping an operator.
159+
- Transform/Linking operator's 'Restriction' documentation is incorrectly formatted.
160+
- DI project "Items per page" cuts off "100" as "1...".
161+
- Wide task descriptions are not nicely scrollable.
162+
- Inline documentation of `Clean HTML` is incomplete/wrong.
163+
- Cannot delete mapping rule target type anymore.
164+
- SPARQL Construct task does not update its execution report.
165+
166+
## eccenca Explore v24.3.0
44167
45168
!!! info inline end "Important info"
46169
47170
Since v24.3.0, the components eccenca DataPlatform and eccenca DataManager are merged
48171
into a single component eccenca Explore.
49-
## eccenca Explore v24.3.0
50172
51-
...
173+
We are excited to announce Explore v24.3, which introduces new features, improvements and bug fixes.
174+
175+
v24.3.0 of Explore adds the following new features:
176+
177+
- …
178+
179+
v24.3.0 of Explore ships the following changes:
180+
181+
- …
182+
183+
v24.3.0 of Explore ships the following fixes:
184+
185+
- …
52186
53187
## eccenca Corporate Memory Control (cmemc) v24.3.0
54188
55189
This version of cmemc adds the following new features:
56190
57191
- `graph validation execute` command
58-
- `--inspect` option to return the list of violations instead of the summary (includes `--wait`)
192+
- `--inspect` option to return the list of violations instead of the summary (includes `--wait`)
59193
- `graph validation inspect` command
60-
- retrieval and display of titles as terminal links for resources
61-
- completion: retrieval and display of titles as descriptions
194+
- retrieval and display of titles as terminal links for resources
195+
- completion: retrieval and display of titles as descriptions
62196
- `graph validation list` command
63-
- retrieval and display of titles as terminal links for graphs
197+
- retrieval and display of titles as terminal links for graphs
64198
- `graph export` command
65-
- option `--compress` to generate compressed ttl file
199+
- option `--compress` to generate compressed ttl file
66200
- `graph import` command
67-
- support import of compressed ttl/nt files
201+
- support import of compressed ttl/nt files
68202
- `admin store export` command
69-
- `--replace` option to replace an existing file
70-
- if no BACKUP_FILE is given, a default of `{{date}}-{{connection}}.store.zip` is used
203+
- `--replace` option to replace an existing file
204+
- if no BACKUP_FILE is given, a default of `{{date}}-{{connection}}.store.zip` is used
71205
- `project import` command
72-
- `--replace` option to replace an existing project
206+
- `--replace` option to replace an existing project
73207
- `project export` command
74-
- `--replace` option to replace an existing file
208+
- `--replace` option to replace an existing file
75209
- `admin workspace export`
76-
- `--replace` option to replace an existing file
210+
- `--replace` option to replace an existing file
77211
- `admin metrics` command group
78-
- support for build / data integration metrics, e.g. `build:cmem_workspace_task_spec_size`
79-
- support for graphdb store metrics, e.g. `store:graphdb_slow_queries_count`
212+
- support for build / data integration metrics, e.g. `build:cmem_workspace_task_spec_size`
213+
- support for GraphDB store metrics, e.g. `store:graphdb_slow_queries_count`
80214
- `admin metrics list` command
81-
- documentation column to output table
82-
- `--filter` option to filter metrics table by job, name, ID, or type
215+
- documentation column to output table
216+
- `--filter` option to filter metrics table by job, name, ID, or type
83217
- `admin acl` command group
84-
- support for updated 24.3 access condition vocabulary and ACL graph
218+
- support for updated 24.3 access condition vocabulary and ACL graph
85219
- `admin migration` command group
86-
- `admin migration list` command - List migration recipes
87-
- `admin migration execute` command - Execute needed migration recipes
88-
- The following migration recipes are available:
89-
- `bootstrap-data` - Re-import bootstrap system data to match current version
90-
- `workspace-configurations` - Forward-upgrade explore workspace configurations
91-
- `acl-graph-24.3` - Move access conditions and used queries to new ACL graph
92-
- `acl-vocab-24.3` - Migrate auth vocabulary terms (actions and other grants)
93-
- `chart-widgets-24.3` - Migrate Chart Property Shapes to Widget Integrations
94-
- `workflow-trigger-widgets-24.3` - Migrate Workflow Trigger Property Shapes to Widget Integrations
220+
- `admin migration list` command - List migration recipes
221+
- `admin migration execute` command - Execute needed migration recipes
222+
- The following migration recipes are available:
223+
- `bootstrap-data` - Re-import bootstrap system data to match current version
224+
- `workspace-configurations` - Forward-upgrade explore workspace configurations
225+
- `acl-graph-24.3` - Move access conditions and used queries to new ACL graph
226+
- `acl-vocab-24.3` - Migrate auth vocabulary terms (actions and other grants)
227+
- `chart-widgets-24.3` - Migrate Chart Property Shapes to Widget Integrations
228+
- `workflow-trigger-widgets-24.3` - Migrate Workflow Trigger Property Shapes to Widget Integrations
95229
96230
In addition to that, these changes and fixes are included:
97231
98232
- cmemc will not fail anymore when the config dir is not creatable (message in debug)
99233
- cmemc will not fail anymore when the config ini is not readable (message in debug)
100234
- For these commands `admin acl list`, `dataset list`, `graph list`, `project list`, `admin user list`, `project variable list`, `vocabulary list`, `workflow list`, `admin workspace python list`, `admin workspace python list-plugins`, `dataset resource list`, `workflow scheduler list`, and `vocabulary cache list`:
101-
- omit empty tables with usage note message
235+
- omit empty tables with usage note message
102236
- `admin status` command
103-
- component name change: DI -> BUILD
104-
- component name change: DP -> EXPLORE
105-
- component removal: DM (merged into EXPLORE)
106-
- key prefix change: dp -> explore
107-
- key prefix change: di -> build
237+
- component name change: DI -> BUILD
238+
- component name change: DP -> EXPLORE
239+
- component removal: DM (merged with DP into EXPLORE)
240+
- key prefix change: dp -> explore
241+
- key prefix change: di -> build
108242
- `project export` command
109-
- `--filename-template` completion examples adaption
243+
- `--filename-template` completion examples adaption
110244
- `dataset create` command
111-
- Support compressed zip files for dataset types including CSV, XML, JSON, YAML, and plain text.
245+
- Support compressed zip files for dataset types including CSV, XML, JSON, YAML, and plain text.
112246
- `admin metrics` command group
113-
- metrics identification now as combined ID of `job_id:metrics_name`
247+
- metrics identification now as combined ID of `job_id:metrics_name`
114248
- `admin metrics` command group
115-
- `--job` option, use `--filter job job_id` or combined metrics ID instead
249+
- `--job` option, use `--filter job job_id` or combined metrics ID instead
116250
117251
## Migration Notes
118252
@@ -126,30 +260,35 @@ In addition to that, these changes and fixes are included:
126260
127261
### DataIntegration
128262
129-
...
263+
- CSV files are no longer deleted by default at the beginning of a workflow execution. This behavior can be changed in the CSV dataset configuration.
264+
- Access control changes. Action URIs have been renamed and new actions are introduced by default:
265+
- `urn:eccenca:di` -> `<https://vocab.eccenca.com/auth/Action/Build>`
266+
- `urn:elds-backend-all-actions` -> `<https://vocab.eccenca.com/auth/Action/AllActions>`
267+
- Python plugin management endpoints are now secured via `<https://vocab.eccenca.com/auth/Action/Build-AdminPython>` action.
268+
- Workspace admin functions (reload workspace, import workspace) are now secured via `<https://vocab.eccenca.com/auth/Action/Build-AdminWorkspace>` action.
269+
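For deployments that keep action URIs in their own configuration or scripts, the renames listed above could be applied with a small lookup table. This is only an illustrative sketch: the helper name `migrate_action` is hypothetical, and it assumes the angle brackets in the list are notation rather than part of the URI. Access conditions stored in the ACL graph are covered by the cmemc migration recipes, not by this snippet.

```python
# Old action URI -> new action URI, as listed in the migration notes.
ACTION_RENAMES = {
    "urn:eccenca:di": "https://vocab.eccenca.com/auth/Action/Build",
    "urn:elds-backend-all-actions": "https://vocab.eccenca.com/auth/Action/AllActions",
}


def migrate_action(uri):
    """Return the renamed action URI, or the input unchanged if it was not renamed."""
    return ACTION_RENAMES.get(uri, uri)
```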
270+
### Explore
130271
131272
!!! info inline end "Important info"
132273
133274
Since v24.3.0, the components eccenca DataPlatform and eccenca DataManager are merged
134275
into a single component eccenca Explore.
135-
### Explore
136276
137277
...
138278
139279
### cmemc
140280
141281
- All scripts which used the `admin status` command with the `--key` option:
142-
- adapt the key prefixes accordingly:
143-
- old: `cmemc admin status --key dp.info.license.validDate`
144-
- new: `cmemc admin status --key explore.info.license.validDate`
282+
- adapt the key prefixes accordingly:
283+
- old: `cmemc admin status --key dp.info.license.validDate`
284+
- new: `cmemc admin status --key explore.info.license.validDate`
145285
- `admin store migrate` command deprecated
146-
- use the `admin migration` command group instead
286+
- use the `admin migration` command group instead
147287
- `--overwrite` options deprecated - will be removed with the next major version
148-
- affected commands:
149-
- `project import` command
150-
- `project export` command
151-
- `admin workspace export` command
288+
- affected commands:
289+
- `project import` command
290+
- `project export` command
291+
- `admin workspace export` command
152292
- All scripts which used the `admin metrics` command group:
153-
- use combined metrics ID of `job_id:metrics_name`
154-
- use `--filter job job_id` instead of `--job job_id`
155-
293+
- use combined metrics ID of `job_id:metrics_name`
294+
- use `--filter job job_id` instead of `--job job_id`
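The cmemc script changes above (new `admin status` key prefixes, `--filter job` instead of `--job`) are mechanical enough to sketch as a text rewrite. The Python helper below is a hedged illustration, not part of cmemc: the function names are hypothetical, it works purely on the command text, and it rewrites every `--key` and `--job` occurrence it finds, so its output should be reviewed before use.

```python
import re

# Key prefix changes for `admin status --key`, per the migration notes.
KEY_PREFIXES = {"dp": "explore", "di": "build"}


def migrate_status_key(key):
    """Rewrite the first dotted segment of a status key (dp -> explore, di -> build)."""
    prefix, sep, rest = key.partition(".")
    return KEY_PREFIXES.get(prefix, prefix) + sep + rest


def migrate_command(line):
    """Apply the 24.3 rewrites to one line of a shell script."""
    # `--key dp.…` / `--key di.…` -> new component prefixes
    line = re.sub(
        r"(--key\s+)(\S+)",
        lambda m: m.group(1) + migrate_status_key(m.group(2)),
        line,
    )
    # `--job job_id` -> `--filter job job_id`
    line = re.sub(r"--job\s+(\S+)", r"--filter job \1", line)
    return line
```

For example, `migrate_command("cmemc admin status --key dp.info.license.validDate")` yields the new form shown in the list above. Keys with other prefixes are left untouched.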
