You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: design-patterns/design-pattern-data-vault-hub.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -34,7 +34,7 @@ This pattern is applicable for the process of loading from the Staging Layer int
34
34
35
35
## Structure
36
36
37
-
A Hub table contains the unique list of business key, and the corresponding Hub ETL process can be described as an �insert only� of the unique business keys that are not yet in the the target Hub.
37
+
A Hub table contains the unique list of business key, and the corresponding Hub ETL process can be described as an 'insert only' of the unique business keys that are not yet in the the target Hub.
38
38
39
39
The process performs a distinct selection on the business key attribute(s) in the Staging Area table and performs a key lookup to verify if the available business keys already exists in the target Hub table. If the business key already exists the row can be discarded, if not it can be inserted.
> Depending on your philosophy on Data Vault implementation, Link Satellites may not be relevant or applicable.
12
-
> This is especially applicable to the driving key mechanism.
13
12
> There are very viable considerations to implement a Data Vault model *without* Link-Satellites.
13
+
> This is especially applicable to the driving key mechanism.
14
14
15
15
## Purpose
16
16
17
-
This design pattern describes how to represent cross-key end-dating for a Link-Satellite in Data Vault methodology. In Data Vault, Link-Satellite tables manage the change for relationships over time, but there might not be a good data-driven candidate to evaluate when certain relationships are valid.
17
+
This design pattern describes how to represent cross-key end-dating for a Link-Satellite in Data Vault methodology.
18
+
19
+
In Data Vault, Link-Satellite tables are an option that can be used manage the context (and changes for) relationships over time. In many cases, the context includes attributes that can be used to evaluate if a relationship can be considered 'active', or not, at a point in time.
20
+
21
+
For example, if an attribute 'is valid' or equivalent exists, this data can be used to determine if a relationship is valid and can be used for downstream data delivery. However, depending on the systems and data that are available, there might not always be a good data-driven candidate to evaluate the validity of a relationship.
22
+
23
+
The 'driving key' mechanism is a special technique to apply validity of relationships when there are no reliable data points to do so.
18
24
19
25
## Motivation
20
26
21
27
To provide an approach for establishing validity for Link Satellites, when no context is available to define validity of the relationship in time.
22
28
23
29
## Applicability
24
30
25
-
This pattern is only applicable for loading data to Link-Satellite tables from:
26
-
27
-
* The Staging Area into the Integration Area.
28
-
* The Integration Area into the Interpretation Area.
29
-
* The only difference to the specified ETL template is any business logic required in the mappings towards the Interpretation Area tables.
31
+
This pattern is applicable for processing data for a Link-Satellite table, or its keyed-instance (relationship describing Hub) equivalent.
30
32
31
33
## Structure
32
34
33
-
Standard Link-Satellites use the Driving Key concept to manage the ending of �old� relationships.
35
+
Standard Link-Satellites use the Driving Key concept to manage the ending of �old� relationships.
34
36
35
37
## Implementation Guidelines
36
38
39
+
To avoid data redundancy, it is recommended to manage this process into the target table as opposed to using end-dating.
0 commit comments