Skip to content

Commit 783f380

Browse files
committed
Update 1000_Design_Patterns/Design Pattern - Generic - Loading Persistent Staging Area tables.md
1 parent 6053539 commit 783f380

File tree

1 file changed

+13
-9
lines changed

1 file changed

+13
-9
lines changed

1000_Design_Patterns/Design Pattern - Generic - Loading Persistent Staging Area tables.md

Lines changed: 13 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -32,23 +32,27 @@ To support these requirements, there are three main processes in place within th
3232
A safety check to prevent reloading of already loaded data. This is to avoid accidental reruns or out-of-order processing causes errors due to key violations, in combination with the requirement to load multiple changes in a single run.
3333
A verification if the change provided is really a change. This is to allow the scope of attributes to change, for instance if a specific attribute is removed from the HSTG table (and still exists in STG). As part of the standard ETL requirements, the information provided is always compared against the target scope of attributes.
3434
An ordering of changes (per key, over time) in which only the latest change is compared against the most recent change as available in HSTG.
35-
35+
3636
The above three components together satisfy the HSTG template requirements. The ETL process can be described as loading delta sets into the source historical archive.
37-
38-
37+
38+
3939
## Implementation Guidelines
4040
Use a single ETL process, module or mapping to load data from a single source system table in the corresponding History Area table.
4141
The Load Date / Time stamp is the logical ‘effective date’, and is copied from the Staging Area table. The Staging Area handles the correct definition of the time a change has occurred.
4242
Because of the differences between source interfaces, relying on the CDC Operation (i.e. Insert, Update or Delete) to detect change is not always possible. For this reason all History Area ETL processes need to contain a key lookup to compare values (detect changes).
4343

4444
## Consequences and Considerations
4545
Loading processes towards the Integration Area can either be sourced from the Staging Area or the History Area depending on the scheduling requirements.
46+
4647
The History Area can be loaded in parallel with the Integration Area, or between the Staging Area and Integration Area.
47-
Known uses
48-
Every source table or file has an accompanying History Area table and ETL process.
48+
49+
The 'prevent reprocessing' functionality can also be implemented using the Event Date / Time attribute instead of the Load Date / Time attribute.
50+
51+
52+
53+
4954

5055
## Related Patterns
51-
Implementation Pattern for SSIS 006 - Loading History Area tables
52-
Design Pattern 003 – Mapping requirements.
53-
Design Pattern 006 – Using Start, Process and End dates.
54-
Implementation Pattern 037 - Re-initialisation process.
56+
- [Design Pattern - Generic - Managing temporality by using Load, Event and Change dates]()
57+
- Implementation Pattern for SSIS - Loading Persistent Staging Area tables
58+
- Implementation Pattern - Generic - Re-initialisation process

0 commit comments

Comments
 (0)