|
1 | 1 | --- |
2 | 2 | Aliases: [] |
3 | | -Tags: [seedling] |
| 3 | +tags: |
| 4 | + - seedling |
4 | 5 | publish: true |
| 6 | +cssclasses: |
| 7 | + - list-cards |
5 | 8 | --- |
6 | 9 |
|
7 | | -## Core Concepts |
| 10 | +## New to data engineering? Start Here: [[Getting Started With Data Engineering]] |
8 | 11 |
|
9 | | -- How to build a [[Data Pipeline|data pipeline]] |
| 12 | +- [[Data Architecture]] |
| 13 | + - Explore foundational architecture patterns including Data Warehouses, Data Lakes, Data Mesh, Lambda, Kappa, and Medallion architectures. |
| 14 | +- [[Data Ingestion]] |
| 15 | + - Learn about data movement techniques including Change Data Capture, Full Load vs Delta Load strategies, and ingestion best practices. |
| 16 | +- [[Data Management]] |
| 17 | + - Discover tools and practices for organizing, cataloging, governing, and managing data assets throughout their lifecycle. |
10 | 18 | - [[Data Modeling]] |
11 | | -- [[Batch Data Processing|Batch]] vs [[Stream Data Processing|Stream processing]] |
12 | | -- [[Indexing]] |
13 | | -- [[Relational Database|Relational]] vs [[Non-relational Database|Non-relational]] data |
14 | | -- [[Online Transaction Processing|OLTP]] vs [[Online Analytical Processing|OLAP]] |
15 | | - |
16 | | -> [!info]- ## Database Concepts |
17 | | -> |
18 | | ->- [[CAP Theorem]] |
19 | | ->- [[Column-oriented Database]] |
20 | | ->- [[Document Database]] |
21 | | ->- [[Graph Database]] |
22 | | ->- [[Hybrid Transactional Analytical Processing]] |
23 | | ->- [[In-Memory Database]] |
24 | | ->- [[Key-Value Database]] |
25 | | ->- [[Non-relational Database]] |
26 | | ->- [[Online Analytical Processing]] |
27 | | ->- [[Online Transaction Processing]] |
28 | | ->- [[Relational Database]] |
29 | | ->- [[Relational Database Management System]] |
30 | | ->- [[Timeseries Database]] |
31 | | -
|
32 | | -> [!info]- ## Data Pipeline Concepts |
33 | | -> |
34 | | ->- [[Batch Data Processing]] |
35 | | ->- [[Change Data Capture]] |
36 | | ->- [[Data Pipeline]] |
37 | | ->- [[Data Unit Test]] |
38 | | ->- [[Delta Load]] |
39 | | ->- [[Full Load]] |
40 | | ->- [[Idempotence]] |
41 | | ->- [[Stream Data Processing]] |
42 | | ->- [[Workflow Orchestration]] |
43 | | -
|
44 | | -> [!info]- ## Data Architecture Concepts |
45 | | -> |
46 | | ->- [[Data Architecture]] |
47 | | ->- [[Data Lake]] |
48 | | ->- [[Data Warehouse]] |
49 | | ->- [[Fan-out]] |
50 | | ->- [[Horizontal Scaling]] |
51 | | ->- [[Lambda Architecture]] |
52 | | ->- [[Medallion Architecture]] |
53 | | ->- [[Metrics Layer]] |
54 | | ->- [[Semantic Layer]] |
55 | | ->- [[Vertical Scaling]] |
| 19 | + - Study modeling techniques from traditional relational and dimensional modeling to newer approaches like Data Vault and Activity Schema. |
| 20 | +- [[Data Processing]] |
| 21 | + - Understand batch vs stream processing, OLTP vs OLAP systems, and core data processing paradigms. |
| 22 | +- [[Data Security, Ethics, and Compliance]] |
| 23 | + - Learn about data protection, privacy, regulatory compliance, and ethical data practices. |
| 24 | +- [[Data Storage]] |
| 25 | + - Compare relational, non-relational, document, graph, key-value, and column-oriented databases with their use cases. |
| 26 | +- [[Software Engineering]] |
| 27 | + - Apply software engineering principles including testing, indexing, scaling strategies, and development best practices to data systems. |
56 | 28 |
|
57 | 29 | %% wiki footer: Please don't edit anything below this line %% |
58 | 30 |
|
|
0 commit comments