Skip to content

Commit 7383699

Browse files
authored
Merge fec82d4 into 390439c
2 parents 390439c + fec82d4 commit 7383699

File tree

11 files changed

+184
-45
lines changed

11 files changed

+184
-45
lines changed

README.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -147,7 +147,7 @@ From [Microsoft Documentation](https://learn.microsoft.com/pt-br/fabric/fundamen
147147
4. **You want to empower data consumers** (analysts, scientists, engineers) to discover and understand data assets easily.
148148
5. **You are scaling your data operations** and need consistent governance policies across teams and projects.
149149

150-
Click to read more about [Microsoft Purview for Fabric - Overview](./Purview-Fabric.md).
150+
Click to read more about [Microsoft Purview for Fabric - Overview](./Workloads-Specific/Purview/PurviewforFabric.md).
151151

152152
## Networking
153153

@@ -201,13 +201,13 @@ Click to read more about [Microsoft Purview for Fabric - Overview](./Purview-Fab
201201

202202
- [Azure Data Factory (ADF) - Best Practices Overview](./Workloads-Specific/DataFactory/BestPractices.md)
203203
- [Data Engineering - Best Practices Overview](./Workloads-Specific/DataEngineering/BestPractices.md)
204-
- [Data Warehouse - Best Practices Overview]() - in progress
205-
- [Data Science - Best Practices Overview]() - in progress
206-
- [Real-Time Intelligence - Best Practices Overview]() - in progress
204+
- [Data Warehouse - Best Practices Overview](./Workloads-Specific/DataWarehouse/BestPractices.md) - in progress
205+
- [Data Science - Best Practices Overview](./Workloads-Specific/DataScience/BestPractices.md) - in progress
206+
- [Real-Time Intelligence - Best Practices Overview](./Workloads-Specific/RealTimeIntelligence/BestPractices.md) - in progress
207207
- [Power Bi - Best Practices Overview](./Workloads-Specific/PowerBi/BestPractices.md)
208-
- [Copilot - Best Practices Overview]() - in progress
209-
- [Purview - Best Practices Overview]() - in progress
210-
- [OneLake - Best Practices Overview]() - in progress
208+
- [Copilot - Best Practices Overview](./Workloads-Specific/Copilot/BestPractices.md) - in progress
209+
- [Purview - Best Practices Overview](./Workloads-Specific/Purview/BestPractices.md) - in progress
210+
- [OneLake - Best Practices Overview](./Workloads-Specific/OneLake/BestPractices.md) - in progress
211211

212212
<div align="center">
213213
<h3 style="color: #4CAF50;">Total Visitors</h3>
Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
# Copilot - Best Practices Overview
2+
3+
Costa Rica
4+
5+
[![GitHub](https://badgen.net/badge/icon/github?icon=github&label)](https://github.com)
6+
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
7+
[brown9804](https://github.com/brown9804)
8+
9+
Last updated: 2025-05-02
10+
11+
----------
12+
13+
<details>
14+
<summary><b>List of References</b> (Click to expand)</summary>
15+
16+
</details>
17+
18+
<div align="center">
19+
<h3 style="color: #4CAF50;">Total Visitors</h3>
20+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
21+
</div>

Workloads-Specific/DataEngineering/BestPractices.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,7 @@ Last updated: 2025-05-02
6161
- **Comprehensive Schema Documentation:** Create detailed, auto-generated documentation for every endpoint; include sample queries, expected responses, and precise error messages to aid developer understanding.
6262
- **Robust Error Handling:** Implement consistent, informative error responses and integrate thorough test suites to guarantee smooth operation and backward compatibility as the API evolves.
6363

64-
https://github.com/user-attachments/assets/8971651d-9aff-4b41-94ca-9a35b9241f22
64+
<https://github.com/user-attachments/assets/8971651d-9aff-4b41-94ca-9a35b9241f22>
6565

6666
<div align="center">
6767
<h3 style="color: #4CAF50;">Total Visitors</h3>

Workloads-Specific/DataFactory/BestPractices.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -56,7 +56,6 @@ Last updated: 2025-05-02
5656

5757
</details>
5858

59-
6059
## Clear Pipeline Structure
6160

6261
> Ensure your pipelines are well-organized and easy to understand.
@@ -364,6 +363,7 @@ graph TD
364363
## Source Control
365364

366365
> Benefits of Git Integration: <br/>
366+
>
367367
> - **Version Control**: Track and audit changes, and revert to previous versions if needed. <br/>
368368
> - **Collaboration**: Multiple team members can work on the same project simultaneously. <br/>
369369
> - **Incremental Saves**: Save partial changes without publishing them live. <br/>
Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
# Data Science - Best Practices Overview
2+
3+
Costa Rica
4+
5+
[![GitHub](https://badgen.net/badge/icon/github?icon=github&label)](https://github.com)
6+
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
7+
[brown9804](https://github.com/brown9804)
8+
9+
Last updated: 2025-05-02
10+
11+
----------
12+
13+
<details>
14+
<summary><b>List of References</b> (Click to expand)</summary>
15+
16+
</details>
17+
18+
<div align="center">
19+
<h3 style="color: #4CAF50;">Total Visitors</h3>
20+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
21+
</div>
Lines changed: 35 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,35 @@
1+
# Data Warehouse - Best Practices Overview
2+
3+
Costa Rica
4+
5+
[![GitHub](https://badgen.net/badge/icon/github?icon=github&label)](https://github.com)
6+
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
7+
[brown9804](https://github.com/brown9804)
8+
9+
Last updated: 2025-05-02
10+
11+
----------
12+
13+
<details>
14+
<summary><b>List of References</b> (Click to expand)</summary>
15+
16+
</details>
17+
18+
> Ensure that your data warehouse solution is engineered for scalability, resilience, and efficient integration of diverse data sources. Every component (from the core warehouse to mirrored databases) should adhere to strict best practices for structure, documentation, and management, ensuring long-term maintainability and robust disaster recovery.
19+
20+
| **Best Practice** | **Description** | **Example** |
21+
|---------------|-------|-------------------|
22+
| **Structured Warehouse Implementation** | Build a robust, centralized data warehouse that organizes data into well-defined layers (often referred to as Bronze, Silver, and Gold). Layering the data warehouse ensures fast query performance, streamlined management, and strong governance. Leverage proper indexing, partitioning schemes, metadata tagging, and lineage tracking to support compliance and facilitate troubleshooting.| Create a warehouse solution that segments data as follows:<br>- **Bronze Layer:** Ingests raw, untransformed data maintaining source fidelity.<br>- **Silver Layer:** Applies data cleansing, validation, and enrichment.<br>- **Gold Layer:** Produces analytics-ready data using optimized storage formats like Parquet or Delta Lake, with partitioning by date or region. Integrate metadata catalogs and RBAC controls for added governance. |
23+
| **Sample Warehouse Environment** | Develop an isolated sample warehouse to prototype, test, and train on the data warehouse structure. This environment mimics the production warehouse architecture but contains a representative subset of data. Its purpose is to validate new queries, ETL routines, and performance tuning while insulating production operations from potential disruptions.| Deploy a sample warehouse using anonymized or synthetic data. For example, use a smaller, mirrored version of the production warehouse structure to experiment with SQL queries, develop new ETL pipelines, or train team members without impacting live data and processes.|
24+
| **Interactive Notebooks for Data Warehousing** | Use interactive notebooks as exploratory and documentation tools for your warehouse operations. These notebooks serve as an effective interface for testing queries, performing data analysis, and capturing transformation logic. Rich markdown annotations, code segmentation, and version control increase collaboration while ensuring reproducibility across the team.| Create notebooks that are segmented into distinct sections:<br>- **Data Loading:** Scripts to pull data from the warehouse.<br>- **Data Transformation:** Blocks that illustrate cleaning and enrichment steps.<br>- **Analysis & Visualization:** SQL queries and charts generated from warehouse data, supplemented with detailed markdown explanations and inline comments to clarify business logic.|
25+
| **Mirrored Azure SQL Database** | Implement mirroring for your Azure SQL Database to enhance high availability and enable robust disaster recovery. This mirroring facilitates load balancing, supports failover scenarios, and enables non-disruptive maintenance (crucial for mission-critical operations). | Configure a mirrored Azure SQL Database with geo-redundancy and automatic failover. For example, use Azure’s built-in replication to maintain a secondary copy that seamlessly takes over during primary instance outages, ensuring continuous data availability.|
26+
| **Mirrored Snowflake** | Establish a mirrored Snowflake instance to boost resilience and data consistency. Mirroring your Snowflake environment can help you maintain continuous integration testing and rapid recovery in case of primary instance failures, while also ensuring global replication consistency. | Deploy a Snowflake mirror by setting up data replication between your primary instance and a secondary environment. Regularly validate synchronization and monitor rollback capabilities to confirm that the mirror remains current and can support operations during failover or testing cycles.|
27+
| **Mirrored Azure Cosmos DB (Preview)** | Leverage the preview features of mirrored Azure Cosmos DB to set up a redundant NoSQL datastore. This setup targets high availability and low latency access for globally distributed applications, providing additional resilience through multi-region replication and load balancing. | Configure an Azure Cosmos DB mirroring setup in preview mode that replicates data across multiple regions. Test the environment by simulating high-load queries and failover events to ensure that global access is maintained with minimal latency.|
28+
| **Mirrored Azure Database for PostgreSQL (Preview)** | Utilize mirroring for Azure Database for PostgreSQL (in preview) to boost availability and support continuous operations. This safeguards maintenance activities, minimizes downtime, and ensures the consistency of transactional workloads. | Set up a mirrored Azure Database for PostgreSQL in its preview configuration. Create read replicas with continuous synchronization, perform failover drills, and track replication latency to guarantee that the mirrored instance maintains data integrity and high availability during operational stress.|
29+
| **Mirrored Azure SQL Managed Instance (Preview)** | Deploy a mirrored configuration for Azure SQL Managed Instance (preview) to combine the power of SQL Server with cloud agility. This approach underpins your enterprise’s disaster recovery strategy and supports legacy workloads with advanced data security and resiliency. | Configure an Azure SQL Managed Instance in a mirrored setup using strategies like log shipping or transactional replication. Monitor key performance metrics to ensure that replication latency is minimal, and the mirror is capable of supporting a swift transition during outages or maintenance windows.|
30+
| **Mirrored Database (Preview)** | Implement a generic mirrored database solution (in preview) to test cross-platform replication strategies. This flexible approach allows you to verify that your mirroring configurations, error-handling, and failover processes are robust before production rollout. | Set up a mirrored database configuration that synchronizes periodically with a primary instance. Schedule automated tests and synchronization checks, and simulate failover events to validate that the data remains consistent, with built-in alerts and monitoring demonstrating the mirror’s readiness for production use.|
31+
32+
<div align="center">
33+
<h3 style="color: #4CAF50;">Total Visitors</h3>
34+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
35+
</div>
Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
# OneLake - Best Practices Overview
2+
3+
Costa Rica
4+
5+
[![GitHub](https://badgen.net/badge/icon/github?icon=github&label)](https://github.com)
6+
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
7+
[brown9804](https://github.com/brown9804)
8+
9+
Last updated: 2025-05-02
10+
11+
----------
12+
13+
<details>
14+
<summary><b>List of References</b> (Click to expand)</summary>
15+
16+
</details>
17+
18+
<div align="center">
19+
<h3 style="color: #4CAF50;">Total Visitors</h3>
20+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
21+
</div>

0 commit comments

Comments
 (0)