Skip to content

Commit 69ae8e6

Browse files
authored
Merge 3de426a into 55c02d0
2 parents 55c02d0 + 3de426a commit 69ae8e6

File tree

4 files changed

+145
-1
lines changed

4 files changed

+145
-1
lines changed

.github/.markdownlint.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
{
22
"default": true,
33
"MD005": false,
4+
"MD009": false,
45
"MD013": false,
56
"MD028": false,
67
"MD029": false,

.github/workflows/validate_and_fix_markdown.yml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,6 +36,9 @@ jobs:
3636
3737
- name: Commit changes
3838
run: |
39+
git fetch origin
40+
git checkout ${{ github.event.pull_request.head.ref }}
3941
git add -A
42+
git rebase origin/${{ github.event.pull_request.head.ref }}
4043
git commit -m "Fix Markdown syntax issues" || echo "No changes to commit"
4144
git push origin HEAD:${{ github.event.pull_request.head.ref }}

Purview-Fabric.md

Lines changed: 120 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,120 @@
1+
# Purview for Fabric - Overview
2+
3+
Costa Rica
4+
5+
[![GitHub](https://badgen.net/badge/icon/github?icon=github&label)](https://github.com)
6+
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
7+
[brown9804](https://github.com/brown9804)
8+
9+
Last updated: 2025-05-02
10+
11+
----------
12+
13+
<details>
14+
<summary><b>List of References </b> (Click to expand)</summary>
15+
16+
- [Use Microsoft Purview to govern Microsoft Fabric](https://learn.microsoft.com/en-us/fabric/governance/microsoft-purview-fabric)
17+
- [The Microsoft Purview hub for data owners (preview)](https://learn.microsoft.com/en-us/fabric/governance/use-microsoft-purview-hub-data-owners)
18+
- [Govern data in Microsoft Fabric with Purview](https://learn.microsoft.com/en-us/training/modules/fabric-data-governance-purview/)
19+
20+
</details>
21+
22+
## Content
23+
24+
- [Purview + Fabric](#purview--fabric)
25+
- [Microsoft Purview Pricing: Fabric focused](#microsoft-purview-pricing-fabric-focused)
26+
- [Cost Optimization Strategies](#cost-optimization-strategies)
27+
28+
29+
## Purview + Fabric
30+
31+
> Microsoft Purview and Microsoft Fabric are integrated to provide a comprehensive `data governance and analytics platform`.
32+
> Purview enables users to `discover, classify, and protect data within Fabric`, ensuring that `data governance policies are consistently applied across the entire data estate`.
33+
> Purview's capabilities, such as `automated scanning, ingestion, classification, and sensitivity labeling, enhance the governance of Fabric's data assets, providing a unified approach to data management and compliance`.
34+
35+
| Feature | Description | Free | Charged | Interaction |
36+
|---------|-------------|------|---------|-------------|
37+
| **Automated Scanning, Ingestion & Classification** | This feature involves scanning and classifying data assets within Fabric. It helps in identifying and categorizing data based on predefined rules and policies. | - **Power BI Online**: Free for a limited time means that users can utilize automated scanning, ingestion, and classification features for Power BI Online without incurring costs during the promotional period.<br> - **SQL Server on-prem**: Free for a limited time indicates that users can use automated scanning, ingestion, and classification features for SQL Server on-premises without charges during the promotional period | **Other data sources**: $0.011 per vCore minute | Interacts with **Data Map Enrichment** by providing initial metadata. Feeds into **Report Generation** by supplying data for reports. |
38+
| **Data Map Enrichment** | Enriching metadata with advanced resource sets involves adding detailed information to the metadata, making it more useful for data governance and analysis. | N/A | $0.21 per vCore-hour | Enhances metadata from **Automated Scanning, Ingestion & Classification**. Used in **Report Generation** for detailed insights. |
39+
| **Report Generation** | Generating reports based on scanned data provides insights and summaries of the data assets, helping in decision-making and compliance monitoring. | N/A | $0.82 per vCore-hour | Utilizes enriched metadata from **Data Map Enrichment** to create comprehensive reports. |
40+
| **Elastic Data Map** | Capacity units support data map operations and metadata storage, allowing for scalable and efficient management of data assets. | N/A | $0.411 per capacity unit hour | Supports operations of **Automated Scanning, Ingestion & Classification**, **Data Map Enrichment**, and **Report Generation**. |
41+
| **Data Catalog** | Browsing and searching data assets allows users to explore and find relevant data within Fabric. This feature is included with the Data Map. | Included with Data Map | N/A | Leverages metadata and insights from other features for a unified view of data. |
42+
| **Data Policy** | Applying DevOps policies to data assets involves setting rules and policies for data management and governance. | N/A | $0.0027 per policy hour | Ensures compliance and proper data management for assets managed by other features. |
43+
| **Insights Consumption** | API calls for insights and data access allow users to retrieve and interact with data programmatically, providing flexibility in data usage. | N/A | $0.21 per API call | Enables programmatic access to data and metadata managed by other features. |
44+
45+
## Microsoft Purview Pricing: Fabric focused
46+
47+
> When Purview is integrated with Fabric, the following aspects are considered for pricing:
48+
49+
```mermaid
50+
graph TD
51+
A[Data Map] --> B[Storage]
52+
A --> C[Operations]
53+
D[Data Catalog] --> E[Storage]
54+
D --> F[Operations]
55+
G[Information Protection] --> H[Sensitivity Labels]
56+
G --> I[DLP]
57+
58+
B --> J[Charged by metadata amount]
59+
C --> K[Charged by operations]
60+
E --> L[Charged by metadata amount]
61+
F --> M[Charged by operations]
62+
H --> N[Charged by protected assets]
63+
I --> O[Charged by policies and data volume]
64+
```
65+
66+
> [!TIP]
67+
> `Compute resources` are used for `operations` such as scanning, ingestion, classification, enrichment, and report generation. These operations involve processing data and performing various tasks that require computational power. <br/>
68+
> `Storage resources` are used to `store metadata`, data maps, and other related information. This includes the storage of scanned metadata, enriched metadata, and generated reports.
69+
70+
| Aspect | Description | Pricing |
71+
|--------|-------------|---------|
72+
| **Unified Data Catalog** | **Metadata Management**: Purview captures metadata from Fabric items. | Cost is based on the amount of metadata stored and the operations performed. |
73+
| **Information Protection** | **Sensitivity Labels**: Applying sensitivity labels to Fabric items. | Charges are based on the number of assets protected. |
74+
| **Data Governance** | **Data Map Operations**: Scanning and cataloging Fabric data assets in Purview. | Charged based on the number of operations performed. |
75+
76+
> Pricing example: <br/>
77+
> 1. **Automated Scanning, Ingestion & Classification**: **Other data sources**: 0.5 minutes of total scan duration using 16 vCores. <br/>
78+
> 2. **Data Map Enrichment**: 365 hours of advanced resource set usage. <br/>
79+
> 3. **Report Generation**: 365 hours of report generation. <br/>
80+
> 4. **Elastic Data Map**: 1 capacity units used for 365 hours. `1 Capacity Unit supports requests of up to 25 data map operations per second and includes storage of up to 10 GB of metadata about data assets` <br/>
81+
> 5. **Data Catalog**: Included with the Data Map. <br/>
82+
> 6. **Data Policy**: 0 DevOps policies applied. <br/>
83+
> 7. **Insights Consumption**: 0 API calls.
84+
85+
| Feature | Description | Usage | Rate | Cost |
86+
|---------|-------------|-------|------|------|
87+
| **Automated Scanning, Ingestion & Classification** | Scanning and classifying data assets within Fabric. | 0.5 minutes x 16 vCores | $0.011 per vCore minute | $0.09 |
88+
| **Data Map Enrichment** | Enriching metadata with advanced resource sets. | 365 hours | $0.21 per vCore-hour | $76.65 |
89+
| **Report Generation** | Generating reports based on scanned data. | 365 hours | $0.82 per vCore-hour | $299.30 |
90+
| **Elastic Data Map** | Capacity units supporting data map operations and metadata storage. | 1 capacity units x 365 hours | $0.411 per capacity unit hour | $150.02 |
91+
| **Data Catalog** | Browsing and searching data assets. | Included with Data Map | N/A | $0.00 |
92+
| **Data Policy** | Applying DevOps policies to data assets. | 0 policies x 365 hours | $0.0027 per policy hour | $0.00 |
93+
| **Insights Consumption** | API calls for insights and data access. | 0 API calls | $0.21 per API call | $0.00 |
94+
95+
> Total Optimized Monthly Cost:
96+
- **Automated Scanning and Classification**: $0.09
97+
- **Data Map Enrichment**: $76.65
98+
- **Report Generation**: $299.30
99+
- **Elastic Data Map**: $150.02
100+
- **Data Policy**: $0.00
101+
- **Insights Consumption**: $0.00
102+
- **Total Optimized Monthly Cost**: $526.05
103+
104+
> [!NOTE]
105+
> Viewing reports generated by Microsoft Purview within Microsoft Fabric does not incur additional charges. The costs are primarily associated with the generation of the reports, which involves processing and computing resources. Once the reports are generated, viewing them is typically free of charge.
106+
107+
## Cost Optimization Strategies
108+
109+
| Strategy | Description | Details |
110+
|----------|-------------|---------|
111+
| **Efficient Scanning and Classification** | Optimize scanning and classification processes to reduce costs. | - **Schedule Scans**: Schedule scans during off-peak hours to reduce costs associated with high-demand periods.<br> - **Incremental Scans**: Use incremental scans to only scan new or modified data, reducing the total scan duration and vCore usage. |
112+
| **Selective Data Map Enrichment** | Focus on enriching metadata for critical data assets. | - **Prioritize Critical Data**: Focus on enriching metadata for critical data assets that require detailed governance and analysis.<br> - **Batch Processing**: Process data in batches to optimize resource usage and reduce vCore-hour costs. |
113+
| **Targeted Report Generation** | Limit report generation to essential reports. | - **Generate Essential Reports**: Limit report generation to essential reports that provide the most value for decision-making and compliance monitoring.<br> - **Optimize Report Frequency**: Adjust the frequency of report generation to balance between timely insights and cost efficiency. |
114+
| **Elastic Data Map Utilization** | Efficiently manage capacity units to optimize costs. | - **Monitor Usage**: Regularly monitor capacity unit usage to ensure efficient allocation and avoid over-provisioning.<br> - **Scale Appropriately**: Scale capacity units based on actual usage patterns to optimize costs. |
115+
| **API Call Management** | Minimize unnecessary API calls to reduce costs. | - **Optimize API Usage**: Minimize unnecessary API calls by caching results and reusing data where possible.<br> - **Batch API Requests**: Group API requests to reduce the number of individual calls, optimizing cost per API call. |
116+
117+
<div align="center">
118+
<h3 style="color: #4CAF50;">Total Visitors</h3>
119+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
120+
</div>

README.md

Lines changed: 21 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ Costa Rica
55
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
66
[brown9804](https://github.com/brown9804)
77

8-
Last updated: 2025-04-21
8+
Last updated: 2025-05-02
99

1010
------------------------------------------
1111

@@ -29,6 +29,7 @@ Last updated: 2025-04-21
2929
- [Infrastructure as Code IaC](#infrastructure-as-code-iac)
3030
- [Source Control Integration](#source-control-integration)
3131
- [Security](#security)
32+
- [Microsoft Purview for Fabric](#microsoft-purview-for-fabric)
3233
- [Networking](#networking)
3334
- [Monitoring and Observability](#monitoring-and-observability)
3435

@@ -121,6 +122,25 @@ Last updated: 2025-04-21
121122
| **Data Protection & Encryption** | - **Data Masking:** Hide sensitive information from unauthorized users. <br/> - **Audit Logs:** Keep detailed records to monitor user activities and detect anomalies. <br/> - **Encryption at Rest:** Use Azure Storage Service Encryption and Transparent Data Encryption (TDE) to protect stored data. <br/> - **Encryption in Transit:** Secure communications with TLS/SSL protocols and VPNs. |
122123
| **Networking & Granular Controls** | - **Granular Security Controls:** Implement layered security measures to comprehensively protect sensitive data. <br/> - **Networking:** Leverage Fabric’s unified platform to simplify secure network configurations. For more details, see [Networking](#networking) |
123124

125+
## Microsoft Purview for Fabric
126+
127+
> **Microsoft Purview** is a unified data governance solution that helps `organizations manage and govern` their on-premises, multi-cloud, and software-as-a-service (SaaS) data. When integrated with **Microsoft Fabric**, Purview enhances `data discovery, classification, lineage, and access control` across the entire data estate. In the context of **Microsoft Fabric**, which is an end-to-end analytics platform that unifies data engineering, data science, real-time analytics, and business intelligence, Purview plays a crucial role in:
128+
129+
- **Data Cataloging**: Automatically scanning and cataloging data assets across Fabric workspaces.
130+
- **Data Lineage**: Tracking how data flows and transforms across pipelines, notebooks, and reports.
131+
- **Access Management**: Enforcing data access policies and ensuring compliance.
132+
- **Data Classification**: Identifying sensitive data using built-in or custom classifiers.
133+
134+
> When to Integrate Purview with Fabric?
135+
136+
1. **You need centralized data governance** across multiple data sources and services within Fabric.
137+
2. **Compliance and regulatory requirements** demand visibility into data usage, classification, and lineage.
138+
3. **Your organization handles sensitive data** (e.g., PII, financial data) and needs automated classification and protection.
139+
4. **You want to empower data consumers** (analysts, scientists, engineers) to discover and understand data assets easily.
140+
5. **You are scaling your data operations** and need consistent governance policies across teams and projects.
141+
142+
Click to read more about [Microsoft Purview for Fabric - Overview](./Purview-Fabric.md).
143+
124144
## Networking
125145

126146
> Networking is a critical component of any enterprise-level data platform. In Microsoft Fabric, networking configurations are simplified and secured through its `unified platform.`:

0 commit comments

Comments
 (0)