Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
21 commits
Select commit Hold shift + click to select a range
02a80bb
Purview for Fabric
brown9804 May 2, 2025
81a732e
Merge 02a80bbdd2e0652e7b9a0cd50504a3646ca47f98 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
6a3f501
Update last modified date in Markdown files
github-actions[bot] May 2, 2025
4380364
enamed
brown9804 May 2, 2025
1b55a0a
Merge 43803648d4bec0ad8e2aa70ab8368fc2b4382682 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
121968d
overview
brown9804 May 2, 2025
7fef4e0
Merge 121968d8c0dadef04384d446f7a093fc863f0a34 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
b6bfe5c
Update last modified date in Markdown files
github-actions[bot] May 2, 2025
fe31dc4
testing pipeline
brown9804 May 2, 2025
571c84e
Merge fe31dc406973592bdaeb703e725df7bed1f0f30b into 55c02d0f36ddca75b…
brown9804 May 2, 2025
a6e231e
adding ignore rule
brown9804 May 2, 2025
5208033
Merge a6e231ee90a8572aa0b924c2a1217bf6bcafc883 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
6446ff4
patch for pipeline
brown9804 May 2, 2025
df4fa41
Merge 6446ff4eeae6280d96d353de8cdc5473c07332af into 55c02d0f36ddca75b…
brown9804 May 2, 2025
3de426a
-> rebase
brown9804 May 2, 2025
69ae8e6
Merge 3de426a7a0e57c6594764b278ddb1c6bf8824a70 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
5cff19b
pull rebase
brown9804 May 2, 2025
dc26b2c
Merge 5cff19ba41ffd484f0c8807377ced1c2da902af4 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
5ef6f0c
Fix Markdown syntax issues
github-actions[bot] May 2, 2025
7b191d5
detach head
brown9804 May 2, 2025
1f46288
Merge 7b191d5c4b8509df620f62e4084f4b5880ef9077 into 55c02d0f36ddca75b…
brown9804 May 2, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .github/.markdownlint.json
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
{
"default": true,
"MD005": false,
"MD009": false,
"MD013": false,
"MD028": false,
"MD029": false,
Expand Down
3 changes: 3 additions & 0 deletions .github/workflows/validate_and_fix_markdown.yml
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,9 @@ jobs:
- name: Commit changes
run: |
git fetch origin
git checkout -b ${{ github.event.pull_request.head.ref }} origin/${{ github.event.pull_request.head.ref }}
git add -A
git commit -m "Fix Markdown syntax issues" || echo "No changes to commit"
git pull --rebase origin ${{ github.event.pull_request.head.ref }} || echo "No rebase needed"
git push origin HEAD:${{ github.event.pull_request.head.ref }}
121 changes: 121 additions & 0 deletions Purview-Fabric.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,121 @@
# Purview for Fabric - Overview

Costa Rica

[![GitHub](https://badgen.net/badge/icon/github?icon=github&label)](https://github.com)
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
[brown9804](https://github.com/brown9804)

Last updated: 2025-05-02

----------

<details>
<summary><b>List of References </b> (Click to expand)</summary>

- [Use Microsoft Purview to govern Microsoft Fabric](https://learn.microsoft.com/en-us/fabric/governance/microsoft-purview-fabric)
- [The Microsoft Purview hub for data owners (preview)](https://learn.microsoft.com/en-us/fabric/governance/use-microsoft-purview-hub-data-owners)
- [Govern data in Microsoft Fabric with Purview](https://learn.microsoft.com/en-us/training/modules/fabric-data-governance-purview/)

</details>

## Content

- [Purview + Fabric](#purview--fabric)
- [Microsoft Purview Pricing: Fabric focused](#microsoft-purview-pricing-fabric-focused)
- [Cost Optimization Strategies](#cost-optimization-strategies)

## Purview + Fabric

> Microsoft Purview and Microsoft Fabric are integrated to provide a comprehensive `data governance and analytics platform`.
> Purview enables users to `discover, classify, and protect data within Fabric`, ensuring that `data governance policies are consistently applied across the entire data estate`.
> Purview's capabilities, such as `automated scanning, ingestion, classification, and sensitivity labeling, enhance the governance of Fabric's data assets, providing a unified approach to data management and compliance`.

| Feature | Description | Free | Charged | Interaction |
|---------|-------------|------|---------|-------------|
| **Automated Scanning, Ingestion & Classification** | This feature involves scanning and classifying data assets within Fabric. It helps in identifying and categorizing data based on predefined rules and policies. | - **Power BI Online**: Free for a limited time means that users can utilize automated scanning, ingestion, and classification features for Power BI Online without incurring costs during the promotional period.<br> - **SQL Server on-prem**: Free for a limited time indicates that users can use automated scanning, ingestion, and classification features for SQL Server on-premises without charges during the promotional period | **Other data sources**: $0.011 per vCore minute | Interacts with **Data Map Enrichment** by providing initial metadata. Feeds into **Report Generation** by supplying data for reports. |
| **Data Map Enrichment** | Enriching metadata with advanced resource sets involves adding detailed information to the metadata, making it more useful for data governance and analysis. | N/A | $0.21 per vCore-hour | Enhances metadata from **Automated Scanning, Ingestion & Classification**. Used in **Report Generation** for detailed insights. |
| **Report Generation** | Generating reports based on scanned data provides insights and summaries of the data assets, helping in decision-making and compliance monitoring. | N/A | $0.82 per vCore-hour | Utilizes enriched metadata from **Data Map Enrichment** to create comprehensive reports. |
| **Elastic Data Map** | Capacity units support data map operations and metadata storage, allowing for scalable and efficient management of data assets. | N/A | $0.411 per capacity unit hour | Supports operations of **Automated Scanning, Ingestion & Classification**, **Data Map Enrichment**, and **Report Generation**. |
| **Data Catalog** | Browsing and searching data assets allows users to explore and find relevant data within Fabric. This feature is included with the Data Map. | Included with Data Map | N/A | Leverages metadata and insights from other features for a unified view of data. |
| **Data Policy** | Applying DevOps policies to data assets involves setting rules and policies for data management and governance. | N/A | $0.0027 per policy hour | Ensures compliance and proper data management for assets managed by other features. |
| **Insights Consumption** | API calls for insights and data access allow users to retrieve and interact with data programmatically, providing flexibility in data usage. | N/A | $0.21 per API call | Enables programmatic access to data and metadata managed by other features. |

## Microsoft Purview Pricing: Fabric focused

> When Purview is integrated with Fabric, the following aspects are considered for pricing:

```mermaid
graph TD
A[Data Map] --> B[Storage]
A --> C[Operations]
D[Data Catalog] --> E[Storage]
D --> F[Operations]
G[Information Protection] --> H[Sensitivity Labels]
G --> I[DLP]

B --> J[Charged by metadata amount]
C --> K[Charged by operations]
E --> L[Charged by metadata amount]
F --> M[Charged by operations]
H --> N[Charged by protected assets]
I --> O[Charged by policies and data volume]
```

> [!TIP]
> `Compute resources` are used for `operations` such as scanning, ingestion, classification, enrichment, and report generation. These operations involve processing data and performing various tasks that require computational power. <br/>
> `Storage resources` are used to `store metadata`, data maps, and other related information. This includes the storage of scanned metadata, enriched metadata, and generated reports.

| Aspect | Description | Pricing |
|--------|-------------|---------|
| **Unified Data Catalog** | **Metadata Management**: Purview captures metadata from Fabric items. | Cost is based on the amount of metadata stored and the operations performed. |
| **Information Protection** | **Sensitivity Labels**: Applying sensitivity labels to Fabric items. | Charges are based on the number of assets protected. |
| **Data Governance** | **Data Map Operations**: Scanning and cataloging Fabric data assets in Purview. | Charged based on the number of operations performed. |

> Pricing example: <br/>
>
> 1. **Automated Scanning, Ingestion & Classification**: **Other data sources**: 0.5 minutes of total scan duration using 16 vCores. <br/>
> 2. **Data Map Enrichment**: 365 hours of advanced resource set usage. <br/>
> 3. **Report Generation**: 365 hours of report generation. <br/>
> 4. **Elastic Data Map**: 1 capacity units used for 365 hours. `1 Capacity Unit supports requests of up to 25 data map operations per second and includes storage of up to 10 GB of metadata about data assets` <br/>
> 5. **Data Catalog**: Included with the Data Map. <br/>
> 6. **Data Policy**: 0 DevOps policies applied. <br/>
> 7. **Insights Consumption**: 0 API calls.

| Feature | Description | Usage | Rate | Cost |
|---------|-------------|-------|------|------|
| **Automated Scanning, Ingestion & Classification** | Scanning and classifying data assets within Fabric. | 0.5 minutes x 16 vCores | $0.011 per vCore minute | $0.09 |
| **Data Map Enrichment** | Enriching metadata with advanced resource sets. | 365 hours | $0.21 per vCore-hour | $76.65 |
| **Report Generation** | Generating reports based on scanned data. | 365 hours | $0.82 per vCore-hour | $299.30 |
| **Elastic Data Map** | Capacity units supporting data map operations and metadata storage. | 1 capacity units x 365 hours | $0.411 per capacity unit hour | $150.02 |
| **Data Catalog** | Browsing and searching data assets. | Included with Data Map | N/A | $0.00 |
| **Data Policy** | Applying DevOps policies to data assets. | 0 policies x 365 hours | $0.0027 per policy hour | $0.00 |
| **Insights Consumption** | API calls for insights and data access. | 0 API calls | $0.21 per API call | $0.00 |

> Total Optimized Monthly Cost:

- **Automated Scanning and Classification**: $0.09
- **Data Map Enrichment**: $76.65
- **Report Generation**: $299.30
- **Elastic Data Map**: $150.02
- **Data Policy**: $0.00
- **Insights Consumption**: $0.00
- **Total Optimized Monthly Cost**: $526.05

> [!NOTE]
> Viewing reports generated by Microsoft Purview within Microsoft Fabric does not incur additional charges. The costs are primarily associated with the generation of the reports, which involves processing and computing resources. Once the reports are generated, viewing them is typically free of charge.

## Cost Optimization Strategies

| Strategy | Description | Details |
|----------|-------------|---------|
| **Efficient Scanning and Classification** | Optimize scanning and classification processes to reduce costs. | - **Schedule Scans**: Schedule scans during off-peak hours to reduce costs associated with high-demand periods.<br> - **Incremental Scans**: Use incremental scans to only scan new or modified data, reducing the total scan duration and vCore usage. |
| **Selective Data Map Enrichment** | Focus on enriching metadata for critical data assets. | - **Prioritize Critical Data**: Focus on enriching metadata for critical data assets that require detailed governance and analysis.<br> - **Batch Processing**: Process data in batches to optimize resource usage and reduce vCore-hour costs. |
| **Targeted Report Generation** | Limit report generation to essential reports. | - **Generate Essential Reports**: Limit report generation to essential reports that provide the most value for decision-making and compliance monitoring.<br> - **Optimize Report Frequency**: Adjust the frequency of report generation to balance between timely insights and cost efficiency. |
| **Elastic Data Map Utilization** | Efficiently manage capacity units to optimize costs. | - **Monitor Usage**: Regularly monitor capacity unit usage to ensure efficient allocation and avoid over-provisioning.<br> - **Scale Appropriately**: Scale capacity units based on actual usage patterns to optimize costs. |
| **API Call Management** | Minimize unnecessary API calls to reduce costs. | - **Optimize API Usage**: Minimize unnecessary API calls by caching results and reusing data where possible.<br> - **Batch API Requests**: Group API requests to reduce the number of individual calls, optimizing cost per API call. |

<div align="center">
<h3 style="color: #4CAF50;">Total Visitors</h3>
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
</div>
22 changes: 21 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ Costa Rica
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
[brown9804](https://github.com/brown9804)

Last updated: 2025-04-21
Last updated: 2025-05-02

------------------------------------------

Expand All @@ -29,6 +29,7 @@ Last updated: 2025-04-21
- [Infrastructure as Code IaC](#infrastructure-as-code-iac)
- [Source Control Integration](#source-control-integration)
- [Security](#security)
- [Microsoft Purview for Fabric](#microsoft-purview-for-fabric)
- [Networking](#networking)
- [Monitoring and Observability](#monitoring-and-observability)

Expand Down Expand Up @@ -121,6 +122,25 @@ Last updated: 2025-04-21
| **Data Protection & Encryption** | - **Data Masking:** Hide sensitive information from unauthorized users. <br/> - **Audit Logs:** Keep detailed records to monitor user activities and detect anomalies. <br/> - **Encryption at Rest:** Use Azure Storage Service Encryption and Transparent Data Encryption (TDE) to protect stored data. <br/> - **Encryption in Transit:** Secure communications with TLS/SSL protocols and VPNs. |
| **Networking & Granular Controls** | - **Granular Security Controls:** Implement layered security measures to comprehensively protect sensitive data. <br/> - **Networking:** Leverage Fabric’s unified platform to simplify secure network configurations. For more details, see [Networking](#networking) |

## Microsoft Purview for Fabric

> **Microsoft Purview** is a unified data governance solution that helps `organizations manage and govern` their on-premises, multi-cloud, and software-as-a-service (SaaS) data. When integrated with **Microsoft Fabric**, Purview enhances `data discovery, classification, lineage, and access control` across the entire data estate. In the context of **Microsoft Fabric**, which is an end-to-end analytics platform that unifies data engineering, data science, real-time analytics, and business intelligence, Purview plays a crucial role in:

- **Data Cataloging**: Automatically scanning and cataloging data assets across Fabric workspaces.
- **Data Lineage**: Tracking how data flows and transforms across pipelines, notebooks, and reports.
- **Access Management**: Enforcing data access policies and ensuring compliance.
- **Data Classification**: Identifying sensitive data using built-in or custom classifiers.

> When to Integrate Purview with Fabric?

1. **You need centralized data governance** across multiple data sources and services within Fabric.
2. **Compliance and regulatory requirements** demand visibility into data usage, classification, and lineage.
3. **Your organization handles sensitive data** (e.g., PII, financial data) and needs automated classification and protection.
4. **You want to empower data consumers** (analysts, scientists, engineers) to discover and understand data assets easily.
5. **You are scaling your data operations** and need consistent governance policies across teams and projects.

Click to read more about [Microsoft Purview for Fabric - Overview](./Purview-Fabric.md).

## Networking

> Networking is a critical component of any enterprise-level data platform. In Microsoft Fabric, networking configurations are simplified and secured through its `unified platform.`:
Expand Down