Skip to content

Commit 538c3ca

Browse files
authored
Merge pull request #18 from MicrosoftCloudEssentials-LearningHub/bestpractices7of9
best practices overview 6 of 7 workloads at the end
2 parents 1232088 + 3b56667 commit 538c3ca

File tree

9 files changed

+825
-32
lines changed

9 files changed

+825
-32
lines changed

README.md

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -173,7 +173,7 @@ Click to read more about [Microsoft Purview for Fabric - Overview](./Workloads-S
173173
- **Microsoft [Fabric Capacity Metrics](https://github.com/MicrosoftCloudEssentials-LearningHub/Fabric-EnterpriseFramework/blob/main/Monitoring-Observability/MonitorUsage.md#microsoft-fabric-capacity-metrics-app) app**: Powerful tool for administrators to `monitor and manage their capacity usage`. It provides detailed insights into `capacity utilization, throttling, and system events, helping to optimize performance and resource allocation`. By tracking these metrics, admins can make informed decisions to ensure efficient use of resources.
174174
- **Admin Monitoring**: Configure and use the [Admin Monitoring Workspace](https://github.com/MicrosoftCloudEssentials-LearningHub/Fabric-EnterpriseFramework/blob/main/Monitoring-Observability/MonitorUsage.md#admin-monitoring) it's a centralized hub for `tracking and analyzing usage metrics across the organization`. It includes `pre-built reports and semantic models that provide insights into feature adoption, performance, and compliance`. This workspace helps administrators maintain the health and efficiency of their Fabric environment by offering a comprehensive `view of usage patterns and system events`.
175175
- **Monitor Hub**: Access and utilize the [Monitor Hub](https://github.com/MicrosoftCloudEssentials-LearningHub/Fabric-EnterpriseFramework/blob/main/Monitoring-Observability/MonitorUsage.md#monitor-hub). Allows users to `view and track the status of activities across all workspaces they have permissions for`. It provides a detailed overview of operations, `including dataset refreshes, Spark job runs, and other activities`. With features like historical views, customizable displays, and filtering options, the Monitor Hub helps ensure smooth operations and timely interventions when needed.
176-
- **Event Hub Integration**: Use Event Hub to capture and analyze events for real-time monitoring. For example, leverage it for [Automating pipeline execution with Activator](./Monitoring-Observability/FabricActivatorRulePipeline/)
176+
- **Event Hub Integration**: Use Event Hub to capture and analyze events for real-time monitoring. For example, leverage it for [Automating pipeline execution with Activator](./Workloads-Specific/RealTimeIntelligence/FabricActivatorRulePipeline/)
177177
- **Alerting**: Configure alerts for critical events and thresholds to ensure timely responses to issues. For example, [Steps to Configure Capacity Alerts](./Monitoring-Observability/StepsCapacityAlert.md)
178178

179179
## Cost Management
@@ -202,12 +202,10 @@ Click to read more about [Microsoft Purview for Fabric - Overview](./Workloads-S
202202
- [Azure Data Factory (ADF) - Best Practices Overview](./Workloads-Specific/DataFactory/BestPractices.md)
203203
- [Data Engineering - Best Practices Overview](./Workloads-Specific/DataEngineering/BestPractices.md)
204204
- [Data Warehouse - Best Practices Overview](./Workloads-Specific/DataWarehouse/BestPractices.md)
205-
- [Data Science - Best Practices Overview](./Workloads-Specific/DataScience/BestPractices.md) - in progress
206-
- [Real-Time Intelligence - Best Practices Overview](./Workloads-Specific/RealTimeIntelligence/BestPractices.md) - in progress
205+
- [Data Science - Best Practices Overview](./Workloads-Specific/DataScience/BestPractices.md)
206+
- [Real-Time Intelligence - Best Practices Overview](./Workloads-Specific/RealTimeIntelligence/BestPractices.md)
207207
- [Power Bi - Best Practices Overview](./Workloads-Specific/PowerBi/BestPractices.md)
208-
- [Copilot - Best Practices Overview](./Workloads-Specific/Copilot/BestPractices.md) - in progress
209208
- [Purview - Best Practices Overview](./Workloads-Specific/Purview/BestPractices.md) - in progress
210-
- [OneLake - Best Practices Overview](./Workloads-Specific/OneLake/BestPractices.md) - in progress
211209

212210
<div align="center">
213211
<h3 style="color: #4CAF50;">Total Visitors</h3>

Workloads-Specific/DataScience/BestPractices.md

Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,8 +13,47 @@ Last updated: 2025-05-03
1313
<details>
1414
<summary><b>List of References</b> (Click to expand)</summary>
1515

16+
- [What is Data Science in Microsoft Fabric?](https://learn.microsoft.com/en-us/fabric/data-science/data-science-overview)
17+
- [Data Science documentation in Microsoft Fabric](https://learn.microsoft.com/en-us/fabric/data-science/)
18+
19+
</details>
20+
21+
<details>
22+
<summary><b>Table of Content</b> (Click to expand)</summary>
23+
24+
- [ML Model Management](#ml-model-management)
25+
- [Experiment Tracking & Management](#experiment-tracking--management)
26+
- [Reproducible Environments](#reproducible-environments)
27+
- [Data Agent Preview Usage](#data-agent-preview-usage)
28+
1629
</details>
1730

31+
> Ensure that your data science workflows in Microsoft Fabric are built for rapid experimentation, efficient model management, and seamless deployment. Each element should be managed with clear versioning, detailed documentation, and reproducible environments, enabling a smooth transition from experimentation to production.
32+
33+
<div align="center">
34+
<img src="https://github.com/user-attachments/assets/f86cdba7-e9a6-4ce1-8dcc-912b7f438398" alt="Centered Image" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
35+
</div>
36+
37+
## ML Model Management
38+
39+
> Use model registries integrated within Fabric to store and version your models. Include a descriptive README, link relevant experiment IDs, and attach performance metrics such as accuracy, AUC, and confusion matrices. For example, link your production-ready model (v#.#) from a registered repository along with its associated validation metrics and deployment instructions.
40+
41+
## Experiment Tracking & Management
42+
43+
> Set up an experiment dashboard that automatically logs training runs. For instance, record runs with various hyperparameter combinations, tag them with unique identifiers, and visualize comparative metrics over multiple iterations. This dashboard can help you decide whether a model trained with early stopping or one with higher epochs best meets performance goals.
44+
45+
## Reproducible Environments
46+
47+
> Create an environment file (e.g., Conda `environment.yml`) that lists all required Python packages and their versions. For example, specify TensorFlow 2.9, scikit-learn 1.0, and other dependencies so that every data scientist and deployment pipeline uses the exact setup. Use Microsoft Fabric workspaces to segregate development and production environments, ensuring that models are trained and evaluated in a consistent setting.
48+
49+
<https://github.com/user-attachments/assets/fcce754d-afd3-4267-aa0f-bba87c0a3089>
50+
51+
## Data Agent (Preview) Usage
52+
53+
> Integrate the Data Agent into your pipeline to automatically validate incoming datasets for completeness and consistency. For instance, set up rules that flag missing data or out-of-range values and trigger notifications when anomalies are detected. Track and document these incidents to help refine the agent’s calibration, ensuring that data passing to your experiments meets quality standards.
54+
55+
Click to read [Demonstration: Data Agents in Microsoft Fabric](./Data_Agents.md).
56+
1857
<div align="center">
1958
<h3 style="color: #4CAF50;">Total Visitors</h3>
2059
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>

Workloads-Specific/Copilot/BestPractices.md renamed to Workloads-Specific/DataScience/How_AutoML/README.md

Lines changed: 9 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Copilot - Best Practices Overview
1+
# Demonstration: How to train a ML model with AutoML
22

33
Costa Rica
44

@@ -10,10 +10,15 @@ Last updated: 2025-05-03
1010

1111
----------
1212

13-
<details>
14-
<summary><b>List of References</b> (Click to expand)</summary>
13+
> How to create an experiment to train a ML model with AutoML:
1514
16-
</details>
15+
<https://github.com/user-attachments/assets/4c73eaaa-cf03-47cf-807b-69007c8df704>
16+
17+
Click to see notebook generated [Train a ML model with AutoML](./Train_MLmodel_AutoML.ipynb)
18+
19+
> Run the notebook the generated:
20+
21+
<https://github.com/user-attachments/assets/6dfedbac-beb7-4025-9a42-f98dade7f431>
1722

1823
<div align="center">
1924
<h3 style="color: #4CAF50;">Total Visitors</h3>

0 commit comments

Comments
 (0)