
Commit 5067c15

Fix Markdown syntax issues
1 parent bf883b8 commit 5067c15

1 file changed

Workloads-Specific/DataScience/BestPractices.md

Lines changed: 9 additions & 9 deletions
@@ -30,23 +30,23 @@ Last updated: 2025-05-03
> Ensure that your data science workflows in Microsoft Fabric are built for rapid experimentation, efficient model management, and seamless deployment. Each element should be managed with clear versioning, detailed documentation, and reproducible environments, enabling a smooth transition from experimentation to production.

## ML Model Management

> Use model registries integrated within Fabric to store and version your models. Include a descriptive README, link relevant experiment IDs, and attach performance metrics such as accuracy, AUC, and confusion matrices. For example, link your production-ready model (v#.#) from a registered repository along with its associated validation metrics and deployment instructions.
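
A minimal sketch of what registering such a model can look like with the MLflow APIs that Fabric notebooks expose; the dataset, model name, and metrics below are illustrative stand-ins for your own.

```python
import mlflow
from mlflow.models import infer_signature
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import train_test_split

# Illustrative data; in practice this would come from your lakehouse.
X, y = make_classification(n_samples=1_000, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

with mlflow.start_run() as run:
    model = RandomForestClassifier(random_state=42).fit(X_train, y_train)
    preds = model.predict(X_test)

    # Attach the validation metrics reviewers should see next to the model.
    mlflow.log_metric("accuracy", accuracy_score(y_test, preds))
    mlflow.log_metric("auc", roc_auc_score(y_test, model.predict_proba(X_test)[:, 1]))

    # Log and register the model under a versioned name (name is hypothetical).
    mlflow.sklearn.log_model(
        model,
        artifact_path="model",
        signature=infer_signature(X_train, model.predict(X_train)),
        registered_model_name="churn-classifier",
    )
```

The run ID captured by `mlflow.start_run()` is what you can reference from the model's README, alongside the logged metrics and deployment instructions.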

## Experiment Tracking & Management

> Set up an experiment dashboard that automatically logs training runs. For instance, record runs with various hyperparameter combinations, tag them with unique identifiers, and visualize comparative metrics over multiple iterations. This dashboard can help you decide whether a model trained with early stopping or one with higher epochs best meets performance goals.
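
A short sketch of the automatic run logging such a dashboard draws from, using MLflow tracking; the experiment name, hyperparameter grid, and model are hypothetical placeholders for your own training routine.

```python
from itertools import product

import mlflow
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Illustrative dataset; replace with your own training data.
X, y = make_classification(n_samples=1_000, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

mlflow.set_experiment("credit-risk-tuning")  # hypothetical experiment name

# Log each hyperparameter combination as its own tagged run so the
# experiment view can compare metrics across iterations.
for C, max_iter in product([0.1, 1.0], [100, 500]):
    with mlflow.start_run(run_name=f"C={C}-iters={max_iter}"):
        mlflow.log_params({"C": C, "max_iter": max_iter})
        mlflow.set_tag("candidate", f"C{C}-it{max_iter}")

        model = LogisticRegression(C=C, max_iter=max_iter).fit(X_train, y_train)
        val_auc = roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])
        mlflow.log_metric("val_auc", val_auc)
```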
<https://github.com/user-attachments/assets/4c73eaaa-cf03-47cf-807b-69007c8df704>

## Reproducible Environments

> Create an environment file (e.g., Conda `environment.yml`) that lists all required Python packages and their versions. For example, specify TensorFlow 2.9, scikit-learn 1.0, and other dependencies so that every data scientist and deployment pipeline uses the exact setup. Use Microsoft Fabric workspaces to segregate development and production environments, ensuring that models are trained and evaluated in a consistent setting.
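
A minimal `environment.yml` along the lines described; the pins mirror the versions named above and are illustrative, so replace them with whatever your workspace standardizes on.

```yaml
name: fabric-ds
channels:
  - conda-forge
dependencies:
  - python=3.9
  - scikit-learn=1.0
  - tensorflow=2.9
  - pandas
  - pip
  - pip:
      - mlflow
```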
<https://github.com/user-attachments/assets/fcce754d-afd3-4267-aa0f-bba87c0a3089>

## Data Agent (Preview) Usage

> Integrate the Data Agent into your pipeline to automatically validate incoming datasets for completeness and consistency. For instance, set up rules that flag missing data or out-of-range values and trigger notifications when anomalies are detected. Track and document these incidents to help refine the agent’s calibration, ensuring that data passing to your experiments meets quality standards.
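
The Data Agent's own configuration surface is not shown here; as a stand-in, this is a generic sketch of the completeness and range rules described above, written with pandas, that could run ahead of an experiment and flag anomalies for follow-up. Column names and thresholds are hypothetical.

```python
import pandas as pd

# Hypothetical rules of the kind described above: required columns must be
# present and non-null, and numeric values must fall inside expected ranges.
REQUIRED_COLUMNS = ["customer_id", "age", "balance"]
RANGE_RULES = {"age": (0, 120), "balance": (0, 1_000_000)}

def validate(df: pd.DataFrame) -> list:
    """Return human-readable anomaly descriptions (empty list if clean)."""
    issues = []
    for col in REQUIRED_COLUMNS:
        if col not in df.columns:
            issues.append(f"missing column: {col}")
        elif df[col].isna().any():
            issues.append(f"{int(df[col].isna().sum())} missing values in {col}")
    for col, (lo, hi) in RANGE_RULES.items():
        if col in df.columns:
            out_of_range = int((~df[col].dropna().between(lo, hi)).sum())
            if out_of_range:
                issues.append(f"{out_of_range} out-of-range values in {col}")
    return issues

# Example: flag incidents so they can be tracked and documented.
batch = pd.DataFrame({"customer_id": [1, 2], "age": [34, 150], "balance": [100.0, None]})
for issue in validate(batch):
    print("DATA QUALITY FLAG:", issue)
```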