You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
description: This module covers the principles and practices for architecting generative AI workloads that are robust, scalable, and aligned with best practices.
5
+
title: Well architected AI workloads
6
+
description: This module covers the principles and practices for architecting AI workloads that are robust, scalable, and aligned with best practices.
A generative AI CoE typically focuses on setting direction, defining strategies, providing support, establishing metrics, and monitoring the impact of generative AI initiatives. A generative AI CoE provides an organization with a central location and set of people that have the knowledge, skills, and capabilities to effectively leverage AI. When properly implemented, a generative AI CoE holds the authority and influence within an organization needed to drive adoption of generative AI solutions.
1
+
An AI CoE typically focuses on setting direction, defining strategies, providing support, establishing metrics, and monitoring the impact of AI initiatives. An AI CoE provides an organization with a central location and set of people that have the knowledge, skills, and capabilities to effectively leverage AI. When properly implemented, an AI CoE holds the authority and influence within an organization needed to drive adoption of AI solutions.
2
2
3
-
GenAIOps is a set of methods and practices aligned with the deployment, management, and operation of generative AI workloads. To build effective GenAI solutions, developers need to select the right models, and also understand how these models fit into the broader operational framework to develop an application. The end-to-end application development of a GenAI solution is also referred to as Large Language Model Operations (LLMOps), or GenAI Operations (GenAIOps). GenAIOps involves tasks like model discovery and refining pretrained models with enriched data. The overall goal of GenAIOps is capability delivery with efficient operations throughout the development lifecycle. The expected outcomes are:
3
+
GenAIOps is a set of methods and practices aligned with the deployment, management, and operation of AI workloads. To build effective GenAI solutions, developers need to select the right models, and also understand how these models fit into the broader operational framework to develop an application. The end-to-end application development of a GenAI solution is also referred to as Large Language Model Operations (LLMOps), or GenAI Operations (GenAIOps). GenAIOps involves tasks like model discovery and refining pretrained models with enriched data. The overall goal of GenAIOps is capability delivery with efficient operations throughout the development lifecycle. The expected outcomes are:
4
4
5
5
- Repeatable processes with consistent results.
6
6
- Sustained accuracy of the models over time.
7
7
- Effective governance that minimizes risks.
8
8
- Change management processes for adapting to model drift.
9
9
10
-
An organization's generative AI CoE can help ensuring that appropriate GenAIOps processes are implemented throughout a generative AI workload's lifecycle. GenAIOps methods and practices include the following elements:
10
+
An organization's AI CoE can help ensuring that appropriate GenAIOps processes are implemented throughout an AI workload's lifecycle. GenAIOps methods and practices include the following elements:
11
11
12
12
- The workload integrates security and compliance technologies. Security and compliance are built into the workload rather than addressed as an ancillary concern.
13
13
- Model choice and refinement. Evaluate existing models to identify one that's pretrained for the specific workload use case. Refine the model for domain-specific grounding.
14
-
- Monitoring and Metrics. Ensure that generative AI workloads are continually assessed to determine if they meet organizational needs, are cost effective, and meet Return on Investment (ROI) goals. Also determines whether the output of the AI workload is generating appropriate or anomalous results.
14
+
- Monitoring and Metrics. Ensure that AI workloads are continually assessed to determine if they meet organizational needs, are cost effective, and meet Return on Investment (ROI) goals. Also determines whether the output of the AI workload is generating appropriate or anomalous results.
15
15
- Workload governance. Ensure that workloads include the appropriate mechanisms to allow them to configured meet relevant governance and compliance requirements, and that the workloads meet those requirements.
16
16
- Ensures that workloads include versioning and rollback mechanisms. This allows the workload deployment to be automated, for updates to be rolled out quickly, and for rollback to occur should a new deployment lead to undesired outcomes.
17
17
18
-
While an organization's generative AI CoE provides strategic guidance, frameworks, and recommended best practices for GenAIOps, the implementation and operational execution of these frameworks is usually managed by individual dedicated teams within the organization.
18
+
While an organization's AI CoE provides strategic guidance, frameworks, and recommended best practices for GenAIOps, the implementation and operational execution of these frameworks is usually managed by individual dedicated teams within the organization.
19
19
20
20
The CoE facilitates GenAIOps processes by providing training, templates, and governance policies. The CoE helps ensure that workloads are implemented with a consistent set of technologies, are scalable, and adhere to governance standards across the enterprise.
21
21
@@ -26,10 +26,10 @@ The CoE should provide GenAIOps guidance on the following elements of workload o
26
26
- Cost management processes
27
27
- Monitoring and optimization processes
28
28
29
-

29
+

30
30
31
-
There should be bidirectional communication between the generative AI CoE and workload teams.
31
+
There should be bidirectional communication between the AI CoE and workload teams.
32
32
33
33
- A CoE disseminate recommended methods and practices to workload teams
34
34
- Workload teams provide feedback to the CoE on the effectiveness of those methods and practices based on actual operational experience
35
-
- The CoE can redistribute that feedback to other generative AI workload teams in the organization.
35
+
- The CoE can redistribute that feedback to other AI workload teams in the organization.
0 commit comments