@@ -0,0 +1,27 @@
:_newdoc-version: 2.18.3
:_template-generated: 2025-04-08

ifdef::context[:parent-context-of-configuring-openshift-ai: {context}]

:_mod-docs-content-type: ASSEMBLY

ifndef::context[]
[id="configuring-openshift-ai"]
endif::[]
ifdef::context[]
[id="configuring-openshift-ai_{context}"]
endif::[]
= Configuring {ocp-short} AI
:context: configuring-openshift-ai

The configurations that you must complete for {ocp-short} AI include creating a data science cluster instance in the {ocp-short} AI Operator. Next, you configure model-specific settings in the *Red Hat {ocp-short} AI* console.
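
For reference, the following is a minimal sketch of a data science cluster resource, assuming the {ocp-short} AI Operator's `datasciencecluster.opendatahub.io/v1` API; the resource name, component list, and management states shown here are illustrative, not prescriptive.

[source, yaml]
----
apiVersion: datasciencecluster.opendatahub.io/v1
kind: DataScienceCluster
metadata:
  name: default-dsc              # illustrative name
spec:
  components:
    dashboard:
      managementState: Managed   # enables the Red Hat OpenShift AI console
    workbenches:
      managementState: Managed
    kserve:
      managementState: Managed   # required for single-model serving
----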

include::topics/developer-lightspeed/proc_creating-datascience-cluster.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_configuring-llm-serving-runtime.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_creating-accelerator-profile.adoc[leveloffset=+1]


ifdef::parent-context-of-configuring-openshift-ai[:context: {parent-context-of-configuring-openshift-ai}]
ifndef::parent-context-of-configuring-openshift-ai[:!context:]
@@ -0,0 +1,45 @@
:_newdoc-version: 2.18.3
:_template-generated: 2025-04-08

ifdef::context[:parent-context-of-configuring-llm: {context}]

:_mod-docs-content-type: ASSEMBLY

ifndef::context[]
[id="configuring-llm"]
endif::[]
ifdef::context[]
[id="configuring-llm_{context}"]
endif::[]
= Configuring large language models for analysis
:context: configuring-llm

In an analysis, {mta-dl-plugin} provides the large language model (LLM) with the contextual prompt to identify the issues in the current application and generate suggestions to resolve them.

{mta-dl-plugin} is designed to be model agnostic. It works with LLMs that run in different environments (in local containers, as local AI, or as a shared service) to support analyzing Java applications in a wide range of scenarios. You can choose an LLM from well-known providers, local models that you run from Ollama or Podman Desktop, and OpenAI API-compatible models that are configured as Model-as-a-Service deployments.

The result of an analysis performed by {mta-dl-plugin} depends on the parameters of the LLM that you choose.

You can run an LLM from the following generative AI providers:

* OpenAI
* Azure OpenAI
* Google Gemini
* Amazon Bedrock
* Deepseek
Review comment (Member): "Deepseek" should be removed, we do not explicitly test this for downstream.

* OpenShift AI
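
After you choose a provider, you reference it in the `provider-settings.yaml` file of the {mta-dl-plugin} extension. The following is a minimal, hedged sketch for an OpenAI-hosted model; the entry name, model name, and placeholder key are illustrative assumptions that mirror the Podman Desktop example later in this guide.

[source, yaml]
----
openai_gpt: # hypothetical entry name
  provider: "ChatOpenAI"
  environment:
    OPENAI_API_KEY: "<your-api-key>" # replace with the API key from your provider
  args:
    model: "gpt-4o" # any model that your provider account can access
----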

include::topics/developer-lightspeed/con_model-as-a-service.adoc[leveloffset=+1]

include::assembly_maas-oc-install-config.adoc[leveloffset=+2]

include::assembly_configuring-openshift-ai.adoc[leveloffset=+2]

include::assembly_deploying-openshift-ai-llm.adoc[leveloffset=+2]

include::assembly_preparing-llm-analysis.adoc[leveloffset=+2]

include::topics/developer-lightspeed/proc_configuring-llm-podman-desktop.adoc[leveloffset=+1]

ifdef::parent-context-of-configuring-llm[:context: {parent-context-of-configuring-llm}]
ifndef::parent-context-of-configuring-llm[:!context:]
@@ -0,0 +1,29 @@
:_newdoc-version: 2.18.3
:_template-generated: 2025-04-08

ifdef::context[:parent-context-of-deploying-openshift-ai-llm: {context}]

:_mod-docs-content-type: ASSEMBLY

ifndef::context[]
[id="deploying-openshift-ai-llm"]
endif::[]
ifdef::context[]
[id="deploying-openshift-ai-llm_{context}"]
endif::[]
= Deploying the large language model
:context: deploying-openshift-ai-llm

To connect the {ocp-short} AI platform to a large language model (LLM), first, you must upload your LLM to a data source.

{ocp-short} AI, which runs on pods in a Red Hat {ocp-short} on AWS (ROSA) cluster, can access the LLM from a data source such as an Amazon Web Services (AWS) S3 bucket. You must create an AWS S3 bucket and configure access permissions so that the pods running in the ROSA cluster can access it. See how to enable a link:https://docs.redhat.com/en/documentation/red_hat_openshift_service_on_aws/4/html/authentication_and_authorization/assuming-an-aws-iam-role-for-a-service-account#how-service-accounts-assume-aws-iam-roles-in-user-defined-projects_assuming-an-aws-iam-role-for-a-service-account[service account to assume an AWS IAM role in ROSA pods].

Next, you must configure a data connection to the bucket and deploy the LLM from the {ocp-short} AI platform.
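
Deploying the model from the {ocp-short} AI console ultimately results in a KServe `InferenceService` resource. The following is a hedged sketch only; the model format, runtime name, bucket, and path are assumptions, and the console normally generates this resource for you.

[source, yaml]
----
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: my-llm                                  # hypothetical name
spec:
  predictor:
    model:
      modelFormat:
        name: vLLM                              # assumed model format label
      runtime: vllm-runtime                     # the serving runtime configured later in this guide
      storageUri: s3://my-models-bucket/my-llm  # bucket and path are placeholders
----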

include::topics/developer-lightspeed/proc_adding-data-connection.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_deploying-the-model.adoc[leveloffset=+1]


ifdef::parent-context-of-deploying-openshift-ai-llm[:context: {parent-context-of-deploying-openshift-ai-llm}]
ifndef::parent-context-of-deploying-openshift-ai-llm[:!context:]
@@ -0,0 +1,32 @@
:_newdoc-version: 2.18.3
:_template-generated: 2025-04-08

ifdef::context[:parent-context-of-maas-oc-install-config: {context}]

:_mod-docs-content-type: ASSEMBLY

ifndef::context[]
[id="maas-oc-install-config"]
endif::[]
ifdef::context[]
[id="maas-oc-install-config_{context}"]
endif::[]
= Installing and configuring the {ocp-short} cluster
:context: maas-oc-install-config

As a member of the hybrid cloud infrastructure team, your initial tasks for deploying a large language model (LLM) through model-as-a-service are to create {ocp-short} clusters with primary and secondary nodes and to configure an identity provider with role-based access control so that users can log in to the clusters.
Review comment (Member), on "As a member of the hybrid cloud infrastructure team": This seems strange; assume there was a mistake with including a section on "maas" for downstream docs.


Next, you configure the GPU operators required to run an LLM, GPU nodes, and auto scaling for the GPU nodes in your namespace on {ocp-short} AI. The following procedures refer to an {ocp-full} cluster hosted on Amazon Web Services.
Suggested change (Collaborator Author, @Pkylas007, Aug 26, 2025):
- Next, you configure the GPU operators required to run an LLM, GPU nodes, and auto scaling for the GPU nodes in your namespace on {ocp-short} AI. The following procedures refer to an {ocp-full} cluster hosted on Amazon Web Services.
+ Next, you configure the GPU operators required to run an LLM, GPU nodes, and auto scaling for the GPU nodes in your namespace on {ocp-short}. The following procedures refer to an {ocp-full} cluster hosted on Amazon Web Services.


include::topics/developer-lightspeed/proc_install-oc-cluster.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_configuring-operators.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_creating-gpu-machine-set.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_configuring-node-auto-scaling.adoc[leveloffset=+1]

include::topics/developer-lightspeed/proc_configuring-machine-auto-scaling.adoc[leveloffset=+1]
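
For the GPU machine set and autoscaling procedures included above, the cluster typically relies on the {ocp-short} autoscaling APIs. The following is a minimal sketch of a `MachineAutoscaler` that targets a GPU machine set; the resource name, replica counts, and machine set name are placeholder assumptions.

[source, yaml]
----
apiVersion: autoscaling.openshift.io/v1beta1
kind: MachineAutoscaler
metadata:
  name: gpu-machine-autoscaler        # hypothetical name
  namespace: openshift-machine-api
spec:
  minReplicas: 1
  maxReplicas: 2                      # add GPU nodes only when workloads need them
  scaleTargetRef:
    apiVersion: machine.openshift.io/v1beta1
    kind: MachineSet
    name: my-cluster-gpu-machineset   # replace with your GPU machine set name
----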

ifdef::parent-context-of-maas-oc-install-config[:context: {parent-context-of-maas-oc-install-config}]
ifndef::parent-context-of-maas-oc-install-config[:!context:]
@@ -0,0 +1,22 @@
:_newdoc-version: 2.18.3
:_template-generated: 2025-04-08

ifdef::context[:parent-context-of-preparing-llm-analysis: {context}]
Review comment (Member): I see the file name includes "llm-analysis"; just want to be clear that our analysis in MTA is not driven by an LLM today. Our analysis is rules-driven static code analysis. Our code suggestions and problem fixes are driven by LLMs.

:_mod-docs-content-type: ASSEMBLY

ifndef::context[]
[id="preparing-llm-analysis"]
endif::[]
ifdef::context[]
[id="preparing-llm-analysis_{context}"]
endif::[]
= Preparing the large language model for analysis
:context: preparing-llm-analysis

To access the large language model (LLM), you must create an API key for the model and update settings in {mta-dl-plugin} to enable the extension to use the LLM.

include::topics/developer-lightspeed/proc_configuring-openai-api-key.adoc[leveloffset=+1]

ifdef::parent-context-of-preparing-llm-analysis[:context: {parent-context-of-preparing-llm-analysis}]
ifndef::parent-context-of-preparing-llm-analysis[:!context:]
1 change: 1 addition & 0 deletions assemblies/developer-lightspeed-guide/topics
1 change: 1 addition & 0 deletions docs/developer-lightspeed-guide/assemblies
11 changes: 11 additions & 0 deletions docs/developer-lightspeed-guide/master-docinfo.xml
@@ -0,0 +1,11 @@
<title>MTA Developer Lightspeed Guide</title>
<productname>{DocInfoProductName}</productname>
<productnumber>{DocInfoProductNumber}</productnumber>
<subtitle>Using the {ProductName} command-line interface to migrate your applications</subtitle>
<abstract>
<para>Use {ProductFullName} Developer Lightspeed for application modernization in your organization by running Artificial Intelligence-driven static code analysis for Java applications.</para>
Review comment (Member), on "running Artificial Intelligence-driven static code analysis for Java applications": Note our analysis is not AI-driven; our code suggestions and fixes are AI-driven. Our analysis is rules-based static code analysis.

</abstract>
<authorgroup>
<orgname>Red Hat Customer Content Services</orgname>
</authorgroup>
<xi:include href="Common_Content/Legal_Notice.xml" xmlns:xi="http://www.w3.org/2001/XInclude" />
26 changes: 26 additions & 0 deletions docs/developer-lightspeed-guide/master.adoc
@@ -0,0 +1,26 @@
:mta:
include::topics/templates/document-attributes.adoc[]
:_mod-docs-content-type: ASSEMBLY
[id="mta-developer-lightspeed"]
= MTA Developer Lightspeed

:toc:
:toclevels: 4
:numbered:
:imagesdir: topics/images
:context: mta-developer-lightspeed
:mta-developer-lightspeed:

//Inclusive language statement
include::topics/making-open-source-more-inclusive.adoc[]








include::assemblies/developer-lightspeed-guide/assembly_configuring_llm.adoc[leveloffset=+1]

:!mta-developer-lightspeed:
1 change: 1 addition & 0 deletions docs/developer-lightspeed-guide/topics
23 changes: 23 additions & 0 deletions docs/topics/developer-lightspeed/con_model-as-a-service.adoc
@@ -0,0 +1,23 @@
:_newdoc-version: 2.15.0
:_template-generated: 2024-2-21

:_mod-docs-content-type: CONCEPT

[id="model-as-a-service_{context}"]
= Deploying an LLM as a scalable service

[role="_abstract"]

The code suggestions differ based on various parameters of the large language model (LLM) used for an analysis. Therefore, model-as-a-service gives you more control over using {mta-dl-plugin} with an LLM that is trained for your specific requirements than general-purpose models from public AI providers do.
Review comment (Member): I don't think we want to be documenting model-as-a-service, if this is implying the internal instance of https://github.com/rh-aiservices-bu/models-aas.


{mta-dl-plugin} is built to analyze better when it can access code changes resulting from analysis performed at scale across many application teams. In an enterprise, changes at scale become more consistent when the LLMs that generate the code change suggestions are shared across application teams than when each team uses a different LLM. This approach calls for a common enterprise strategy to manage the underlying resources that power the models and to expose those models to members of different teams.
Review comment (Member), on "is built to analyze better when it can access code changes resulting from analysis performed at scale across many application teams": This isn't quite accurate; our analysis does not improve. The analysis is the same rules-driven analysis. Code suggestions may potentially improve through better context from the Solution Server. So it is the fixing of a problem that may get better the more an organization uses MTA (if they run the Solution Server).


To cater to an enterprise-wide LLM deployment, {mta-dl-plugin} integrates with LLMs that are deployed as a scalable service on {ocp-full} clusters. These deployments, called model-as-a-service (MaaS), provide you with granular control over resources such as compute, cluster nodes, and auto-scaling graphics processing units (GPUs) while enabling you to use LLMs to perform analysis at a large scale.
Review comment (Member): I can see a potential desire to call out the ability to run LLMs on OpenShift AI. As I'm reading over this PR, I'm not quite clear what we are doing with model-as-a-service; in the past I have heard model-as-a-service referred to as the MaaS endpoint at https://maas.apps.prod.rhoai.rh-aiservices-bu.com/ from the code at https://github.com/rh-aiservices-bu/models-aas. I see the term is generic and can refer to other things; I'm just a bit concerned that some testing information may have bled over into product docs that wouldn't be appropriate.


The workflow for configuring an LLM on {ocp-short} AI can be broadly divided into the following parts:

* Installing and configuring infrastructure resources
* Configuring {ocp-short} AI
* Connecting {ocp-short} AI with the LLM
* Preparing the LLM for analysis
//* Configuring monitoring and alerting for the storage resource: creating a ConfigMap for monitoring storage and an alert configuration file.
44 changes: 44 additions & 0 deletions docs/topics/developer-lightspeed/proc_adding-data-connection.adoc
@@ -0,0 +1,44 @@
:_newdoc-version: 2.15.0
:_template-generated: 2024-2-21
:_mod-docs-content-type: PROCEDURE

[id="adding-data-connection_{context}"]
= Adding a data connection

[role="_abstract"]
In {ocp-short}, a project is a Kubernetes namespace with additional annotations, and is the main way that you can manage user access to resources. A project organizes your data science work in one place and also allows you to collaborate with other developers in your organization.

In your data science project, you must create a data connection to your existing S3-compatible storage bucket to which you uploaded a large language model.
Review comment (Member): This seems strange for us to document; it feels out of scope for MTA.


.Prerequisites

You need the following credential information for the storage buckets:

* Endpoint URL
* Access key
* Secret key
* Region
* Bucket name

If you do not have this information, contact your storage administrator.

.Procedure

. In the {ocp-short} AI web console, select *Data science projects*.
The *Data science projects* page shows a list of projects that you can access. For each user-requested project in the list, the *Name* column shows the project display name, the user who requested the project, and the project description.

. Click *Create project*.
In the *Create project* dialog, enter a unique display name for your project in the *Name* field.

. Optional: In the *Description* field, provide a project description.

. Click *Create*.
Your project is listed on the *Data science projects* page.

. Click the name of your project, select the *Connections* tab, and click *Create connection*.

. In the *Connection type* drop-down list, select *S3 compatible object storage - v1*.

. In the *Connection details* section, enter the connection name, the access key, the secret key, the endpoint of your storage bucket, and the region.

. Click *Create*.
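
When you create the connection, {ocp-short} AI stores the credentials in your data science project as a Kubernetes secret. The following is a hedged sketch of what such a secret typically looks like; the name, label, annotation, and key names are assumptions based on common {ocp-short} AI data connections and can differ in your version.

[source, yaml]
----
apiVersion: v1
kind: Secret
metadata:
  name: aws-connection-my-models        # hypothetical name
  labels:
    opendatahub.io/dashboard: "true"
  annotations:
    opendatahub.io/connection-type: s3
type: Opaque
stringData:
  AWS_ACCESS_KEY_ID: <access-key>
  AWS_SECRET_ACCESS_KEY: <secret-key>
  AWS_S3_ENDPOINT: https://s3.amazonaws.com
  AWS_DEFAULT_REGION: us-east-1
  AWS_S3_BUCKET: my-models-bucket
----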
@@ -0,0 +1,58 @@
:_newdoc-version: 2.15.0
:_template-generated: 2024-2-21
:_mod-docs-content-type: PROCEDURE

[id="configuring-llm-podman_{context}"]
= Configuring the LLM in Podman Desktop

[role="_abstract"]

The Podman AI Lab extension enables you to choose an open-source model from a curated list of models and run it locally on your system.
Review comment (Member): Covering instructions for Podman AI sounds useful. There is a concern that for practical purposes it is highly likely that users will be disappointed or frustrated running local models. By nature, the majority of models run locally are smaller and not powerful, which directly limits the potential help they can provide to MTA. For most purposes a local model will yield poor results. We likely want to be clear to instruct the user that MTA's behavior is related to the capability of a model; local models are often smaller and less powerful, and therefore they often lack the ability to significantly help with MTA use cases.


.Prerequisites

* You installed link:https://podman-desktop.io/docs/installation[Podman Desktop] in your system.

* You completed the initial configuration in {mta-dl-plugin} that is required for the analysis.

.Procedure

. Go to the Podman AI Lab extension and click *Catalog* under *Models*.

. Download one or more models.

. Go to *Services* and click *New Model Service*.

. Select a model that you downloaded in the *Model* drop-down menu and click *Create Service*.

. Click the deployed model service to open the *Service Details* page.

. Note the server URL and the model name.
You must configure these values in the {mta-dl-plugin} extension.

. Export the inference server URL as follows:
+
[source, terminal]
----
export OPENAI_API_BASE=<server-url>
----
+
. In VS Code, click *Configure GenAI Settings* to open the `provider-settings.yaml` file.

. Enter the model details from Podman Desktop. For example, use the following configuration for a Mistral model.
+
[source, yaml]
----
podman_mistral:
  provider: "ChatOpenAI"
  environment:
    OPENAI_API_KEY: "unused value"
  args:
    model: "mistral-7b-instruct-v0-2"
    base_url: "http://localhost:35841/v1"
----
+
[NOTE]
====
The Podman Desktop service endpoint does not need a password but the OpenAI library expects the `OPENAI_API_KEY` to be set. In this case, the value of the `OPENAI_API_KEY` variable does not matter.
====
@@ -0,0 +1,38 @@
:_newdoc-version: 2.15.0
:_template-generated: 2024-2-21
:_mod-docs-content-type: PROCEDURE

[id="configuring-llm-serving-runtime_{context}"]
= Configuring the LLM serving runtime

[role="_abstract"]
It takes several minutes to scale nodes and pull the image for the vLLM model-serving runtime. However, the default timeout for deploying a model with vLLM is 10 minutes, and a deployment that takes longer fails on the {ocp-short} AI cluster.
Review comment (Member): I would remove this section. Not sure we want to get into documenting how to run models via vLLM for our MTA product docs. Interesting to consider, but for this release both Engineering and QE have NOT tested MTA against a model deployed by vLLM directly.


To mitigate this issue, you must create a custom serving runtime configuration with a longer deployment deadline.

.Procedure

. On the {ocp-short} AI dashboard, click *Settings > Serving runtimes*.
The *Serving runtimes* page lists the `vLLM ServingRuntime for KServe` custom resource (CR).
`KServe` orchestrates model serving for all types of models and includes model-serving runtimes that implement the loading of given types of model servers. KServe also handles the lifecycle of the deployment object, storage access, and networking setup.

. Click the kebab menu for `vLLM ServingRuntime for KServe` and select *Duplicate serving runtime*.

. Enter a different display name for the serving runtime and increase the value for `serving.knative.dev/progress-deadline` to `60m`.

. To support multiple GPU nodes and scaling, add `--distributed-executor-backend` and `--tensor-parallel-size` to `containers.args` as follows:
+
[source, yaml]
----
spec:
  containers:
  - args:
    - --port=8080
    - --model=/mnt/models
    - --served-model-name={{.Name}}
    - --distributed-executor-backend=mp
    - --tensor-parallel-size=8
----

Next, you must create an accelerator profile if you are running a GPU node for the first time.
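
The following is a minimal sketch of an accelerator profile for NVIDIA GPUs, assuming the {ocp-short} AI `AcceleratorProfile` API; the profile name, namespace, and toleration are illustrative assumptions.

[source, yaml]
----
apiVersion: dashboard.opendatahub.io/v1
kind: AcceleratorProfile
metadata:
  name: nvidia-gpu                      # hypothetical name
  namespace: redhat-ods-applications
spec:
  displayName: NVIDIA GPU
  enabled: true
  identifier: nvidia.com/gpu            # resource name that GPU workloads request
  tolerations:
  - key: nvidia.com/gpu
    operator: Exists
    effect: NoSchedule
----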