---
title: Compare the performance of PyTorch backends
weight: 5

### FIXED, DO NOT MODIFY
layout: learningpathall
---

Continuously monitoring the performance of your machine learning models in production is crucial to maintaining their effectiveness over time. Model performance can change due to various factors, ranging from data-related issues to environmental changes.

In this section, you will change the PyTorch backend used to test the trained model, and you will learn how to measure and continuously monitor inference performance using your workflow.

## oneDNN with Arm Compute Library (ACL)

In the previous section, you used a PyTorch 2.3.0 Docker image from DockerHub, compiled with OpenBLAS, to run your testing workflow. PyTorch can also run with other backends. You will now modify the testing workflow to use a PyTorch 2.3.0 Docker image compiled with oneDNN and the Arm Compute Library.

The [Arm Compute Library](https://github.com/ARM-software/ComputeLibrary) is a collection of low-level machine learning functions optimized for Arm's Cortex-A and Neoverse processors and Mali GPUs. Arm-hosted GitHub runners use Arm Neoverse CPUs, making it possible to optimize your neural networks to take advantage of processor features. ACL implements kernels (also known as operators or layers) using specific instructions that run faster on AArch64.

ACL is integrated into PyTorch through [oneDNN](https://github.com/oneapi-src/oneDNN), an open-source deep neural network library.

## Modify the test workflow and compare results

Two different PyTorch Docker images for Arm Neoverse CPUs are available on [DockerHub](https://hub.docker.com/r/armswdev/pytorch-arm-neoverse).

Up until this point, you used the `r24.07-torch-2.3.0-openblas` container image to run workflows. A oneDNN container image is also available. These images provide two different PyTorch backends, which handle the execution of the PyTorch model.

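If you want to run both backends side by side rather than editing the image tag each time, you can use a build matrix. The sketch below is an optional variation, not part of this Learning Path's workflow; it assumes the same custom runner label and steps as the existing `test_model.yml`, and the `backend` values are the two image tags from DockerHub:

```yaml
jobs:
  test-model:
    name: Test the Model (${{ matrix.backend }})
    runs-on: ubuntu-22.04-arm-os # Custom ARM64 runner
    strategy:
      matrix:
        backend: [r24.07-torch-2.3.0-openblas, r24.07-torch-2.3.0-onednn-acl]
    container:
      image: armswdev/pytorch-arm-neoverse:${{ matrix.backend }}
      options: --user root
    # Steps omitted
```

With a matrix, a single workflow run produces one job per backend, so the two profiler outputs can be compared directly in the same Actions run.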
### Change the Docker image to use oneDNN

In your browser, open and edit the file `.github/workflows/test_model.yml`.

Update the `container.image` parameter to `armswdev/pytorch-arm-neoverse:r24.07-torch-2.3.0-onednn-acl` and save the file by committing the change to the main branch:

```yaml
jobs:
  test-model:
    name: Test the Model
    runs-on: ubuntu-22.04-arm-os # Custom ARM64 runner
    container:
      image: armswdev/pytorch-arm-neoverse:r24.07-torch-2.3.0-onednn-acl
      options: --user root
    # Steps omitted
```

### Run the test workflow

Trigger the **Test Model** job again by clicking the `Run workflow` button on the `Actions` tab.

The test workflow starts running.

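The `Run workflow` button appears because the workflow declares a manual trigger. If you are adapting this setup to your own workflow file, the trigger is a minimal fragment like the following (shown here as a sketch; your `test_model.yml` may already include it from an earlier section):

```yaml
on:
  workflow_dispatch: # enables the manual "Run workflow" button on the Actions tab
```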
Navigate to the workflow run on the `Actions` tab, click into the job, and expand the **Run testing script** step.

You see a change in the performance results now that oneDNN and ACL kernels are being used.

The output is similar to:

```output
Accuracy of the model on the test images: 90.48%
---------------------------------  ------------  ------------  ------------  ------------  ------------  ------------
                             Name    Self CPU %      Self CPU   CPU total %     CPU total  CPU time avg    # of Calls
---------------------------------  ------------  ------------  ------------  ------------  ------------  ------------
                  model_inference         4.63%     304.000us       100.00%       6.565ms       6.565ms             1
                     aten::conv2d         0.18%      12.000us        56.92%       3.737ms       1.869ms             2
                aten::convolution         0.30%      20.000us        56.74%       3.725ms       1.863ms             2
               aten::_convolution         0.43%      28.000us        56.44%       3.705ms       1.853ms             2
         aten::mkldnn_convolution        47.02%       3.087ms        55.48%       3.642ms       1.821ms             2
                 aten::max_pool2d         0.15%      10.000us        25.51%       1.675ms     837.500us             2
    aten::max_pool2d_with_indices        25.36%       1.665ms        25.36%       1.665ms     832.500us             2
                     aten::linear         0.18%      12.000us         9.26%     608.000us     304.000us             2
                      aten::clone         0.26%      17.000us         9.08%     596.000us     149.000us             4
                      aten::addmm         8.50%     558.000us         8.71%     572.000us     286.000us             2
---------------------------------  ------------  ------------  ------------  ------------  ------------  ------------
Self CPU time total: 6.565ms
```
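To monitor performance continuously rather than by eye, you can parse the profiler output in a later workflow step. The helper below is a sketch, not part of this Learning Path's scripts; it assumes the `Self CPU time total` line has the format shown in the output above and normalizes the value to milliseconds so backends can be compared numerically:

```python
import re

def parse_self_cpu_total_ms(profiler_output: str) -> float:
    """Extract 'Self CPU time total' from a PyTorch profiler table, in milliseconds."""
    match = re.search(r"Self CPU time total:\s*([\d.]+)(us|ms|s)", profiler_output)
    if match is None:
        raise ValueError("no 'Self CPU time total' line found in profiler output")
    value, unit = float(match.group(1)), match.group(2)
    # Normalize to milliseconds regardless of the unit the profiler chose.
    scale = {"us": 1e-3, "ms": 1.0, "s": 1e3}
    return value * scale[unit]

# Example using the total from the oneDNN + ACL run above:
print(parse_self_cpu_total_ms("Self CPU time total: 6.565ms"))  # -> 6.565
```

A workflow step can run this over the captured log for each backend and record the results as build artifacts for trend tracking.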

For the ACL results, notice that the **Self CPU time total** is lower than in the OpenBLAS run from the previous section.

The names of some operators have also changed: `aten::mkldnn_convolution` is the convolution kernel optimized for the Arm architecture, and this operator, made available through ACL, is the main reason the inference time improves.

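Once the total time is a number you can compute, a simple way to keep monitoring inference performance is to fail the workflow when it regresses past a budget. The function below is a hypothetical sketch; the 10 ms budget is an arbitrary example value, not a threshold from this Learning Path:

```python
import sys

def check_inference_budget(total_ms: float, budget_ms: float) -> bool:
    """Return True if the measured inference time is within budget.

    Prints a CI-friendly message either way.
    """
    if total_ms <= budget_ms:
        print(f"OK: {total_ms:.3f}ms is within the {budget_ms:.3f}ms budget")
        return True
    print(f"REGRESSION: {total_ms:.3f}ms exceeds the {budget_ms:.3f}ms budget")
    return False

if __name__ == "__main__":
    # Example: the oneDNN + ACL run above reported 6.565 ms total.
    if not check_inference_budget(6.565, budget_ms=10.0):
        sys.exit(1)  # a non-zero exit code fails the GitHub Actions step
```

Running a check like this as the final step of the test workflow turns the profiler numbers into an automatic performance gate.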
In the next section, you will learn how to automate the deployment of your model.