Merge branch 'main' of https://github.com/aws-samples/Cost_effective_and_scalable_Small_Language_Models_Inference_on_AWS_Graviton4_with_EKS

ddynwzh1992 · ddynwzh1992 · commit 68703f173bca · 2025-04-11T16:01:45.000+10:00
diff --git a/README.md b/README.md
@@ -1,4 +1,4 @@
-# Cost effective and Scalable Model Inference and Agentic AI on AWS Graviton with Ray on EKS
+# Cost effective and Scalable Model Inference and Agentic AI on AWS Graviton with EKS
 
 ## Overview
 The solution implements a scalable ML inference architecture using Amazon EKS, leveraging both Graviton processors for CPU-based inference and GPU instances for accelerated inference. The system utilizes Ray Serve for model serving, deployed as containerized workloads within a Kubernetes environment.

Original file line number	Diff line number	Diff line change
`@@ -1,4 +1,4 @@`
`1`		`-# Cost effective and Scalable Model Inference and Agentic AI on AWS Graviton with Ray on EKS`
	`1`	`+# Cost effective and Scalable Model Inference and Agentic AI on AWS Graviton with EKS`
`2`	`2`
`3`	`3`	`## Overview`
`4`	`4`	`The solution implements a scalable ML inference architecture using Amazon EKS, leveraging both Graviton processors for CPU-based inference and GPU instances for accelerated inference. The system utilizes Ray Serve for model serving, deployed as containerized workloads within a Kubernetes environment.`