Skip to content

Commit f2d8681

Browse files
Merge pull request #2159 from jasonrandrews/review
Updates to Arcee Graviton4 Learning Path
2 parents 4315638 + 3e73e27 commit f2d8681

File tree

5 files changed

+6
-7
lines changed

5 files changed

+6
-7
lines changed

content/learning-paths/servers-and-cloud-computing/arcee-foundation-model-on-aws/01_launching_a_graviton4_instance.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ If you're new to EC2, check out the Learning Path [Getting Started with AWS](/le
2020

2121
To deploy the Arcee AFM-4.5B model, you need an EC2 instance running on Arm-based Graviton4 hardware.
2222

23-
To do this, start by signing in to the [AWS Management Console](https://console.aws.amazon.com), then navigate to the **EC2** service.
23+
To do this, start by signing in to the [AWS Management Console](https://console.aws.amazon.com), change to your preferred region, and navigate to the **EC2** service.
2424

2525
From there, you can create an SSH key pair that allows you to connect to your instance securely.
2626

@@ -39,7 +39,7 @@ mkdir -p ~/.ssh
3939
mv arcee-graviton4-key.pem ~/.ssh/
4040
chmod 400 ~/.ssh/arcee-graviton4-key.pem
4141
```
42-
internet
42+
4343
## Launch and configure the EC2 instance
4444

4545
In the left sidebar of the EC2 dashboard, select **Instances**, and then **Launch instances**.

content/learning-paths/servers-and-cloud-computing/arcee-foundation-model-on-aws/03_building_llama_cpp.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -87,4 +87,4 @@ After compilation, you'll find several key command-line tools in the `bin` direc
8787

8888
You can find more tools and usage details in the llama.cpp [GitHub repository](https://github.com/ggml-org/llama.cpp/tree/master/tools).
8989

90-
These binaries are specifically optimized for Arm64 architecture and will provide excellent performance on your Graviton4 instance.
90+
These binaries are specifically optimized for the Arm architecture and will provide excellent performance on your Graviton4 instance.

content/learning-paths/servers-and-cloud-computing/arcee-foundation-model-on-aws/05_downloading_and_optimizing_afm45b.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ This command creates a 4-bit quantized version of the model:
6666

6767
## Arm optimization
6868

69-
Arm has contributed optimized kernels for Q4_0 that use Neoverse v2 instruction sets. These low-level routines accelerate math operations, delivering strong performance on Graviton4.
69+
Arm has contributed optimized kernels for Q4_0 that use Neoverse V2 instruction sets. These low-level routines accelerate math operations, delivering strong performance on Graviton4.
7070

7171
These instruction sets allow Llama.cpp to run quantized operations significantly faster than generic implementations, making Arm processors a competitive choice for inference workloads.
7272

@@ -85,7 +85,7 @@ This command creates an 8-bit quantized version of the model:
8585

8686
## Arm optimization
8787

88-
Similar to Q4_0, Arm has contributed optimized kernels for Q8_0 quantization that take advantage of Neoverse v2 instruction sets. These optimizations provide excellent performance for 8-bit operations while maintaining higher accuracy compared to 4-bit quantization.
88+
Similar to Q4_0, Arm has contributed optimized kernels for Q8_0 quantization that take advantage of Neoverse V2 instruction sets. These optimizations provide excellent performance for 8-bit operations while maintaining higher accuracy compared to 4-bit quantization.
8989

9090
## Model files ready for inference
9191

content/learning-paths/servers-and-cloud-computing/arcee-foundation-model-on-aws/08_conclusion.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ The benchmarking results demonstrate the power of quantization and Arm-based com
3636

3737
## The AWS Graviton4 advantage
3838

39-
AWS Graviton4 processors, built on the Arm Neoverse-V2 architecture, provide:
39+
AWS Graviton4 processors, built on the Arm Neoverse V2 architecture, provide:
4040

4141
- Superior performance per watt compared to x86 alternatives
4242
- Cost savings of 20–40% for compute-intensive workloads

content/learning-paths/servers-and-cloud-computing/arcee-foundation-model-on-aws/_index.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,6 @@ learning_objectives:
1818

1919
prerequisites:
2020
- An [AWS account](https://aws.amazon.com/) with permission to launch Graviton4 (`c8g.4xlarge` or larger) instances
21-
- At least 128 GB of available storage
2221
- Basic familiarity with Linux and SSH
2322

2423
author: Julien Simon

0 commit comments

Comments
 (0)