Skip to content

Commit c4e6b90

Browse files
authored
Merge pull request #68 from AI-Hypercomputer/bvandermoon-tpu-recipes
Recommend Python 3.10 in MaxText instructions and other instruction c…
2 parents 748140e + 48353cc commit c4e6b90

File tree

2 files changed

+18
-7
lines changed

2 files changed

+18
-7
lines changed

training/trillium/MAXTEXT_README.md

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,14 @@
1-
# Prep for Maxtext workloads on GKE
1+
# Prep for MaxText workloads on GKE
22

33
> **_NOTE:_** We recommend running these instructions and kicking off your recipe
4-
workloads from a TPU VM.
4+
workloads from a VM in GCP using Python 3.10.
55

6-
1. Clone [Maxtext](https://github.com/google/maxtext) repo and move to its directory
6+
1. Clone [MaxText](https://github.com/google/maxtext) repo and move to its directory
77
```shell
88
git clone https://github.com/google/maxtext.git
99
cd maxtext
1010
# Checkout either the commit id or MaxText tag.
11-
# Example: `git checkout tpu-recipes-v0.1.1`
11+
# Example: `git checkout tpu-recipes-v0.1.2`
1212
git checkout ${MAXTEXT_COMMIT_ID_OR_TAG}
1313
```
1414

@@ -18,7 +18,7 @@ bash setup.sh
1818
```
1919

2020
Optional: Use a virtual environment to setup and run your workloads. This can help with errors
21-
like `This environment is externally managed`.
21+
like `This environment is externally managed`:
2222
```shell
2323
## One time step of creating the venv
2424
VENV_DIR=~/venvp3
@@ -29,6 +29,10 @@ source $VENV_DIR/bin/activate
2929
bash setup.sh
3030
```
3131

32+
> **_NOTE:_** If you use a virtual environment, you must use the same one when running the
33+
[XPK Installation](https://github.com/AI-Hypercomputer/xpk?tab=readme-ov-file#installation)
34+
steps linked in the [XPK_README](XPK_README.md) as well as your relevant tpu-recipe workloads.
35+
3236
3. Run the following commands to build the docker image
3337
```shell
3438
# Example BASE_IMAGE=us-docker.pkg.dev/cloud-tpu-images/jax-stable-stack/tpu:jax0.5.2-rev1

training/trillium/XPK_README.md

Lines changed: 9 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
## Initialization
22

33
> **_NOTE:_** We recommend running these instructions and kicking off your recipe
4-
workloads from a TPU VM.
4+
workloads from a VM in GCP using Python 3.10.
55

66
1. Run the following commands to initialize the project and zone.
77
```shell
@@ -14,8 +14,10 @@ gcloud config set compute/zone $ZONE
1414
2. Install XPK by following the [prerequisites](https://github.com/AI-Hypercomputer/xpk?tab=readme-ov-file#prerequisites) and [installation](https://github.com/AI-Hypercomputer/xpk?tab=readme-ov-file#installation)
1515
instructions. Also ensure you have the proper [GCP permissions](https://github.com/AI-Hypercomputer/xpk?tab=readme-ov-file#installation).
1616

17-
* In order to run the tpu-recipes as-is, run the `git clone` command from your home directory:
17+
* In order to run the tpu-recipes as-is, run the `git clone` command from your home (~/) directory:
1818
```shell
19+
# tpu-recipes requiring XPK will look for it in the home directory
20+
cd ~/
1921
git clone https://github.com/google/xpk.git
2022
```
2123

@@ -25,6 +27,11 @@ git clone https://github.com/google/xpk.git
2527
cd xpk # Should be equivalent to cd ~/xpk
2628
```
2729

30+
> **_NOTE:_** If you use a virtual environment in the
31+
[XPK Installation](https://github.com/AI-Hypercomputer/xpk?tab=readme-ov-file#installation)
32+
steps, you must use the same one to run the steps in the [MAXTEXT_README](MAXTEXT_README.md)
33+
as well as your relevant tpu-recipe workloads.
34+
2835
## GKE Cluster Creation
2936
1. Specify your TPU GKE cluster configs.
3037
```shell

0 commit comments

Comments
 (0)