This repository was archived by the owner on May 20, 2025. It is now read-only.
2 files changed (+19 −2)
```diff
@@ -382,6 +382,9 @@ async def do_download_audio_model(ctx: MessageContext):
 @main_api.post("/download-model")
 async def download_audio(ctx: HttpContext):
     model_id = ctx.req.query.get("model", audio_model_id)
+
+    if isinstance(model_id, list):
+        model_id = model_id[0]
     # asynchronously download the model
     await download_audio_model.publish({"model_id": model_id})
 
```
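The guard added above exists because query-string parsers commonly return a list of values when a parameter is repeated. A minimal standalone sketch of the same normalization, using the standard library's `parse_qs` rather than Nitric's actual `HttpContext` (the function name and inputs here are illustrative, not from the guide):

```python
from urllib.parse import parse_qs

def first_query_value(raw_query: str, key: str, default: str) -> str:
    """Return the first value for `key`, mirroring the guard in the handler."""
    # parse_qs maps every key to a list, e.g. {"model": ["llama-3", "phi-2"]}
    params = parse_qs(raw_query)
    model_id = params.get(key, default)
    if isinstance(model_id, list):
        model_id = model_id[0]
    return model_id

print(first_query_value("model=llama-3&model=phi-2", "model", "default-model"))
```

Without the `isinstance` check, a repeated `?model=` parameter would pass a list where a string model ID is expected.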
````diff
@@ -662,7 +665,7 @@ nitric stack new test aws
 This will generate a nitric stack file called `test` which defines how we want to deploy a stack to AWS. We can update this stack file with settings to configure our batch service and the AWS Compute environment it will run in.
 
 ```yaml title:nitric.test.yaml
-provider: nitric/aws@1.14.2
+provider: nitric/aws@1.15.4
 # The target aws region to deploy to
 # See available regions:
 # https://docs.aws.amazon.com/general/latest/gr/lambda-service.html
````
```diff
@@ -269,6 +269,9 @@ async def do_download_audio_model(ctx: MessageContext):
 @main_api.post("/download-model")
 async def download_audio(ctx: HttpContext):
     model_id = ctx.req.query.get("model", audio_model_id)
+
+    if isinstance(model_id, list):
+        model_id = model_id[0]
     # asynchronously download the model
     await download_audio_model.publish({"model_id": model_id})
 
```
```diff
@@ -331,7 +334,8 @@ FROM ghcr.io/astral-sh/uv:python3.11-bookworm AS builder
 
 ARG HANDLER
 ENV HANDLER=${HANDLER}
-
+# Set flags for common execution environment
+ENV CMAKE_ARGS="-DLLAMA_NATIVE=OFF -DGGML_NATIVE=OFF -DGGML_BLAS=ON -DGGML_BLAS_VENDOR=OpenBLAS -DGGML_AVX512=OFF -DGGML_AVX512_VNNI=OFF -DGGML_AVX512_VBMI=OFF -DGGML_AVX512_BF16=OFF"
 ENV UV_COMPILE_BYTECODE=1 UV_LINK_MODE=copy PYTHONPATH=.
 WORKDIR /app
 RUN --mount=type=cache,target=/root/.cache/uv \
```
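For context on the new `ENV CMAKE_ARGS` line: `CMAKE_ARGS` is read when a CMake-based Python package such as llama-cpp-python is compiled from source, so these flags shape the wheel built inside the container. A hedged sketch of the same idea outside Docker (the `uv pip` line is an illustrative from-source install, not a command from the guide):

```shell
# Disabling the *_NATIVE and AVX-512 flags avoids baking in CPU features
# the target compute environment may lack; GGML_BLAS=ON with OpenBLAS
# provides a portable BLAS backend instead.
export CMAKE_ARGS="-DLLAMA_NATIVE=OFF -DGGML_NATIVE=OFF -DGGML_BLAS=ON -DGGML_BLAS_VENDOR=OpenBLAS -DGGML_AVX512=OFF -DGGML_AVX512_VNNI=OFF -DGGML_AVX512_VBMI=OFF -DGGML_AVX512_BF16=OFF"

# The flags then apply to a from-source build, e.g.:
#   uv pip install --no-binary :all: llama-cpp-python
echo "$CMAKE_ARGS" | tr ' ' '\n'
```

The trade-off is portability over peak speed: a wheel built with `-DLLAMA_NATIVE=ON` on the build host could crash with illegal-instruction errors on older CPUs in the AWS compute environment.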
````diff
@@ -378,6 +382,16 @@ README.md
 model.zip
 ```
 
+## Update our .env file
+
+We'll change some of the environment variables in our `.env` file to factor in the extra init time it will take to start our LLM job:
+
+```sh title:.env
+PYTHONPATH=.
+# diff +
+WORKER_TIMEOUT=300
+```
+
 Finally, we need to tell Nitric to use these files to create the script service. We can do this by updating the `nitric.yaml` file:
 
 ```yaml title:nitric.yaml
````