Commit a5eba1f

revamp bedrock
1 parent 640d82f commit a5eba1f

File tree

1 file changed: +40 -40 lines changed


src/content/docs/aws/services/bedrock.md

Lines changed: 40 additions & 40 deletions
@@ -1,15 +1,15 @@
 ---
 title: "Bedrock"
-linkTitle: "Bedrock"
-description: Use foundation models running on your device with LocalStack!
+description: Get started with Bedrock on LocalStack
 tags: ["Ultimate"]
 ---

 ## Introduction

 Bedrock is a fully managed service provided by Amazon Web Services (AWS) that makes foundation models from various LLM providers accessible via an API.
+
 LocalStack allows you to use the Bedrock APIs to test and develop AI-powered applications in your local environment.
-The supported APIs are available on our [API Coverage Page]({{< ref "coverage_bedrock" >}}), which provides information on the extent of Bedrock's integration with LocalStack.
+The supported APIs are available on our [API Coverage Page](), which provides information on the extent of Bedrock's integration with LocalStack.

 ## Getting started

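For orientation, the `awslocal` commands used throughout the page are a thin wrapper around the regular AWS CLI pointed at LocalStack. A rough equivalent, assuming LocalStack's default edge endpoint on `localhost:4566` and its usual dummy test credentials:

```bash
# Equivalent of `awslocal ...` using the plain AWS CLI against LocalStack.
# Endpoint URL, region, and credentials below are the common LocalStack defaults (an assumption).
AWS_ACCESS_KEY_ID=test AWS_SECRET_ACCESS_KEY=test AWS_DEFAULT_REGION=us-east-1 \
  aws --endpoint-url=http://localhost:4566 bedrock list-foundation-models
```
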
@@ -37,16 +37,17 @@ This way you avoid long wait times when switching between models on demand with

 You can view all available foundation models using the [`ListFoundationModels`](https://docs.aws.amazon.com/bedrock/latest/APIReference/API_ListFoundationModels.html) API.
 This will show you which models are available on AWS Bedrock.
-{{< callout "note">}}
+
+:::note
 The actual model that will be used for emulation will differ from the ones defined in this list.
 You can define the used model with `DEFAULT_BEDROCK_MODEL`
-{{< / callout >}}
+:::

 Run the following command:

-{{< command >}}
-$ awslocal bedrock list-foundation-models
-{{< / command >}}
+```bash
+awslocal bedrock list-foundation-models
+```

 ### Invoke a model

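If the full listing is too long, standard AWS CLI filtering can be applied to it. A small sketch, assuming the usual `modelSummaries` shape of the `ListFoundationModels` response:

```bash
# Show only the model IDs of a single provider (JMESPath --query over the modelSummaries list)
awslocal bedrock list-foundation-models \
  --query 'modelSummaries[?providerName==`Meta`].modelId'
```
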
@@ -56,15 +57,15 @@ However, the actual model will be defined by the `DEFAULT_BEDROCK_MODEL` environ

 Run the following command:

-{{< command >}}
-$ awslocal bedrock-runtime invoke-model \
+```bash
+awslocal bedrock-runtime invoke-model \
 --model-id "meta.llama3-8b-instruct-v1:0" \
 --body '{
 "prompt": "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\nSay Hello!\n<|eot_id|>\n<|start_header_id|>assistant<|end_header_id|>",
 "max_gen_len": 2,
 "temperature": 0.9
 }' --cli-binary-format raw-in-base64-out outfile.txt
-{{< / command >}}
+```

 The output will be available in the `outfile.txt`.

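To check the result, the file can simply be pretty-printed. A sketch assuming a Llama-style response body, where the completion text is usually returned in a `generation` field:

```bash
# Pretty-print the emulated response written by invoke-model
# (exact field names depend on the model family; "generation" is an assumption for Llama models).
python3 -m json.tool outfile.txt
```
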
@@ -75,8 +76,8 @@ You can specify both system prompts and user messages.

 Run the following command:

-{{< command >}}
-$ awslocal bedrock-runtime converse \
+```bash
+awslocal bedrock-runtime converse \
 --model-id "meta.llama3-8b-instruct-v1:0" \
 --messages '[{
 "role": "user",
@@ -87,47 +88,46 @@ $ awslocal bedrock-runtime converse \
 --system '[{
 "text": "You'\''re a chatbot that can only say '\''Hello!'\''"
 }]'
-{{< / command >}}
+```

 ### Model Invocation Batch Processing

 Bedrock offers the feature to handle large batches of model invocation requests defined in S3 buckets using the [`CreateModelInvocationJob`](https://docs.aws.amazon.com/bedrock/latest/APIReference/API_CreateModelInvocationJob.html) API.

-First, you need to create a `JSONL` file that contains all your prompts:
+First, you need to create a `JSONL` file named `batch_input.jsonl` that contains all your prompts:

-{{< command >}}
-$ cat batch_input.jsonl
+```json
 {"prompt": "Tell me a quick fact about Vienna.", "max_tokens": 50, "temperature": 0.5}
 {"prompt": "Tell me a quick fact about Zurich.", "max_tokens": 50, "temperature": 0.5}
 {"prompt": "Tell me a quick fact about Las Vegas.", "max_tokens": 50, "temperature": 0.5}
-{{< / command >}}
+```

 Then, you need to define buckets for the input as well as the output and upload the file in the input bucket:

-{{< command >}}
-$ awslocal s3 mb s3://in-bucket
-make_bucket: in-bucket
-
-$ awslocal s3 cp batch_input.jsonl s3://in-bucket
-upload: ./batch_input.jsonl to s3://in-bucket/batch_input.jsonl
-
-$ awslocal s3 mb s3://out-bucket
-make_bucket: out-bucket
-{{< / command >}}
+```bash
+awslocal s3 mb s3://in-bucket
+awslocal s3 cp batch_input.jsonl s3://in-bucket
+awslocal s3 mb s3://out-bucket
+```

 Afterwards you can run the invocation job like this:

-{{< command >}}
-$ awslocal bedrock create-model-invocation-job \
+```bash
+awslocal bedrock create-model-invocation-job \
 --job-name "my-batch-job" \
 --model-id "mistral.mistral-small-2402-v1:0" \
 --role-arn "arn:aws:iam::123456789012:role/MyBatchInferenceRole" \
 --input-data-config '{"s3InputDataConfig": {"s3Uri": "s3://in-bucket"}}' \
 --output-data-config '{"s3OutputDataConfig": {"s3Uri": "s3://out-bucket"}}'
+```
+
+The output will be:
+
+```json
 {
 "jobArn": "arn:aws:bedrock:us-east-1:000000000000:model-invocation-job/12345678"
 }
-{{< / command >}}
+```

 The results will be at the S3 URL `s3://out-bucket/12345678/batch_input.jsonl.out`

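A short sketch of retrieving those results once the job has finished, using the job id `12345678` from the example output above:

```bash
# Download the batch output produced by the invocation job and inspect it;
# each line pairs an input record from batch_input.jsonl with the model's response.
awslocal s3 cp s3://out-bucket/12345678/batch_input.jsonl.out .
cat batch_input.jsonl.out
```
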
@@ -140,33 +140,33 @@ LocalStack will pull the model from Ollama and use it for emulation.

 For example, to use the Mistral model, set the environment variable while starting LocalStack:

-{{< command >}}
-$ DEFAULT_BEDROCK_MODEL=mistral localstack start
-{{< / command >}}
+```bash
+DEFAULT_BEDROCK_MODEL=mistral localstack start
+```

 You can also define models directly in the request, by setting the `model-id` parameter to `ollama.<ollama-model-id>`.
 For example, if you want to access `deepseek-r1`, you can do it like this:

-{{< command >}}
-$ awslocal bedrock-runtime converse \
+```bash
+awslocal bedrock-runtime converse \
 --model-id "ollama.deepseek-r1" \
 --messages '[{
 "role": "user",
 "content": [{
 "text": "Say Hello!"
 }]
 }]'
-{{< / command >}}
+```

 ## Troubleshooting

 Users of Docker Desktop on macOS or Windows might run into the issue of Bedrock becoming unresponsive after some usage.
 A common reason for that is insufficient storage or memory space in the Docker Desktop VM.
 To resolve this issue you can increase those amounts directly in Docker Desktop or clean up unused artifacts with the Docker CLI like this

-{{< command >}}
-$ docker system prune
-{{< / command >}}
+```bash
+docker system prune
+```

 You could also try to use a model with lower requirements.
 To achieve that you can search for models in the [Ollama Models library](https://ollama.com/search) with a low parameter count or smaller size.
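For instance, a sketch of switching the default to a smaller model from that library (`tinyllama` is only an illustrative low-parameter choice; it is assumed to work the same way as `mistral` above):

```bash
# Illustrative: use a small Ollama model as the emulation default to reduce memory and storage pressure
DEFAULT_BEDROCK_MODEL=tinyllama localstack start
```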
