Commit d567afd

Update to use OIDC token, upgrade resources for model on larger instance, update cert
1 parent c41f093 commit d567afd

5 files changed: 28 additions, 9 deletions

.github/workflows/deploy-to-eks.yml

Lines changed: 2 additions & 3 deletions
@@ -26,10 +26,9 @@ jobs:
     runs-on: ubuntu-latest
     steps:
       - name: Configure AWS credentials
-        uses: aws-actions/configure-aws-credentials@v1
+        uses: aws-actions/configure-aws-credentials@v2
         with:
-          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
-          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
+          role-to-assume: ${{ secrets.GH_ACTIONS_ROLE }}
           aws-region: ${{ secrets.AWS_REGION }}
 
       - name: Update kube config

out/base/model-deployment.yaml

Lines changed: 4 additions & 4 deletions
@@ -39,11 +39,11 @@ spec:
               containerPort: 11434
           resources:
             requests:
-              cpu: "1000m"
-              memory: "4Gi"
+              cpu: "6000m"
+              memory: "28Gi"
             limits:
-              cpu: "2000m"
-              memory: "6Gi"
+              cpu: "7000m"
+              memory: "30Gi"
           lifecycle:
             postStart:
               exec:
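
The new values keep each request below its limit (6 of 7 cores, 28 of 30 GiB). A minimal sketch of that sanity check, assuming only the millicore and binary-suffix quantity forms seen in this hunk; the function names are illustrative and not part of the repository:

```javascript
// Parse a Kubernetes CPU quantity ("6000m" or "6") into whole cores.
function parseCpu(quantity) {
  return quantity.endsWith("m")
    ? Number(quantity.slice(0, -1)) / 1000
    : Number(quantity);
}

// Parse a memory quantity with an optional binary suffix ("28Gi") into bytes.
// Only Ki/Mi/Gi/Ti are handled in this sketch.
function parseMemory(quantity) {
  const units = { Ki: 2 ** 10, Mi: 2 ** 20, Gi: 2 ** 30, Ti: 2 ** 40 };
  const match = /^(\d+(?:\.\d+)?)(Ki|Mi|Gi|Ti)?$/.exec(quantity);
  if (!match) throw new Error(`unsupported quantity: ${quantity}`);
  return Number(match[1]) * (match[2] ? units[match[2]] : 1);
}

// True when every request is less than or equal to the matching limit.
function requestsFitLimits({ requests, limits }) {
  return (
    parseCpu(requests.cpu) <= parseCpu(limits.cpu) &&
    parseMemory(requests.memory) <= parseMemory(limits.memory)
  );
}

// Values copied from the "+" lines of the hunk above.
const updated = {
  requests: { cpu: "6000m", memory: "28Gi" },
  limits: { cpu: "7000m", memory: "30Gi" },
};

console.log(requestsFitLimits(updated)); // true
```

The pod will only schedule if a node has at least the requested 6 cores and 28 GiB allocatable, which is presumably what the "larger instance" in the commit message provides.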

out/base/server-expose.yaml

Lines changed: 21 additions & 0 deletions
@@ -0,0 +1,21 @@
+#! server-expose.yaml
+# Generated code, do not edit
+apiVersion: v1
+kind: Service
+metadata:
+  name: server
+  namespace: cat-chatbot
+  labels:
+    com.docker.compose.project: cat-chatbot
+    com.docker.compose.service: server
+spec:
+  selector:
+    com.docker.compose.project: cat-chatbot
+    com.docker.compose.service: server
+  ports:
+    - name: app-3000
+      port: 3000
+      targetPort: app-3000
+    - name: server-5001
+      port: 5001
+      targetPort: server-5001
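
Both `targetPort` values are names rather than numbers, so Kubernetes resolves them against the `name` of a container port on the selected pods. A rough sketch of that lookup follows; the `containerPorts` list is an assumption, since the server Deployment is not part of this diff, and `resolveTargetPort` is a hypothetical helper:

```javascript
// Resolve a Service port's targetPort to a numeric container port.
// Numeric targetPorts pass through; named ones are looked up by container port name.
function resolveTargetPort(servicePort, containerPorts) {
  if (typeof servicePort.targetPort === "number") return servicePort.targetPort;
  const match = containerPorts.find((p) => p.name === servicePort.targetPort);
  return match ? match.containerPort : undefined;
}

// Ports copied from the new Service manifest.
const servicePorts = [
  { name: "app-3000", port: 3000, targetPort: "app-3000" },
  { name: "server-5001", port: 5001, targetPort: "server-5001" },
];

// ASSUMPTION: port names the server pod would need to declare for the Service to route.
const containerPorts = [
  { name: "app-3000", containerPort: 3000 },
  { name: "server-5001", containerPort: 5001 },
];

for (const sp of servicePorts) {
  console.log(sp.name, "->", resolveTargetPort(sp, containerPorts));
}
```

If the pod spec does not declare ports with these exact names, the named `targetPort` has nothing to resolve to and the Service will not route traffic for that port.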

out/overlays/desktop/server-service.yaml

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ metadata:
     alb.ingress.kubernetes.io/scheme: internet-facing
     alb.ingress.kubernetes.io/target-type: ip
     alb.ingress.kubernetes.io/listen-ports: '[{"HTTP": 80}, {"HTTPS": 443}]'
-    alb.ingress.kubernetes.io/certificate-arn: arn:aws:acm:us-east-2:550259584844:certificate/043b146e-db7e-4681-a0af-da743eb84723
+    alb.ingress.kubernetes.io/certificate-arn: arn:aws:acm:us-east-1:175142243308:certificate/c5e7fb70-aaec-4319-9d28-7bcbd4bbbf3c
 spec:
   ingressClassName: alb
   rules:
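
The certificate swap changes more than the certificate ID: the region and the account in the ARN differ as well. A small sketch that splits the two ARNs from this hunk into their fields makes that visible (`parseArn` is an illustrative helper, not an AWS SDK function):

```javascript
// Split an ARN of the form arn:partition:service:region:account-id:resource.
function parseArn(arn) {
  const [prefix, partition, service, region, accountId, ...rest] = arn.split(":");
  if (prefix !== "arn" || rest.length === 0) {
    throw new Error(`not an ARN: ${arn}`);
  }
  // The resource part may itself contain colons, so re-join the remainder.
  return { partition, service, region, accountId, resource: rest.join(":") };
}

const oldCert = parseArn(
  "arn:aws:acm:us-east-2:550259584844:certificate/043b146e-db7e-4681-a0af-da743eb84723"
);
const newCert = parseArn(
  "arn:aws:acm:us-east-1:175142243308:certificate/c5e7fb70-aaec-4319-9d28-7bcbd4bbbf3c"
);

console.log(oldCert.region, "->", newCert.region); // us-east-2 -> us-east-1
console.log(oldCert.accountId !== newCert.accountId); // true
```

ACM certificates are regional and an ALB can only attach a certificate from its own region, so this change suggests the cluster now runs in us-east-1 under a different account, and `secrets.AWS_REGION` would need to match.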

tests/server.test.js

Lines changed: 0 additions & 1 deletion
@@ -52,7 +52,6 @@ describe('Ollama Container Tests', () => {
     const result = await getResponse(model, prompt);
     console.log('Response from Ollama:', result);
 
-    // Add your assertions here
     expect(result["done"]).toBe(true)
   }, 60 * SECONDS);
 });
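
With the placeholder comment gone, the test asserts only on `done`. A sketch of a slightly stricter check follows, assuming `getResponse` wraps Ollama's `/api/generate` endpoint with `stream: false`. The helper's real body is not shown in this diff, so both the implementation below and the `findProblems` name are hypothetical, as is the `localhost` base URL (the port matches the `containerPort` in the model Deployment):

```javascript
// HYPOTHETICAL stand-in for the test file's getResponse helper.
// With stream: false, Ollama replies with one JSON object that includes
// `response` (the generated text) and `done`.
async function getResponse(model, prompt, baseUrl = "http://localhost:11434") {
  const res = await fetch(`${baseUrl}/api/generate`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model, prompt, stream: false }),
  });
  if (!res.ok) throw new Error(`Ollama returned HTTP ${res.status}`);
  return res.json();
}

// Collect reasons a generate result looks incomplete; an empty list means it passed.
function findProblems(result) {
  const problems = [];
  if (result.done !== true) problems.push("generation did not finish");
  if (typeof result.response !== "string" || result.response.trim() === "") {
    problems.push("response text is missing or empty");
  }
  return problems;
}

console.log(findProblems({ done: true, response: "Meow." })); // []
```

In the Jest test this could sit next to the existing assertion as `expect(findProblems(result)).toEqual([])`, which also catches a reply that finishes with no text.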
