Skip to content

Commit 873558d

Browse files
committed
Wait for model readiness in test script
1 parent e9e427d commit 873558d

File tree

2 files changed

+19
-6
lines changed

2 files changed

+19
-6
lines changed

charts/azimuth-llm/templates/test/end-to-end.yml

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,6 @@ spec:
2424
- http://{{ .Values.ui.service.name }}.{{ .Release.Namespace }}.svc
2525
{{- end }}
2626
restartPolicy: Never
27-
# Allow plenty of retries since downloading
28-
# model weights can take a long time.
29-
backoffLimit: 20
27+
# Handle retries within gradio-test-client script
28+
backoffLimit: 1
3029
{{- end -}}
Lines changed: 17 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,21 @@
11
import sys
2+
import time
3+
24
from gradio_client import Client
35

46
gradio_host = sys.argv[1]
5-
client = Client(gradio_host)
6-
result = client.predict("Hi", api_name="/chat")
7-
print(result)
7+
8+
retries = 60
9+
for n in range(1, retries+1):
10+
try:
11+
client = Client(gradio_host)
12+
result = client.predict("Hi", api_name="/chat")
13+
print(result)
14+
break
15+
except Exception as err:
16+
msg = f"Attempt {n} / {retries} encounter error: {err}"
17+
if n < retries:
18+
print(msg, "- waiting 10 seconds before retrying")
19+
time.sleep(10)
20+
else:
21+
print(msg, "- no more retries left")

0 commit comments

Comments
 (0)