Commit e02c17b

increase tekton timeouts and add back the arches
1 parent f1a35b8 commit e02c17b

3 files changed: 7 additions, 1 deletion
.tekton/odh-training-th04-cuda128-torch29-py312-rhel9-pull-request.yaml

Lines changed: 3 additions & 0 deletions

@@ -18,6 +18,9 @@ metadata:
   name: odh-training-th04-cuda128-torch29-py312-rhel9-on-pull-request
   namespace: open-data-hub-tenant
 spec:
+  timeouts:
+    pipeline: 20h
+    tasks: 18h
   params:
   - name: git-url
     value: '{{source_url}}'

.tekton/odh-training-th04-cuda128-torch29-py312-rhel9-push.yaml

Lines changed: 3 additions & 0 deletions

@@ -17,6 +17,9 @@ metadata:
   name: odh-training-th04-cuda128-torch29-py312-rhel9-on-push
   namespace: open-data-hub-tenant
 spec:
+  timeouts:
+    pipeline: 20h
+    tasks: 18h
   params:
   - name: git-url
     value: '{{source_url}}'
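
Reviewer sketch (not part of the commit): both PipelineRuns now set `pipeline: 20h` and `tasks: 18h`. As far as I understand Tekton's `spec.timeouts`, the `tasks` budget has to fit inside the `pipeline` budget, and whatever is left over is available to `finally` tasks. The Python below is a minimal check of that relationship. The helper names `parse_duration` and `check_timeouts` are hypothetical, and the parser only handles simple `h`/`m`/`s` durations, not the full Go duration syntax.

```python
import re
from datetime import timedelta

# Hypothetical helper: maps Go-style unit letters to timedelta keyword names.
_UNITS = {"h": "hours", "m": "minutes", "s": "seconds"}


def parse_duration(text: str) -> timedelta:
    """Parse a simple duration such as '20h' or '1h30m' (h/m/s units only)."""
    parts = re.findall(r"(\d+)([hms])", text)
    # Reject anything the pattern did not consume completely (e.g. '5ms', '2d').
    if not parts or "".join(n + u for n, u in parts) != text:
        raise ValueError(f"unsupported duration: {text!r}")
    total = timedelta()
    for number, unit in parts:
        total += timedelta(**{_UNITS[unit]: int(number)})
    return total


def check_timeouts(timeouts: dict) -> timedelta:
    """Return the time left for `finally` tasks; raise if tasks exceed pipeline."""
    pipeline = parse_duration(timeouts["pipeline"])
    tasks = parse_duration(timeouts["tasks"])
    if tasks > pipeline:
        raise ValueError("tasks timeout exceeds pipeline timeout")
    return pipeline - tasks


# Values taken from the hunks above.
slack = check_timeouts({"pipeline": "20h", "tasks": "18h"})
print(slack)  # 2:00:00
```

With the values in this commit the check passes and leaves two hours of headroom between the task budget and the overall pipeline budget.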

images/universal/training/th04-cuda128-torch290-py312/Dockerfile

Lines changed: 1 addition & 1 deletion

@@ -59,7 +59,7 @@ ENV NVIDIA_VISIBLE_DEVICES=all \
     CUDA_HOME=/usr/local/cuda \
     PATH=/usr/local/cuda/bin:$PATH \
     LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH \
-    TORCH_CUDA_ARCH_LIST="8.6" \
+    TORCH_CUDA_ARCH_LIST="8.0;8.6;8.9;9.0" \
     XLA_FLAGS=--xla_gpu_cuda_data_dir=/usr/local/cuda
 
 ################################################################################
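
Reviewer sketch (not part of the commit): `TORCH_CUDA_ARCH_LIST` names the CUDA compute capabilities that extensions built in this image are compiled for, so widening it from `8.6` to `8.0;8.6;8.9;9.0` multiplies the compile work, which is presumably why the Tekton timeouts were raised in the same commit. The Python below is a rough illustration of what the wider list covers. `parse_arch_list` and `has_native_kernels` are hypothetical helpers, and the check is an exact-match simplification that ignores PTX JIT and binary compatibility within a major version.

```python
def parse_arch_list(value: str) -> list[tuple[int, int]]:
    """Split a TORCH_CUDA_ARCH_LIST string into (major, minor) pairs."""
    archs = []
    # Entries may be separated by semicolons or spaces; '+PTX' is an optional suffix.
    for item in value.replace(" ", ";").split(";"):
        item = item.strip().removesuffix("+PTX")
        if item:
            major, minor = item.split(".")
            archs.append((int(major), int(minor)))
    return archs


def has_native_kernels(capability: tuple[int, int], archs: list[tuple[int, int]]) -> bool:
    """Exact-match check only (an assumption made to keep the sketch short)."""
    return capability in archs


OLD = parse_arch_list("8.6")
NEW = parse_arch_list("8.0;8.6;8.9;9.0")

# Compute capability 9.0 (Hopper-class GPUs) is only covered by the new list.
print(has_native_kernels((9, 0), OLD))  # False
print(has_native_kernels((9, 0), NEW))  # True
```

The design choice here is the usual trade-off: each extra architecture lengthens the build and enlarges the binaries, in exchange for kernels compiled directly for that GPU generation.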
