TensorFlow 2.14 on NVIDIA DGX Spark GB10 with CUDA 13 (ARM64) #48

@obriensystems

michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ vi environments/windows/Dockerfile 
michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ sudo ./build.sh 
[+] Building 0.3s (8/8) FINISHED                                                                                                                                                   docker:default
 => [internal] load build definition from Dockerfile                                                                                                                                         0.0s
 => => transferring dockerfile: 545B                                                                                                                                                         0.0s
 => [internal] load metadata for docker.io/linaro/tensorflow-arm64-build:2.14-multipython                                                                                                    0.1s
 => [internal] load .dockerignore                                                                                                                                                            0.0s
 => => transferring context: 2B                                                                                                                                                              0.0s
 => [1/3] FROM docker.io/linaro/tensorflow-arm64-build:2.14-multipython@sha256:6924bf224dec0acf3f8ca29650b8eae45a2cf5fb8fc011880fdfc5b42d371352                                              0.0s
 => => resolve docker.io/linaro/tensorflow-arm64-build:2.14-multipython@sha256:6924bf224dec0acf3f8ca29650b8eae45a2cf5fb8fc011880fdfc5b42d371352                                              0.0s
 => [internal] load build context                                                                                                                                                            0.0s
 => => transferring context: 57B                                                                                                                                                             0.0s
 => CACHED [2/3] WORKDIR /src                                                                                                                                                                0.0s
 => CACHED [3/3] COPY /src/tflow.py .                                                                                                                                                        0.0s
 => exporting to image                                                                                                                                                                       0.0s
 => => exporting layers                                                                                                                                                                      0.0s
 => => writing image sha256:0cc082cb7126f885bdc2c605c287c1a1ac3962b546b3ac38a220ee1f834dfa06                                                                                                 0.0s
 => => naming to docker.io/library/ml-tensorflow-arm                                                                                                                                         0.0s
Traceback (most recent call last):
  File "tflow.py", line 1, in <module>
    import tensorflow as tf
ModuleNotFoundError: No module named 'tensorflow'
michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ vi environments/windows/Dockerfile 
michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ python3 -m venv venv-t214
michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ source venv-t214/bin/activate
(venv-t214) michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ python --version
Python 3.12.3
(venv-t214) michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ python -m pip install --upgrade pip
Requirement already satisfied: pip in ./venv-t214/lib/python3.12/site-packages (24.0)
Collecting pip
  Downloading pip-25.3-py3-none-any.whl.metadata (4.7 kB)
Downloading pip-25.3-py3-none-any.whl (1.8 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 12.5 MB/s eta 0:00:00
Installing collected packages: pip
  Attempting uninstall: pip
    Found existing installation: pip 24.0
    Uninstalling pip-24.0:
      Successfully uninstalled pip-24.0
Successfully installed pip-25.3
(venv-t214) michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ python -m pip install tensorflow==2.14
ERROR: Could not find a version that satisfies the requirement tensorflow==2.14 (from versions: 2.16.0rc0, 2.16.1, 2.16.2, 2.17.0rc0, 2.17.0rc1, 2.17.0, 2.17.1, 2.18.0rc0, 2.18.0rc1, 2.18.0rc2, 2.18.0, 2.18.1, 2.19.0rc0, 2.19.0, 2.19.1, 2.20.0rc0, 2.20.0)
ERROR: No matching distribution found for tensorflow==2.14
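
The candidate list pip prints above starts at 2.16.0rc0, and the venv is running Python 3.12.3. That is consistent with TensorFlow 2.14 publishing wheels only for Python 3.9 through 3.11. A minimal sketch of a pre-install check follows; the `SUPPORTED_PYTHONS` table and the `python_supported` helper are hypothetical names, and the ranges are an assumption copied by hand from TensorFlow's tested-configurations table rather than queried from PyPI.

```python
import sys

# Assumed (inclusive) Python ranges per TensorFlow release.
# Hand-copied from the tested-configurations table; verify before relying on it.
SUPPORTED_PYTHONS = {
    "2.14": ((3, 9), (3, 11)),
    "2.16": ((3, 9), (3, 12)),
}


def python_supported(tf_version, py_version=None):
    """Return True if (major, minor) falls inside the assumed range."""
    if py_version is None:
        py_version = sys.version_info[:2]
    low, high = SUPPORTED_PYTHONS[tf_version]
    return low <= tuple(py_version[:2]) <= high


if __name__ == "__main__":
    # Report each release against the interpreter running this script.
    for tf_version in SUPPORTED_PYTHONS:
        print(tf_version, python_supported(tf_version))
```

If the check fails for 2.14, the options are a venv built on an older interpreter (3.9 to 3.11) or a newer TensorFlow release from the list pip offered.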


Check that the Python interpreter is 64-bit:
(venv-t214) michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ python
Python 3.12.3 (main, Aug 14 2025, 17:47:21) [GCC 13.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import struct;print(struct.calcsize("P") * 8)
64
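
Pointer width is only one of the things pip looks at when choosing a wheel; CPU architecture and Python version matter as well. A small sketch that collects all three in one place, using only the standard library (the `interpreter_info` helper name is made up for illustration):

```python
import platform
import struct


def interpreter_info():
    """Collect pointer width, CPU architecture and Python version."""
    return {
        "bits": struct.calcsize("P") * 8,      # same check as the REPL one-liner
        "machine": platform.machine(),         # CPU architecture as reported by the OS
        "python": platform.python_version(),   # full interpreter version string
    }


if __name__ == "__main__":
    print(interpreter_info())
```

On this machine the `machine` field would presumably report an ARM64 value, which narrows the wheel search further than the 64-bit check alone.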

This looks like the same problem as:
https://stackoverflow.com/questions/77297953/tensorflow-2-14-gpu-not-getting-recognized

The machine is running CUDA 13:
(venv-t214) michael@spark-7d19:~/wse_github/ObrienlabsDev/machine-learning$ nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Wed_Aug_20_01:57:39_PM_PDT_2025
Cuda compilation tools, release 13.0, V13.0.88
Build cuda_13.0.r13.0/compiler.36424714_0
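
To compare the installed toolkit against what a given TensorFlow build expects, the release number can be pulled out of the `nvcc -V` text. A rough sketch, assuming the `release X.Y` wording shown above stays stable; `parse_nvcc_release` is a hypothetical helper, and the sample string is copied from the output in this issue. As far as I know, TensorFlow 2.14's tested configuration lists CUDA 11.8, so a 13.x toolkit would be a mismatch for the GPU build even once a wheel installs.

```python
import re


def parse_nvcc_release(output):
    """Return (major, minor) from `nvcc -V` text, or None if not found."""
    match = re.search(r"release (\d+)\.(\d+)", output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


# Sample text copied from the nvcc output in this issue.
SAMPLE = """nvcc: NVIDIA (R) Cuda compiler driver
Cuda compilation tools, release 13.0, V13.0.88
Build cuda_13.0.r13.0/compiler.36424714_0
"""

if __name__ == "__main__":
    print(parse_nvcc_release(SAMPLE))  # (13, 0)
```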

Nope, this does not work either:

pip install tensorflow[and-cuda]==2.14.0

