MxNet unable to detect CPU on AWS/EMR/g4 instance #2038
patrick-nicolas
started this conversation in
General
Replies: 1 comment
-
@patrick-nicolas |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
The application works just fine on CPU. I followed the recommendations from issue 732
Issue-732
Which step am I missing?
GPU configuration:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.172.01 Driver Version: 450.172.01 CUDA Version: 11.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Tesla T4 Off | 00000000:00:1E.0 Off | 0 |
| N/A 33C P0 14W / 70W | 0MiB / 15109MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
MxNet binary libraries
/home/hadoop/.djl.ai/mxnet/1.9.0-mkl-linux-x86_64/
├── libgfortran.so.3
├── libgomp.so.1
├── libmxnet.so
├── libopenblas.so.0
└── libquadmath.so.0
Relevant environment Variables
PATH: /home/hadoop/spark-3.3.0-bin-hadoop3-scala2.13/bin/:/home/hadoop/jdk-11/bin:/usr/local/cuda/bin:/usr/local/cuda/lib64:usr/local/cuda/targets/x86_64-linux/lib:/usr/lib64/qt-3.3/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/opt/aws/puppet/bin/
LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/targets/x86_64-linux/lib
JAVA_HOME: /home/hadoop/jdk-11
SPARK_ENV_LOADED: 1
SPARK_SCALA_VERSION: 2.13
SPARK_HOME: /home/hadoop/spark-3.3.0-bin-hadoop3-scala2.13
SHELL: /bin/bash
USER: hadoop
SPARK_CONF_DIR: /home/hadoop/spark-3.3.0-bin-hadoop3-scala2.13/conf
HOME: /home/hadoop
-------------- Directories --------------
temp directory: /tmp
DJL cache directory: /home/hadoop/.djl.ai
Engine cache directory: /home/hadoop/.djl.ai
------------------ CUDA -----------------
GPU Count: 1
CUDA: 110
ARCH: 75
GPU(0) memory used: 106954752 bytes
----------------- Engines ---------------
MXNet version: 1.9.0
Default Engine: MXNet
Default Device: cpu()
MXNet: #0
Beta Was this translation helpful? Give feedback.
All reactions