Skip to content

CUDA error when running predictions without GPU #126

@LouisK92

Description

@LouisK92

I am running segger in the danielunyi42/segger_dev:cuda121 docker container on a system without gpu.

The training went well by setting

trainer = Trainer(
    accelerator="cpu"
    ...
)

I then attempt predictions with the trained model with

model_version = 0
model_path = MODELS_DIR / "lightning_logs" / f"version_{model_version}"
model = load_model(model_path / "checkpoints")

receptive_field = {'k_bd': 4, 'dist_bd': 12, 'k_tx': 15, 'dist_tx': 3}

segment(
    model,
    dm,
    save_dir=TMP_DIR,
    seg_tag='segger_output',
    transcript_file=TRANSCRIPTS_PARQUET,
    receptive_field=receptive_field,
    min_transcripts=5,
    cell_id_col='segger_cell_id',
    use_cc=False,
    knn_method='kd_tree',
    verbose=True,
)

With this I run into the following error:

Processing Train batches:   0%|                                                                       | 0/1258 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/workspace/segger_dev/src/segger/prediction/predict_parquet.py", line 524, in segment
    predict_batch(
  File "/workspace/segger_dev/src/segger/prediction/predict_parquet.py", line 322, in predict_batch
    with cp.cuda.Device(gpu_id):
  File "cupy/cuda/device.pyx", line 173, in cupy.cuda.device.Device.__enter__
  File "cupy_backends/cuda/api/runtime.pyx", line 202, in cupy_backends.cuda.api.runtime.getDevice
  File "cupy_backends/cuda/api/runtime.pyx", line 146, in cupy_backends.cuda.api.runtime.check_status
cupy_backends.cuda.api.runtime.CUDARuntimeError: cudaErrorInsufficientDriver: CUDA driver version is insufficient for CUDA runtime version

I believe the issue could be resolved if there is a way to tell the function to use the cpu. Is this possible in some way?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions