whisper breaks on cuda-13 #8033

@markuman

LocalAI version:

localai/localai:v3.9.0-aio-gpu-nvidia-cuda-13

Environment, CPU architecture, OS, and Version:

Linux gpu2 6.18.5-arch1-1 #1 SMP PREEMPT_DYNAMIC Sun, 11 Jan 2026 17:10:53 +0000 x86_64 GNU/Linux

Describe the bug

Jan 14 12:56:08 ERROR failed starting/connecting to the gRPC service error=rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing: dial tcp 127.0.0.1:40857: connect: connection refused" 
Jan 14 12:56:10 ERROR Failed to load model modelID="ggml-large-v3-turbo.bin" error=failed to load model with internal loader: grpc service not ready backend="whisper" 
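For anyone trying to reproduce this: "connection refused" on 127.0.0.1 generally means nothing was listening on that port when LocalAI dialed it, which would be consistent with the whisper backend process exiting before it opened its gRPC listener (an interpretation, not something the logs above confirm). A minimal Python sketch for checking that from inside the container follows; the helper name `port_is_listening` is made up for illustration and is not part of LocalAI.

```python
import socket


def port_is_listening(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) can be opened.

    Hypothetical diagnostic helper, not part of LocalAI.
    """
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError
        # or a timeout) when nothing accepts the connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # 40857 is the port from the log above; it appears to be assigned
    # per run, so substitute the port from your own error message.
    print(port_is_listening("127.0.0.1", 40857))
```

If this prints `False` while the model is loading, the backend's own output (rather than the gRPC dial error) is probably where the real failure is reported.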

Switching to the CUDA 12 image fixes the problem:

localai/localai:v3.9.0-aio-gpu-nvidia-cuda-12
