Labels: bug (Something isn't working)
Description
Name and Version
```shell
$ ./build/bin/llama-cli --version
version: 4242 (642330a)
built with Homebrew clang version 18.1.5 for arm64-apple-darwin23.3.0
```
Operating systems
Linux, Mac, Windows
Which llama.cpp modules do you know to be affected?
llama-server
Problem description & steps to reproduce
In the destructor of `server_context`:

```cpp
~server_context() {
    if (ctx) {
        llama_free(ctx);
        ctx = nullptr;
    }
    if (model) {
        llama_free_model(model);
        model = nullptr;
    }
    if (model_dft) {
        llama_free_model(model_dft);
        model_dft = nullptr;
    }
    // Clear any sampling context
    for (server_slot & slot : slots) {
        common_sampler_free(slot.smpl);
        slot.smpl = nullptr;
        llama_free(slot.ctx_dft);
        slot.ctx_dft = nullptr;
        common_speculative_free(slot.spec);
        slot.spec = nullptr;
        llama_batch_free(slot.batch_spec);
    }
    llama_batch_free(batch);
}
```

- if no draft model (`model_dft`) is selected, `slot.spec` is never allocated, so `common_speculative_free` is called with a nullptr (introduced in commit 9ca2e67)
- if no draft model is selected, `slot.batch_spec` keeps its default value, and calling `llama_batch_free` on that default value causes memory corruption (introduced in commit 10bce04)
Suggested fix:
- check `slot.spec` before calling `common_speculative_free`
- convert `slot.batch_spec` to a pointer, check that it was allocated, and only then call `llama_batch_free`
kind attn: @ggerganov @slaren
First Bad Commit
first bad commit: 9ca2e67
second bad commit: 10bce04
Relevant log output
This is an observation from reading the code; I was not able to reproduce it with a built binary and capture system memory logs on deallocation by sending a signal.