
Misc. bug: potential segmentation fault/memory corruption in llama-server #10635

Closed
anagri opened this issue Dec 3, 2024 · 2 comments · Fixed by #10651
Labels
bug Something isn't working

Comments


anagri commented Dec 3, 2024

Name and Version

$ ./build/bin/llama-cli --version
version: 4242 (642330a)
built with Homebrew clang version 18.1.5 for arm64-apple-darwin23.3.0

Operating systems

Linux, Mac, Windows

Which llama.cpp modules do you know to be affected?

llama-server

Problem description & steps to reproduce

In the destructor of server_context:

    ~server_context() {
        if (ctx) {
            llama_free(ctx);
            ctx = nullptr;
        }

        if (model) {
            llama_free_model(model);
            model = nullptr;
        }

        if (model_dft) {
            llama_free_model(model_dft);
            model_dft = nullptr;
        }

        // Clear any sampling context
        for (server_slot & slot : slots) {
            common_sampler_free(slot.smpl);
            slot.smpl = nullptr;

            llama_free(slot.ctx_dft);
            slot.ctx_dft = nullptr;

            common_speculative_free(slot.spec);
            slot.spec = nullptr;

            llama_batch_free(slot.batch_spec);
        }

        llama_batch_free(batch);
    }
  1. If model_dft is not selected, slot.spec is never allocated, so common_speculative_free is called with a nullptr (introduced in commit 9ca2e67).
  2. If model_dft is not selected, slot.batch_spec is left default-initialized, and calling llama_batch_free on that default value causes memory corruption (introduced in commit 10bce04); see the sketch below.
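
To illustrate hazard (2), a minimal standalone sketch, assuming llama_batch is the plain aggregate declared in llama.h and that llama_batch_free releases its pointer members with free():

    #include "llama.h"

    // hypothetical sketch of hazard (2), not the actual server code
    void demo_corruption() {
        llama_batch batch_spec;       // no initializer: pointer members are
                                      // indeterminate, since with speculative
                                      // decoding off llama_batch_init never runs
        llama_batch_free(batch_spec); // free() on garbage pointers corrupts memory
    }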

Suggested fix:

  1. Check slot.spec for nullptr before calling common_speculative_free.
  2. Convert slot.batch_spec to a pointer and check that it was allocated before calling llama_batch_free (a sketch of the guarded cleanup follows below).
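
A minimal sketch of the guarded per-slot cleanup, applying fix 1 (names follow the destructor quoted above; llama_free is already a safe no-op on nullptr, while common_speculative_free at commit 9ca2e67 dereferences its argument):

    // sketch: free per-slot speculative state only when it was created
    for (server_slot & slot : slots) {
        common_sampler_free(slot.smpl);
        slot.smpl = nullptr;

        llama_free(slot.ctx_dft);  // safe: deleting a nullptr is a no-op
        slot.ctx_dft = nullptr;

        if (slot.spec) {           // guard: skip when no draft model was loaded
            common_speculative_free(slot.spec);
            slot.spec = nullptr;
        }

        llama_batch_free(slot.batch_spec); // still requires batch_spec to be
                                           // initialized; see fix 2 and the
                                           // comments below
    }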

kind attn: @ggerganov @slaren

First Bad Commit

first bad commit: 9ca2e67
second bad commit: 10bce04

Relevant log output

This is an observation from reading the code; I was not able to reproduce it with a built binary by sending a signal to capture system memory logs on deallocation.

slaren (Collaborator) commented Dec 4, 2024

Agree, this is likely to be an issue. batch_spec should be initialized in the same way server_context::batch is.
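
A minimal sketch of that suggestion, assuming value-initializing the member is sufficient because llama_batch_free skips members that are nullptr:

    struct server_slot {
        // ... other members unchanged ...
        llama_batch batch_spec = {};  // value-initialized: n_tokens == 0 and
                                      // every pointer member is nullptr, so
                                      // llama_batch_free has nothing to release
        // ...
    };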

slaren added the bug label and removed the bug-unconfirmed label Dec 4, 2024

ggerganov (Owner) commented

PTAL #10651
