add --mmap in llama-bench #5257

NeoZhangJianyu · 2024-02-01T14:09:32Z

add --no-mmap: in some case, mmap will lead to hang issue on SYCL backend.
show "SYCL" backend in result table.
update get_gpu_info() for SYCL.

examples/llama-bench/llama-bench.cpp

NeoZhangJianyu · 2024-02-01T15:04:50Z

@airMeng , @luoyu-intel , @abhilash1910, @ggerganov Invite you to review if you are idle.

slaren

Change the mmap parameter name from no-mmap to mmap. Set it by default to 1.
Allow multiple values
Print the value in the markdown printer if it is not the default.

NeoZhangJianyu · 2024-02-01T15:21:43Z

Change the mmap parameter name from no-mmap to mmap. Set it by default to 1.

Allow multiple values

Print the value in the markdown printer if it is not the default.

OK, I will update according to comments.

examples/llama-bench/llama-bench.cpp

slaren · 2024-02-01T16:23:18Z

The changes to llama-bench are good, should we merge this now, or wait for a review of the ggml-sycl changes?

* add --no-mmap, show sycl backend * fix conflict * fix code format, change print for --no-mmap * ren no_mmap to mmap, show mmap when not default value in printer * update guide for mmap * mv position to reduce model reload

NeoZhangJianyu added 3 commits February 1, 2024 22:05

add --no-mmap, show sycl backend

c36ecbf

Merge branch 'master' into update_bench

5ab6504

fix conflict

e4e28c1

slaren reviewed Feb 1, 2024

View reviewed changes

fix code format, change print for --no-mmap

b2f6338

slaren requested changes Feb 1, 2024

View reviewed changes

NeoZhangJianyu added 2 commits February 1, 2024 23:47

ren no_mmap to mmap, show mmap when not default value in printer

fb69ed8

update guide for mmap

da32e21

NeoZhangJianyu requested a review from slaren February 1, 2024 16:03

slaren approved these changes Feb 1, 2024

View reviewed changes

slaren reviewed Feb 1, 2024

View reviewed changes

examples/llama-bench/llama-bench.cpp Outdated Show resolved Hide resolved

mv position to reduce model reload

30c52f3

ggerganov requested a review from abhilash1910 February 1, 2024 17:39

abhilash1910 approved these changes Feb 1, 2024

View reviewed changes

slaren merged commit 128dcbd into ggerganov:master Feb 1, 2024
53 checks passed

NeoZhangJianyu deleted the update_bench branch February 2, 2024 01:04

NeoZhangJianyu changed the title ~~add --no-mmap in llama-bench~~ add --mmap in llama-bench Feb 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add --mmap in llama-bench #5257

add --mmap in llama-bench #5257

NeoZhangJianyu commented Feb 1, 2024

NeoZhangJianyu commented Feb 1, 2024

slaren left a comment

NeoZhangJianyu commented Feb 1, 2024

slaren commented Feb 1, 2024

add --mmap in llama-bench #5257

add --mmap in llama-bench #5257

Conversation

NeoZhangJianyu commented Feb 1, 2024

NeoZhangJianyu commented Feb 1, 2024

slaren left a comment

Choose a reason for hiding this comment

NeoZhangJianyu commented Feb 1, 2024

slaren commented Feb 1, 2024