Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

perplexity : faster HellaSwag via batching #5017

Merged
merged 7 commits into from
Jan 18, 2024
Merged

Conversation

ggerganov
Copy link
Owner

@ggerganov ggerganov commented Jan 18, 2024

This PR improves HellaSwag computation via the perplexity tool by batching both the endings and the tasks into a single llama_batch

For GPUs with plenty of FLOPS, adding -c 1024 or even -c 2048 might further improve performance

By default we evaluate 1 task at a time, but for small tasks it is useful to batch them together. This can be controlled with the --parallel argument.

@ggerganov ggerganov merged commit ad19812 into master Jan 18, 2024
41 of 46 checks passed
@ggerganov ggerganov deleted the gg/hellaswag-batched branch January 18, 2024 13:33
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Feb 3, 2024
* perplexity : faster HellaSwag

ggml-ci

* perplexity : clean-up

ggml-ci

* perplexity : no need for decode_helper

ggml-ci

* perplexity : add comments

* perplexity : option to specify max batched tasks via `n_parallel`

* perplexity : remove HellaSwag restruction for n_batch
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
* perplexity : faster HellaSwag

ggml-ci

* perplexity : clean-up

ggml-ci

* perplexity : no need for decode_helper

ggml-ci

* perplexity : add comments

* perplexity : option to specify max batched tasks via `n_parallel`

* perplexity : remove HellaSwag restruction for n_batch
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant