readme : refresh #10587
Conversation
This is great! Thanks 🔥
README.md (Outdated)

```diff
 ### Obtaining and quantizing models

 The [Hugging Face](https://huggingface.co) platform hosts a [large amount of LLMs](https://huggingface.co/models?library=gguf&sort=trending) compatible with `llama.cpp` - simply search for the [GGUF](https://github.com/ggerganov/ggml/blob/master/docs/gguf.md) file format.
```
Suggested change:

```diff
-The [Hugging Face](https://huggingface.co) platform hosts a [large amount of LLMs](https://huggingface.co/models?library=gguf&sort=trending) compatible with `llama.cpp` - simply search for the [GGUF](https://github.com/ggerganov/ggml/blob/master/docs/gguf.md) file format.
+The [Hugging Face](https://huggingface.co) platform hosts a [large number of LLMs](https://huggingface.co/models?library=gguf&sort=trending) compatible with `llama.cpp` - simply search for the [GGUF](https://github.com/ggerganov/ggml/blob/master/docs/gguf.md) file format.
```
Strictly speaking this is not 100% correct, since you can store arbitrary data in GGUF files (and some people unfortunately upload broken GGUFs immediately after a new model release). As of right now, though, you can reasonably assume that any GGUF model will run with llama.cpp. But we should remember to update this if GGUF models for other projects ever become popular.
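To illustrate the distinction made above: checking that a file is a GGUF container is easy, but it says nothing about whether `llama.cpp` supports the model inside. A minimal sketch (file paths here are placeholders) that checks only the 4-byte container magic:

```shell
# Sketch: check whether a file begins with the 4-byte GGUF magic.
# A matching magic only means the container format is GGUF; the metadata
# inside may still describe an architecture llama.cpp cannot run.
check_gguf() {
  if [ "$(head -c 4 "$1" 2>/dev/null)" = "GGUF" ]; then
    echo "GGUF container"
  else
    echo "not a GGUF container"
  fi
}

# Example with a placeholder path:
# check_gguf model.gguf
```

This is exactly why "GGUF file" and "works with llama.cpp" are not the same claim, even though in practice they currently overlap almost completely.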
Hmm right, I can see from the link above that flux gguf is one of the trending models, and it is not compatible with llama.cpp. But I think it's acceptable for now, as this link is here to help users find commonly used GGUF models.
Indeed, on the HF hub we don't have a measure to filter for only llama.cpp-compatible models. Maybe we could add a specific query parameter for it in the future (for example, `&compatible=llama-cpp`). CC @julien-c too!
I've updated the wording to not imply that all GGUF files will work with `llama.cpp`. Feel free to improve this in the future.
README.md (Outdated)

```diff
@@ -309,7 +264,7 @@ See [this page](./examples/main/README.md) for a full list of parameters.

 ### Conversation mode

-If you want a more ChatGPT-like experience, you can run in conversation mode by passing `-cnv` as a parameter:
+For a more ChatGPT-like experience, run `llama-cli` in conversation mode by passing `-cnv` as a parameter:
```
I would say nowadays the "ChatGPT-like" experience is the server.
I removed the "ChatGPT" term altogether.
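For reference, the two options discussed above can be sketched as follows (the model path is a placeholder, and the port is the server's usual default rather than anything specified in this thread):

```shell
# Interactive chat directly in the terminal, per the diff above:
./llama-cli -m model.gguf -cnv

# Or the web-based experience the reviewer mentions, via the server
# (exposes a browser UI and an HTTP API):
./llama-server -m model.gguf --port 8080
```

Both consume the same GGUF model file; the difference is only the front end.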
* readme : refresh
* readme : move section [no ci]
* readme : clarify [no ci]
* readme : fixes [no ci]
* readme : more fixes [no ci]
* readme : simplify [no ci]
* readme : clarify GGUF
Clean up some old content in the readme and reorganize the information a little bit.