[pull] master from ggerganov:master #23

pull · 2024-01-18T13:29:59Z

See Commits and Changes for more details.


* Metal memory: Small memory leak on init, dangling pointer, and unused autorelease pool in graph compute
* SPM header potential fix
* Reverting symlinks

* winogrande: simple implementation

  It doesn't look like it is working - why? For Mistral-7B it is barely better than random chance (score ~60% for 1267 tasks), while I see Mistral-7B scoring 78.4% on the HF leaderboard. The 1-sigma statistical uncertainty for 1267 tasks is ~1.4%, so there is no way the difference is due to statistics.

* winogrande: somewhat better

  The score for Mistral-7B is now 68.9 on the validation set of winogrande_debiased. Still far from the reported 78.4, but better than what I had before.

* winogrande: improving

  The Mistral-7B score is now 73.56. Still not quite 78.4, but getting there. We are also getting a lower score on HellaSwag compared to the HF leaderboard, so I'm not expecting we will get up to 78.4 anyway.

  It looks like it is better to skip the choice word(s) when evaluating the average log-likelihood. This kind of makes sense: a more common word (in Winogrande this is often a name) will have a higher probability without knowing about the follow-up context, and this will skew the log-likelihood towards the more common word. We can only do this if the choice words are not last in the sentence.

  It also looks like it is better to skip the punctuation at the end of the sentence, provided the choice words are not last.

* winogrande: add dataset instructions

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
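The two quantitative points in the commit message above can be illustrated with a small sketch. It is not the code from the commit: the function names, the token indexing, and the toy log-probabilities are all assumptions made for illustration. The first function reproduces the ~1.4% figure as a binomial standard error; the second averages log-likelihood only over the tokens that follow the choice word(s), optionally dropping trailing punctuation, and falls back to the whole sentence when the choice is last.

```python
import math

def binomial_sigma_pct(accuracy_pct, n_tasks):
    """1-sigma uncertainty (in percentage points) of an accuracy measured on n_tasks."""
    p = accuracy_pct / 100.0
    return 100.0 * math.sqrt(p * (1.0 - p) / n_tasks)

def score_choice(token_logprobs, choice_end, skip_trailing=0):
    """Average log-likelihood of the tokens after the choice word(s).

    token_logprobs: per-token log-probabilities of the full sentence (hypothetical input).
    choice_end:     index of the first token after the choice word(s).
    skip_trailing:  number of final tokens (e.g. punctuation) to leave out.
    """
    n = len(token_logprobs)
    if choice_end >= n:
        # The choice is last in the sentence: nothing follows it,
        # so score the whole sentence instead.
        scored = list(token_logprobs)
    else:
        scored = list(token_logprobs[choice_end:])
        # Only drop trailing tokens if something remains to be scored.
        if 0 < skip_trailing < len(scored):
            scored = scored[:-skip_trailing]
    return sum(scored) / len(scored)

def pick_answer(candidates, skip_trailing=1):
    """Return the index of the candidate with the highest score.

    candidates: list of (token_logprobs, choice_end) pairs, one per choice.
    """
    scores = [score_choice(lp, end, skip_trailing) for lp, end in candidates]
    return max(range(len(scores)), key=scores.__getitem__)

# Uncertainty for ~60% accuracy on 1267 tasks: roughly 1.4 percentage points.
sigma = binomial_sigma_pct(60.0, 1267)

# Toy data: in both sentences the choice word is token 1.
# Choice A is a common word (-0.5), choice B a rare one (-3.0).
cand_a = ([-1.0, -0.5, -2.0, -1.0, -0.1], 2)
cand_b = ([-1.0, -3.0, -0.5, -0.5, -0.1], 2)

whole_a = sum(cand_a[0]) / len(cand_a[0])
whole_b = sum(cand_b[0]) / len(cand_b[0])
best = pick_answer([cand_a, cand_b])
```

In the toy data, averaging over the whole sentence favors choice A because its choice word is the more common one, while scoring only the continuation favors choice B, whose follow-up context is more likely. This is the skew the commit message describes.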

* perplexity : faster HellaSwag ggml-ci
* perplexity : clean-up ggml-ci
* perplexity : no need for decode_helper ggml-ci
* perplexity : add comments
* perplexity : option to specify max batched tasks via `n_parallel`
* perplexity : remove HellaSwag restriction for n_batch

ptsochantaris and others added 3 commits January 18, 2024 10:47

metal : fix memory leak, dangling pointer and unused autorel (#5007)

1e605f4


scripts : add helper script to get hellaswag data in txt format

dcad445

pull bot added the ⤵️ pull label Jan 18, 2024

teleprint-me closed this Jan 18, 2024

