gguf-py: Support identity operation in TensorNameMap #3095

KerfuffleV2 · 2023-09-09T10:05:34Z

edit: Just to make it a bit clearer what this is trying to do: TensorNameMap maps the assorted naming conventions that various models use for their tensors to the GGUF convention. A HuggingFace LLaMA model might call the attention norm tensor model.layers.1.input_layernorm, the .pth version might call it something different, and so on. In GGUF it's called blk.1.attn_norm.

However, currently TensorNameMap only maps the non-GGUF names to the GGUF name. If you already have the GGUF name and try to map it, the lookup fails with a KeyError. This pull just adds an entry for the GGUF-style name to the list, so trying to map a name that's already correct is a no-op.

Before:

>>> nm = gguf.TensorNameMap(gguf.MODEL_ARCH.LLAMA, 2)
>>> nm['transformer.wte']
'token_embd'
>>> nm[nm['transformer.wte']]
Traceback (most recent call last):
  File "/blah/llama.cpp/gguf-py/gguf/gguf.py", line 351, in __getitem__
    return self.mapping[key][1]
           ~~~~~~~~~~~~^^^^^
KeyError: 'token_embd'

After:

>>> nm = gguf.TensorNameMap(gguf.MODEL_ARCH.LLAMA, 2)
>>> nm['transformer.wte']
'token_embd'
>>> nm[nm[nm[nm['transformer.wte']]]]
'token_embd'
>>> nm[nm[nm[nm['transformer.h.1.ln_1']]]]
'blk.1.attn_norm'

This also fixes an issue where you had to specify try_suffixes to TensorNameMap.get_name and friends. It just sets the default value for the keyword param to an empty sequence (I meant to do this originally but apparently I messed it up).

Make try_suffixes keyword param optional.
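
To illustrate the two changes described above, here is a minimal sketch. It is a toy stand-in, not the actual gguf-py code: the class name `ToyTensorNameMap`, the alias table, and the exact suffix handling are assumptions made for the example. It shows an identity entry being registered next to the aliases, and `try_suffixes` defaulting to an empty sequence.

```python
from typing import Dict, Optional, Sequence


class ToyTensorNameMap:
    """Toy stand-in for a tensor name map (hypothetical, not the gguf-py class)."""

    def __init__(self, n_blocks: int) -> None:
        # GGUF-style name -> names used by other formats (illustrative subset).
        templates = {
            "token_embd": ("transformer.wte",),
            "blk.{bid}.attn_norm": (
                "transformer.h.{bid}.ln_1",
                "model.layers.{bid}.input_layernorm",
            ),
        }
        self.mapping: Dict[str, str] = {}
        for gguf_name, aliases in templates.items():
            for bid in range(n_blocks):
                target = gguf_name.format(bid=bid)
                # Identity entry: a name that is already correct maps to itself.
                self.mapping[target] = target
                for alias in aliases:
                    self.mapping[alias.format(bid=bid)] = target

    def get_name(self, key: str, try_suffixes: Sequence[str] = ()) -> Optional[str]:
        # try_suffixes defaults to an empty sequence, so callers may omit it.
        result = self.mapping.get(key)
        if result is not None:
            return result
        for suffix in try_suffixes:
            if key.endswith(suffix):
                result = self.mapping.get(key[: -len(suffix)])
                if result is not None:
                    return result + suffix
        return None

    def __getitem__(self, key: str) -> str:
        return self.mapping[key]


nm = ToyTensorNameMap(2)
once = nm["transformer.wte"]
twice = nm[nm["transformer.wte"]]  # no KeyError thanks to the identity entry
with_suffix = nm.get_name("transformer.h.1.ln_1.weight", try_suffixes=(".weight",))
```

In this sketch, mapping a name twice gives the same result as mapping it once, and `get_name` can be called with just the key.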

cebtenzzre · 2023-09-09T20:51:41Z

If I understand #3093 (comment) correctly, we don't actually need this.

KerfuffleV2 · 2023-09-09T21:33:29Z

> If I understand #3093 (comment) correctly, we don't actually need this.

It's not specifically for that. That pull is just what made me aware of the current less-than-ideal behavior, and this change is meant to keep other people from hitting the same issue in the future. As far as I understand it, the name mapping stuff is supposed to let you translate the tensor names in a model to the GGUF convention without actually caring about the details. The fact that you can't when the name is already correct is unintuitive.

The pull also fixes an issue where you had to specify the suffixes to try no matter what, which is weird and not what I intended. I just stupidly forgot to specify the default value for the keyword arguments.

Not saying it's of earthshaking importance or anything, it just makes the interface more ergonomic and intuitive to use. I think it makes sense to do that kind of thing when it's simple/easy and doesn't break any existing API usage.

cebtenzzre · 2023-09-09T21:57:48Z

Is there a clear use case that would take advantage of the identity mapping? I thought the conversion to GGUF was only done once when it's working correctly.

KerfuffleV2 · 2023-09-09T22:22:13Z

> Is there a clear use case that would take advantage of the identity mapping?

By use case do you mean something that can't be done without that behavior? If so, no. But one could say the same thing about a lot of things. There isn't necessarily a clear use case for specifying defaults for keyword arguments either.

These are just things that make working with the code easier and more ergonomic, and make it less likely for people to run into issues where the behavior is unintuitive. It can also enable writing more general code instead of having to special-case stuff.
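
As a rough illustration of the special-casing point, the sketch below uses plain dicts as a stand-in for the name map. The alias table and the function names are hypothetical and are not part of gguf-py.

```python
# Hypothetical alias table; a plain dict stands in for the real name map.
ALIASES = {
    "transformer.wte": "token_embd",
    "transformer.h.1.ln_1": "blk.1.attn_norm",
}


def to_gguf_special_cased(name: str) -> str:
    # Without identity entries, the caller has to check for names that
    # are already in GGUF form before doing the lookup.
    if name in ALIASES.values():
        return name
    return ALIASES[name]


# With identity entries added, a single lookup handles both cases.
WITH_IDENTITY = {**ALIASES, **{v: v for v in ALIASES.values()}}


def to_gguf(name: str) -> str:
    return WITH_IDENTITY[name]


# A mix of names that still need translating and names that are already correct.
mixed = ["transformer.wte", "blk.1.attn_norm", "token_embd"]
normalized = [to_gguf(n) for n in mixed]
```

Both functions give the same result here; the second one just doesn't need the extra branch.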

> I thought the conversion to GGUF was only done once when it's working correctly.

This change is in the GGUF Python package, not a conversion script. It's a public API in a public package so people can use it for whatever they want, not necessarily just conversion. Presumably the GGUF Python package will also support reading GGUF files eventually.

People could also find ways for the package to solve problems even without caring about GGUF files specifically. Just as an example, the tensor mapping stuff could be useful to anyone who wants to deal with different flavors of models without having to figure out all the tensor types and write their own mapping code.

cebtenzzre · 2023-09-09T22:38:20Z

Okay.

gguf-py: Support identity operation in TensorNameMap

d5a3c4a

Make try_suffixes keyword param optional.

KerfuffleV2 mentioned this pull request Sep 9, 2023

Adding SqueezeLLM Support #3093

Open

KerfuffleV2 added the script Script related label Sep 9, 2023

ggerganov approved these changes Sep 14, 2023

ggerganov merged commit e394084 into ggerganov:master Sep 14, 2023

pkrmf pushed a commit to morlockstudios-com/llama.cpp that referenced this pull request Sep 26, 2023

gguf-py : support identity operation in TensorNameMap (ggerganov#3095)

4ae1f82

Make try_suffixes keyword param optional.

KerfuffleV2 mentioned this pull request Sep 29, 2023

Various script cleanups/fixes + convert merges and special token handling #2842

Merged

KerfuffleV2 deleted the fix-gguf-name-map-identity branch November 17, 2023 03:13
