aarch64 CUDA build: ggml.h(309): error: identifier "half" is undefined #4922
Comments
Does it work if you replace |
@slaren thanks but no:
|
Hello there
what's the output? I will test nix on my jetson orin nano later. |
I had the same error; including cuda_fp16.h in ggml.h fixed it.
|
same error using
@planform adding this could help, but we don't know which change caused this compile error.
|
I think the problem is here: this patch adds … so the solution is to move … The patch to fix this problem is below:
|
After applying the patch above, and also the other patch for ggml.h in multiple locations (this file is used in different versions during the build), I get the following error on a Jetson AGX Xavier with CUDA 11:
My protoc version is libprotoc 3.6.1 and the build command I used is
So it seems the fixes are working but in the end the protoc version is an issue for the compiler. |
@KyL0N patch works also on Jetson AGX Orin 👍 |
Did you also build the gRPC server, or were you using an external one? |
Sorry for the off-topic. The best option for you is to use https://github.com/dusty-nv/jetson-containers @ark626 did you manage to install LocalAI? Do you have a Dockerfile? |
No, sadly not. I have been trying to install this for 3 days. I almost had a build of v1.25.0 finished, but due to some repository force-pushing the older builds don't seem to work anymore. Currently I am trying to build it directly on a fresh install of Jetson AGX Xavier, but since my knowledge of C builds is very limited, it's hard to figure out what exactly is wrong. I will have a look at the jetson containers, but I want to install the complete localai.io, as I am trying to play around with it in Home Assistant as my own ChatGPT variant. But first I need to get at least one build to run. |
@ark626 in https://github.com/dusty-nv/jetson-containers you have https://github.com/oobabooga/text-generation-webui working out of the box on Jetson. This webui has an OpenAI-compatible API which you can use in an HA extension. I'm also working on my own extension ;) I strongly recommend using Docker for those experiments. It's easier to manage dependencies and configurations. |
Thank you, very nice. I will have a look at it. |
@planform's patch is sufficient and seems to be minimal. @planform @KyL0N would either of you like to open a PR, or should I?
The more general answer is "use the correct protobuf version from any source that ships it" (e.g. another distribution, conda, a prebuilt multi-gigabyte docker image, or 🙃 Nixpkgs). I'll stop here and abstain from being a shill |
@KyL0N's solution works for my Jetson Nano :) |
@SomeoneSerge hello, PR #4975 is already merged |
Thanks @KyL0N! One can see a passing pipeline e.g. in https://github.com/ggerganov/llama.cpp/actions/runs/7611010381/job/20725572309 |
Steps To Reproduce
Steps to reproduce the behavior:
nix build .#packages.aarch64-linux.jetson-xavier
Build log
CI: https://github.com/ggerganov/llama.cpp/actions/runs/7514510149/job/20457461738#step:8:1499
Cleaner logs: https://gist.github.com/SomeoneSerge/33008b08b7bd887e994b7e52cd432af0
Additional context
Previous work and related issues
#1455 faced a related issue and introduced (#2670) the typedef at line 309.
The failure is at most three weeks old; a successful Xavier build was confirmed e.g. in #4605 (comment)
I'll run a bisect if/when I get access to an aarch64 builder
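For readers unfamiliar with this failure mode, here is a minimal sketch of the kind of guarded typedef discussed above. The name example_fp16_t, the helper example_fp16_size, and the exact preprocessor conditions are assumptions made for illustration; this is not the contents of ggml.h or of any patch in this thread. It only shows that the header declaring half (cuda_fp16.h) has to be seen before a typedef that uses it, and that other targets can fall back to a 16-bit storage type.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a half-precision storage type. */
#if defined(__ARM_NEON) && defined(__CUDACC__)
/* Compiling with nvcc on ARM: `half` is only known once this header is included. */
#include <cuda_fp16.h>
typedef half example_fp16_t;
#elif defined(__ARM_NEON) && !defined(_MSC_VER)
/* ARM host compilers typically provide __fp16 as a storage type. */
typedef __fp16 example_fp16_t;
#else
/* Everywhere else: store the raw 16 bits. */
typedef uint16_t example_fp16_t;
#endif

/* Hypothetical helper: every branch above should yield a 2-byte type. */
size_t example_fp16_size(void) {
    return sizeof(example_fp16_t);
}
```

If the include in the first branch is missing, or appears only after the typedef, the compiler reports an undefined identifier for half, which matches the error in the issue title.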