Add flake app to run openai proxy #4612

Open · wants to merge 1 commit into master

Conversation

ParetoOptimalDev

Works with:

nix run .#llama-server-openai-proxy

If merged, you can then run both the server and the proxy via two commands:

nix run github:ggerganov/llama.cpp#llama-server
nix run github:ggerganov/llama.cpp#llama-server-openai-proxy

Note: this is likely not the ideal way to do this, but it works and can hopefully spark useful discussion about getting this feature into the llama.cpp flake 😃

program = "${
  (let pythonWithPkgs = pkgs.python3.withPackages (ps: with ps; [ flask requests ]);
  in pkgs.writeShellScriptBin "run-openai-proxy" ''
    ${pythonWithPkgs}/bin/python3 ${self}/examples/server/api_like_OAI.py
  '')
}/bin/run-openai-proxy";

Collaborator

This could've been a makeWrapper. I don't know whether api_like_OAI.py takes any command-line arguments, but makeWrapper would have passed them on.
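
A minimal sketch of that alternative (untested; assumes the same pkgs, pythonWithPkgs, and self bindings as the snippet above):

pkgs.runCommand "run-openai-proxy" { nativeBuildInputs = [ pkgs.makeWrapper ]; } ''
  mkdir -p $out/bin
  # makeWrapper generates a wrapper that execs python3 with the script path
  # prepended via --add-flags and forwards any extra CLI arguments ("$@").
  makeWrapper ${pythonWithPkgs}/bin/python3 $out/bin/run-openai-proxy \
    --add-flags ${self}/examples/server/api_like_OAI.py
''

So run-openai-proxy --port 8081 would become python3 api_like_OAI.py --port 8081, which the current writeShellScriptBin version drops.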

Comment on lines +116 to +120
  (let pythonWithPkgs = pkgs.python3.withPackages (ps: with ps; [ flask requests ]);
  in pkgs.writeShellScriptBin "run-openai-proxy" ''
    ${pythonWithPkgs}/bin/python3 ${self}/examples/server/api_like_OAI.py
  '')
}/bin/run-openai-proxy";

Collaborator

Can't suggest anything concrete right now, but it might be a good idea to do some form of buildPythonApplication, because it's probably less affected by PYTHONPATH contamination.
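
A rough sketch of that direction (an assumption, not a tested recipe; the pname and install layout here are made up):

pkgs.python3Packages.buildPythonApplication {
  pname = "llama-server-openai-proxy";
  version = "unstable";
  format = "other";   # no setup.py/pyproject.toml; install by hand
  dontUnpack = true;
  propagatedBuildInputs = with pkgs.python3Packages; [ flask requests ];
  installPhase = ''
    mkdir -p $out/bin
    # Prepend a shebang so the script is directly executable.
    echo '#!${pkgs.python3.interpreter}' > $out/bin/run-openai-proxy
    cat ${self}/examples/server/api_like_OAI.py >> $out/bin/run-openai-proxy
    chmod +x $out/bin/run-openai-proxy
  '';
}

Because buildPythonApplication wraps everything under $out/bin during fixup, flask and requests come from the closure rather than from whatever PYTHONPATH happens to be set at run time.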

@@ -110,6 +110,15 @@
       type = "app";
       program = "${self.packages.${system}.default}/bin/llama-server";
     };
+    apps.llama-server-openai-proxy = {

Collaborator

I was going to mention this in #4605, but we wouldn't have had to manually wrap these in apps if we instead exposed them (in the overlay and) in packages/legacyPackages, and ensured that meta.mainProgram is set correctly (with writeShellScriptBin it is).
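
As a hypothetical illustration (the attribute name is assumed here, not taken from the PR), exposing the wrapper as a package would let nix run resolve the binary through meta.mainProgram with no apps entry at all:

packages.llama-server-openai-proxy =
  # writeShellScriptBin names the derivation and sets meta.mainProgram,
  # so `nix run .#llama-server-openai-proxy` finds the binary on its own.
  pkgs.writeShellScriptBin "llama-server-openai-proxy" ''
    ${pythonWithPkgs}/bin/python3 ${self}/examples/server/api_like_OAI.py "$@"
  '';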

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You could now add this to scope.nix:

lib.makeScope newScope (
  self: {
    inherit llamaVersion;
    llama-cpp = self.callPackage ./package.nix { };
    # plus, presumably, the new attribute (the file name here is assumed):
    llama-server-openai-proxy = self.callPackage ./openai-proxy.nix { };
  }
)

This way the attribute would show up as llamaPackages.llama-server-openai-proxy in nixpkgs with the flake's overlays.default applied, as well as in the flake's legacyPackages.${system}.llamaPackages.
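
For instance (a usage sketch; the attribute path assumes the addition above), the package could then be built straight from the flake's legacyPackages:

nix build github:ggerganov/llama.cpp#llamaPackages.llama-server-openai-proxy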

SomeoneSerge added the nix label (Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment) on Dec 31, 2023
ParetoOptimalDev (Author)

I think this might not be needed, since the default llama-server started on port 8080 seems to be OpenAI-compatible.

ggerganov (Owner)

> I think this might not be needed, since the default llama-server started on port 8080 seems to be OpenAI-compatible.

Yes, I'm thinking we should probably deprecate examples/server/api_like_OAI.py and improve the built-in server support for the OpenAI API (#4216)
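
For reference, a quick way to poke the built-in endpoint (a sketch; assumes the server is running on the default port 8080 and the OAI-compatible route from #4216 is available):

curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello"}]}'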
