
feat: expose openai api endpoints from vllm #112

Merged · 4 commits merged into master on Oct 25, 2024
Conversation

@quitrk (Collaborator) commented Oct 24, 2024

No description provided.

@quitrk force-pushed the tavram/openai branch 5 times, most recently from 80a3a80 to 4a4396b on October 24, 2024 13:25
vllm_server_path = os.environ.get('VLLM_SERVER_PATH', 'vllm.entrypoints.openai.api_server')
openai_api_server_port = int(os.environ.get('OPENAI_API_SERVER_PORT', 8003))
openai_api_base_url = os.environ.get('OPENAI_API_BASE_URL', f'http://localhost:{openai_api_server_port}')
openai_api_server_port = int(os.environ.get('OPENAI_API_SERVER_PORT', app_port if use_vllm else 8003))
A reviewer (Member) commented on this hunk:
Shall we get rid of this while we're here, and use the URL, defaulting to ollama's default port when not doing vllm?
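For illustration only, the suggested change could look roughly like the sketch below: drop the separate port setting, configure only OPENAI_API_BASE_URL, and default it to Ollama's standard local port (11434) when vLLM is not in use. The names use_vllm and app_port come from the hunk above; how they are populated here is an assumption, and this is not necessarily the change that was ultimately committed.

import os
from urllib.parse import urlparse

# Placeholders standing in for skynet's real settings (assumptions for this sketch).
use_vllm = os.environ.get('USE_VLLM', '0') == '1'
app_port = int(os.environ.get('APP_PORT', 8000))

# Default the base URL to the app's own port when serving via vLLM,
# otherwise to Ollama's default local port.
default_base = f'http://localhost:{app_port}' if use_vllm else 'http://localhost:11434'
openai_api_base_url = os.environ.get('OPENAI_API_BASE_URL', default_base)

# Anything that still needs a bare port can derive it from the URL.
openai_api_server_port = urlparse(openai_api_base_url).port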

sys.exit(1)

log.info(f'Enabled modules: {modules}')

if device == 'cuda' or is_mac:
    log.info('Using GPU')
A reviewer (Member) commented:
I like how you solved this!
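For context on the hunk above, a minimal sketch of how device and is_mac might be derived, assuming the usual torch and platform checks; the actual skynet code may differ.

import logging
import platform

import torch

log = logging.getLogger(__name__)

# Assumption: CUDA when available, with Apple Silicon Macs treated as
# GPU-capable (via Metal/MPS) rather than falling back to plain CPU.
is_mac = platform.system() == 'Darwin'
device = 'cuda' if torch.cuda.is_available() else 'cpu'

if device == 'cuda' or is_mac:
    log.info('Using GPU')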

@saghul (Member) left a comment:

LGTM! We can get rid of the llama-cpp-server thing in a separate PR...

@quitrk quitrk merged commit 22237df into master Oct 25, 2024
@quitrk quitrk deleted the tavram/openai branch October 25, 2024 05:44
shooding added a commit to shooding/skynet that referenced this pull request on Nov 26, 2024
* master: (61 commits)
  deps: use pypi provided silero vad, upgrade to latest
  fix: remove public key validation (jitsi#123)
  fix: downgrade vllm (jitsi#122)
  feat: add fallback folder when looking up public keys (jitsi#119)
  fix: add ffmpeg dependency for pytorch
  ref: bypass queueing jobs with invalid payload (jitsi#121)
  fix: replace examplar usage with label for app_id
  feat: add instrumentation for app_id (jitsi#118)
  fix: re-enable vLLM multiprocessing (jitsi#116)
  fix: update incorrect prompt example
  fix: healthchecks failing due to missing internal id (jitsi#115)
  feat(openai-api) use Ollama for local development
  feat: expose openai api endpoints from vllm (jitsi#112)
  feat: update text hint type prompting (jitsi#111)
  feat: add meeting hint type and use it as default (jitsi#110)
  feat: enable requests batching (jitsi#109)
  metrics: add full duration metric
  metrics: add a skipped job status which will not count towards duration metrics
  fix: catch exceptions when echoing fails
  feat: add support for echoing requests (jitsi#107)
  ...

# Conflicts:
#	Dockerfile
#	Makefile
#	requirements.txt