Convert CUDA env TGI variables to LMI #2013
Conversation
@@ -47,6 +47,12 @@ translateTGIToLMI "SM_NUM_GPUS" "TENSOR_PARALLEL_DEGREE"
translateTGIToLMI "MAX_CONCURRENT_REQUESTS" "SERVING_JOB_QUEUE_SIZE"
translateTGIToLMI "MAX_BATCH_PREFILL_TOKENS" "OPTION_MAX_ROLLING_BATCH_PREFILL_TOKENS"
translateTGIToLMI "MAX_BATCH_SIZE" "OPTION_MAX_ROLLING_BATCH_SIZE"
translateTGIToLMI "ENABLE_CUDA_GRAPHS" "OPTION_ENFORCE_EAGER"
This one needs to be translated to the opposite value, right? If enable_cuda_graphs = true, then enforce_eager = false.
Thanks Sid. You are right, per https://github.com/vllm-project/vllm/blob/main/vllm/entrypoints/llm.py#L70.
Changed it.
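For reference, a hedged sketch of how the corrected, inverted mapping could be handled. The helper name translateInvertedBoolean is hypothetical; the actual fix in the PR may be structured differently:

```bash
# Hypothetical inverted translation: ENABLE_CUDA_GRAPHS=true must become
# OPTION_ENFORCE_EAGER=false, because vLLM's enforce_eager flag disables
# CUDA graph capture (and vice versa).
translateInvertedBoolean() {
  local tgi_name="$1"
  local lmi_name="$2"
  case "${!tgi_name}" in
    true|TRUE|True)    export "${lmi_name}=false" ;;
    false|FALSE|False) export "${lmi_name}=true" ;;
  esac
}

translateInvertedBoolean "ENABLE_CUDA_GRAPHS" "OPTION_ENFORCE_EAGER"
```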
Description
Translates some of the TGI-style CUDA environment variables to LMI format.
Testing:
Opened a bash shell inside a container and tested the entrypoint script:
bash /usr/local/bin/dockerd-entrypoint.sh serve
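As an illustrative check (using the hypothetical helpers sketched above, not the PR's actual code), one could set TGI-style variables in a shell and confirm the LMI-style counterparts after translation:

```bash
# Set TGI-style variables, run the translations, and inspect the results.
export SM_NUM_GPUS=4
export ENABLE_CUDA_GRAPHS=true

translateTGIToLMI "SM_NUM_GPUS" "TENSOR_PARALLEL_DEGREE"
translateInvertedBoolean "ENABLE_CUDA_GRAPHS" "OPTION_ENFORCE_EAGER"

echo "TENSOR_PARALLEL_DEGREE=${TENSOR_PARALLEL_DEGREE}"  # expected: 4
echo "OPTION_ENFORCE_EAGER=${OPTION_ENFORCE_EAGER}"      # expected: false
```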