Apply temp. patch to Triton code to resolve conflicting cache dirs in TP case.

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
tdoublep committed May 28, 2024
1 parent 066041a commit e679a9d
Showing 2 changed files with 18 additions and 0 deletions.
10 changes: 10 additions & 0 deletions Dockerfile.ubi
@@ -264,6 +264,14 @@ RUN --mount=type=cache,target=/root/.cache/pip \
RUN microdnf install -y gcc \
&& microdnf clean all

# patch triton (fix for #720)
COPY triton_patch/cache_fix.patch .
RUN microdnf install -y patch \
&& patch /opt/vllm/lib/python3.11/site-packages/triton/runtime/cache.py cache_fix.patch \
&& microdnf remove -y patch \
&& microdnf clean all \
&& rm cache_fix.patch

ENV HF_HUB_OFFLINE=1 \
PORT=8000 \
GRPC_PORT=8033 \
@@ -281,5 +289,7 @@ RUN microdnf install -y shadow-utils \

COPY LICENSE /licenses/vllm.md

RUN microdnf install -y nano diffutils patch

USER 2000
ENTRYPOINT ["python3", "-m", "vllm.entrypoints.openai.api_server"]
8 changes: 8 additions & 0 deletions triton_patch/cache_fix.patch
@@ -0,0 +1,8 @@
4c4
< import random
---
> import uuid
117c117
< rnd_id = random.randint(0, 1000000)
---
> rnd_id = str(uuid.uuid4())
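
The patch above replaces a bounded random integer with a UUID as the suffix of a temporary name. A minimal sketch of why that helps, assuming the cache writes to a temporary file and then renames it into place (this is an illustration, not Triton's actual `cache.py`; `atomic_put` and the `.tmp.pid_` naming are hypothetical): a small-range or identically seeded `random.randint` can produce the same suffix in several tensor-parallel workers, while `uuid.uuid4()` draws 122 random bits from the OS and does not depend on the `random` module's state.

```python
import os
import tempfile
import uuid


def atomic_put(cache_dir: str, filename: str, data: bytes) -> str:
    """Write data to cache_dir/filename through a uniquely named temp file.

    Hypothetical helper for illustration only; not Triton's API.
    """
    os.makedirs(cache_dir, exist_ok=True)
    final_path = os.path.join(cache_dir, filename)
    # uuid4 is independent of random.seed(), so concurrent writers
    # should not end up with the same temporary name.
    rnd_id = str(uuid.uuid4())
    temp_path = f"{final_path}.tmp.pid_{os.getpid()}_{rnd_id}"
    with open(temp_path, "wb") as f:
        f.write(data)
    # os.replace overwrites the destination; the rename is atomic on
    # POSIX when both paths are on the same filesystem.
    os.replace(temp_path, final_path)
    return final_path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as d:
        path = atomic_put(d, "kernel.bin", b"compiled-bytes")
        print(os.path.basename(path), os.listdir(d))
```

Because each writer gets its own temporary name, the last `os.replace` wins and no writer sees a partially written or already-moved temp file.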
