Compile kernels and fix build #17

njhill · 2024-04-11T17:10:26Z

These Dockerfile changes:

Update the release stage to work with the recently refactored requirements-common.txt / requirements-cuda.txt split
Fixup the kernel compilation in the build stage to correctly pick up cuda
Install the kernels from this docker build rather than pulling a precompiled wheel. We can swap that back once a new wheel is available with the correct pytorch version + updated interfaces

Signed-off-by: Nick Hill <nickhill@us.ibm.com>

Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>

njhill

commit 1c75dd5 Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Thu Apr 11 11:07:45 2024 -0700 ✨ add console scripts to setup.py Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 4d1686d Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Tue Apr 9 15:28:37 2024 -0700 🎨 format Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit c142e08 Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Tue Apr 9 15:26:15 2024 -0700 ✅ add test cli file Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 0cb571d Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Tue Apr 9 15:25:44 2024 -0700 🎨 fix typo Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 9ff66fe Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Tue Apr 9 11:45:01 2024 -0700 🎨 fmt Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 7f3d0b6 Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Tue Apr 9 11:13:31 2024 -0700 ✅ add test file Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 6691fd6 Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Fri Apr 5 11:57:52 2024 -0700 ✨ add tgis-cli tools Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 15076fa Author: Nick Hill <nickhill@us.ibm.com> Date: Fri Apr 12 00:50:25 2024 +0100 Compile kernels and fix build (#17) These Dockerfile changes: - Update the release stage to work with the recently refactored `requirements-common.txt` / `requirements-cuda.txt` split - Fixup the kernel compilation in the `build` stage to correctly pick up cuda - Install the kernels from this docker build rather than pulling a precompiled wheel. We can swap that back once a new wheel is available with the correct pytorch version + updated interfaces --------- Signed-off-by: Nick Hill <nickhill@us.ibm.com> Signed-off-by: Joe Runde <Joseph.Runde@ibm.com> Co-authored-by: Joe Runde <Joseph.Runde@ibm.com> Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>

commit 82d2261 Author: Prashant Gupta <prashantgupta@us.ibm.com> Date: Wed Apr 17 15:44:35 2024 -0700 ♻️ update dockerfile.ubi with vllm wheel installation Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com> commit 15076fa Author: Nick Hill <nickhill@us.ibm.com> Date: Fri Apr 12 00:50:25 2024 +0100 Compile kernels and fix build (#17) These Dockerfile changes: - Update the release stage to work with the recently refactored `requirements-common.txt` / `requirements-cuda.txt` split - Fixup the kernel compilation in the `build` stage to correctly pick up cuda - Install the kernels from this docker build rather than pulling a precompiled wheel. We can swap that back once a new wheel is available with the correct pytorch version + updated interfaces --------- Signed-off-by: Nick Hill <nickhill@us.ibm.com> Signed-off-by: Joe Runde <Joseph.Runde@ibm.com> Co-authored-by: Joe Runde <Joseph.Runde@ibm.com> Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>

Initial support for AIU in vLLM. What is currently supported/tested: - Single AIU - Model: llama-7b-chat - Offline inference (batch size 1) - Online inference (with `max-num-seq=1`) --------- Signed-off-by: Nikolaos Papandreou <npo@zurich.ibm.com> Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by: Nikolaos Papandreou <npo@zurich.ibm.com> Co-authored-by: TRAVIS JOHNSON <tsjohnso@us.ibm.com>

njhill force-pushed the kernel-compat-hack branch 2 times, most recently from b2dc7b7 to 318276c Compare April 11, 2024 17:18

Temp hack to work with prebuilt 0.4.0-post kernels

ba27b4d

Signed-off-by: Nick Hill <nickhill@us.ibm.com>

njhill force-pushed the kernel-compat-hack branch from 318276c to ba27b4d Compare April 11, 2024 19:48

🐛 compile it ourselves

e6ec7db

Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>

joerunde changed the title ~~[DO NOT MERGE] Temp hack to work with prebuilt 0.4.0-post kernels~~ Compile kernels and fix build Apr 11, 2024

joerunde marked this pull request as ready for review April 11, 2024 23:41

njhill commented Apr 11, 2024

View reviewed changes

njhill merged commit 15076fa into main Apr 11, 2024
4 checks passed

njhill deleted the kernel-compat-hack branch April 11, 2024 23:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compile kernels and fix build #17

Compile kernels and fix build #17

njhill commented Apr 11, 2024 •

edited by joerunde

Loading

njhill left a comment

Compile kernels and fix build #17

Compile kernels and fix build #17

Conversation

njhill commented Apr 11, 2024 • edited by joerunde Loading

njhill left a comment

Choose a reason for hiding this comment

njhill commented Apr 11, 2024 •

edited by joerunde

Loading