Handle the model unload requests in model adapter controller #42
Labels
area/lora
kind/feature
priority/critical-urgent
🚀 Feature Description and Motivation
After a model adapter is deleted, we should unload it from the pod. Otherwise, the LoRA adapter stays loaded until the GC evicts it. We definitely prefer the eager approach over the lazy one here.
We need to respond to the deletion event and send an unload request to the corresponding pod. There's no way to update the model adapter after that point.
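A minimal Go sketch of the eager path, under stated assumptions: `ModelAdapter` is a stand-in struct rather than the real custom resource, the finalizer name `example.com/unload-lora-adapter` is hypothetical, and the `/v1/unload_lora_adapter` path with a `lora_name` payload is modeled on vLLM's dynamic LoRA API, which this issue does not confirm as the target. The idea is that a finalizer keeps the object around after `.metadata.deletionTimestamp` is set, so the controller can send the unload request and only then let the deletion finish.

```go
package main

import (
	"bytes"
	"context"
	"encoding/json"
	"fmt"
	"net/http"
	"time"
)

// unloadFinalizer is a hypothetical finalizer name used only in this sketch.
const unloadFinalizer = "example.com/unload-lora-adapter"

// ModelAdapter is a stand-in for the real custom resource; only the
// fields this sketch needs are modeled.
type ModelAdapter struct {
	Name              string
	PodURL            string     // base URL of the pod serving the adapter (assumed)
	DeletionTimestamp *time.Time // non-nil once deletion has been requested
	Finalizers        []string
}

// MarkDeleted mimics the API server setting the deletion timestamp.
func (a *ModelAdapter) MarkDeleted() {
	now := time.Now()
	a.DeletionTimestamp = &now
}

// Unloader asks a pod to drop a loaded adapter.
type Unloader func(podURL, adapterName string) error

// HTTPUnloader returns an Unloader that POSTs to an assumed unload endpoint.
func HTTPUnloader(client *http.Client, timeout time.Duration) Unloader {
	return func(podURL, adapterName string) error {
		ctx, cancel := context.WithTimeout(context.Background(), timeout)
		defer cancel()
		body, err := json.Marshal(map[string]string{"lora_name": adapterName})
		if err != nil {
			return err
		}
		req, err := http.NewRequestWithContext(ctx, http.MethodPost,
			podURL+"/v1/unload_lora_adapter", bytes.NewReader(body))
		if err != nil {
			return err
		}
		req.Header.Set("Content-Type", "application/json")
		resp, err := client.Do(req)
		if err != nil {
			return err
		}
		defer resp.Body.Close()
		if resp.StatusCode != http.StatusOK {
			return fmt.Errorf("unload %q: unexpected status %s", adapterName, resp.Status)
		}
		return nil
	}
}

// ReconcileDelete handles the deletion path. It reports true once the
// finalizer is gone and the object may actually be removed.
func ReconcileDelete(a *ModelAdapter, unload Unloader) (bool, error) {
	if a.DeletionTimestamp == nil {
		return false, nil // object is live; nothing to unload
	}
	if !hasFinalizer(a.Finalizers, unloadFinalizer) {
		return true, nil // cleanup already happened
	}
	if err := unload(a.PodURL, a.Name); err != nil {
		return false, err // keep the finalizer so the next reconcile retries
	}
	a.Finalizers = removeFinalizer(a.Finalizers, unloadFinalizer)
	return true, nil
}

func hasFinalizer(list []string, name string) bool {
	for _, f := range list {
		if f == name {
			return true
		}
	}
	return false
}

func removeFinalizer(list []string, name string) []string {
	out := make([]string, 0, len(list))
	for _, f := range list {
		if f != name {
			out = append(out, f)
		}
	}
	return out
}
```

In a real controller this would presumably be called from the reconcile loop, for example `ReconcileDelete(adapter, HTTPUnloader(http.DefaultClient, 5*time.Second))`, followed by an update that persists the trimmed finalizer list. Taking the `Unloader` as a parameter keeps the deletion logic testable without a running pod.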
BTW, we could also leverage a finalizer to do this: .metadata.deletionTimestamp will be set first, and the finalizer can help unload the model.
Use Case
No response
Proposed Solution
No response
Alternatives Considered
No response
Additional Context
No response