-
Notifications
You must be signed in to change notification settings - Fork 121
Can not terminate running tasks #308
Comments
A few questions:
Also, please see the |
Hi @alfpark, thanks for your quick reply. The pool specification is below, which I think is not in
Also I tried with If I understand correctly |
It's possible that the termination signal is not being properly propagated within the running container. Do you have a provisioned SSH user? If not, then Shipyard cannot kill these containers properly in non- It may be cleaner to use a For redeploying on a new image - I assume you accidentally left out the The DevOps task is intriguing... please open a new issue for that so we can track as a proper feature request. |
You are right that we use I can open a new support feature request for DevOps after gathering some requirements. Thanks for your hint on |
Yes, When you issue the |
I can only see this
|
Thanks for confirming - this is most likely a regression. I have a fix that should be landing shortly in Note that if you're moving from |
- SSH side-channel docker kill signal was not being sent as Docker tasks were not being detected properly - Also fix issue with pool images update not executing if block on images is false - Resolves #308
- SSH side-channel docker kill signal was not being sent as Docker tasks were not being detected properly - Also fix issue with pool images update not executing if block on images is false - Resolves #308
@alfpark Can you please confirm the new version of docker |
Did you redeploy your pool and jobs using the |
Sorry I should have elaborated better. I used |
Sorry, I'm still a bit unclear here. I understand you re-deployed your test pool with
If yes to both, can you please elaborate about your job (or post a redacted jobs.yaml) and also a screenshot from the portal UI or Batch explorer of your task command line and environment variables (redact as necessary)? |
My answer is yes for both. Below is the jobs.yaml file
I attach here some screenshot images for our sample job and task
|
@hieuhc Thanks for the detailed report. You may want to consider rotating your ACR credentials as part of the credential leaked in the screenshot above (I edited out the screenshot). It looks like there was another defect with infinite retry tasks and termination. Please try the new |
Hi. I confirm the newest fix has resolved the issue. Much appreciated for helping. |
Problem Description
I can not terminate running tasks in a job with
jobs tasks term
. The command hangs for long time if I use--wait
. When I terminate the task using Azure Portal, it is marked withCompleted
state, but when I log in to the node I can see the container is still running.Batch Shipyard Version
3.7.1
Steps to Reproduce
jobs tasks term
.Expected Results
The task is terminated with
Completed
state. There is no Docker container running when logging into the node.Actual Results
Can not terminate using the command.
Additonal Comments
I wonder what should be expected when I specify
max_task_retries
being -1. To be able to terminate this kind of task, I had to manually terminate in Azure Portal, then log in to the node anddocker rm -f
The text was updated successfully, but these errors were encountered: