Default Job Idle Timeout not working #11484
Comments
Any help?
@jakemcdermott @oweel have you seen my comment on PR #10906?
I have the same problem, but in my case the job is killed exactly 60 minutes after it goes idle. I have some long-running jobs, and this causes problems because they do not finish properly.
If you look at
@shanemcd
@shanemcd Were you able to take a look at this?
@shanemcd Any update on this would be highly appreciated.
@shanemcd How can I help fix this issue?
@shanemcd Can you please revisit this?
I'm facing the same issue in AWX 19.5.1.
FYI, I am seeing this in 19.3.0 as well. The timeout is not working at all. Let me add more detail: I have the system-wide timeout set to 300 (5 minutes), and that works. But setting the timeout in the template does not work. The template timeout is completely ignored, no matter what. If I change the system-wide timeout to 0, the default, the timeout settings in the templates are still ignored.
@shanemcd Do you have any insight into why this issue is not fixed? I updated to 0.21.0 and am still getting the same issue, even with the latest EE images.
I have not been able to produce this by running a playbook that does
@AlanCoding What are you running AWX on? The bug I experience is that the Kubernetes pod running the job gets terminated after 5 minutes of idle time.
This issue is "Default Job Idle Timeout not working"; getting terminated by OCP after 5 minutes of idleness is kind of a different thing. I put up #12289 to adjust the help text, and I see that mixed messages might be confusing the issue. I kicked off a job to see if I can reproduce it.
Right now I don't seem to be able to reproduce this, as container group jobs can sleep for 10 minutes or so without a problem. Nothing in the pod spec jumps out at me as related either.
I have the same problem, but it is timing out after 25 minutes.
Please confirm the following
Summary
The Default Job Idle Timeout setting in the AWX job settings is not working properly.
PR #10906 does not seem to work properly when the value is set above 5-6 minutes.
AWX version
AWX 19.5.0
Installation method
kubernetes
Modifications
no
Ansible version
No response
Operating system
No response
Web browser
Firefox, Chrome
Steps to reproduce
@oweel it looks like idle_timeout might not be working too well.
Even though idle_timeout is set to 0, it's still timing out in 5-6 minutes.
minutes: 20
If idle_timeout is set to something like 10 seconds, then it works.
And if idle_timeout is set to something like 9999, it will still time out in 5-6 minutes.
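For anyone trying to reproduce this, here is a minimal playbook sketch. It assumes the `minutes: 20` fragment above came from an `ansible.builtin.pause` task, which is a guess. The play name, task names, and the `localhost` target are illustrative and not taken from the original report.

```yaml
---
# Hypothetical reproduction playbook; names and target host are assumptions.
- name: Stay idle longer than the observed 5-6 minute cutoff
  hosts: localhost
  gather_facts: false
  tasks:
    # The pause produces no further task output for 20 minutes,
    # so the job looks idle for that whole period.
    - name: Wait for 20 minutes
      ansible.builtin.pause:
        minutes: 20

    # Only reached if the job was not terminated during the pause.
    - name: Confirm the job survived the idle period
      ansible.builtin.debug:
        msg: "Job was not terminated while idle"
```

If the job is killed before the final debug task runs, the elapsed time at termination can be compared against the configured idle_timeout value (0, 10, or 9999 in the cases above).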
Expected results
No timeout.
Actual results
The job times out after 5-6 minutes.
Additional information
No response