[sonic-snmpagent] AgentX TCP Connection is being terminated when blocking=True arg is set #10310
Labels
Request for 202111 Branch
For PRs being requested for 202111 branch
Triaged
this issue has been triaged
Description
When blocking=True is used and the data is not available in Redis, the corresponding data-fetching coroutines are eating up time and not giving enough time for the
coroutine which maintains the TCP connection to AgentX Socket and thus the connection is getting terminated and eventually causing the failure of SNMP queries.
This SNMP query failure is also reported here:
#9996
Triage:
It clearly took 4 mins for the connection_routine to finish TCP handshake, and so the same behavior is expected when the Transport coroutine has to handle and respond to any incoming data. https://github.com/Azure/sonic-snmpagent/blob/master/src/ax_interface/socket_io.py#L149
I've verified this behavior by removing the Updater Instances which are throwing the following exceptions,
and the snmp queries started to work.
Solution:
This PR sonic-net/sonic-snmpagent#246 fixes the issue temporarily but as a long term solution all the blocking=True arguments in the subagent repo should be avoided.
sonic_dump_qa-eth-vt05-1-2410_20220318_131013 (1).tar.gz
The text was updated successfully, but these errors were encountered: