-
-
Notifications
You must be signed in to change notification settings - Fork 346
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bugfix for heartbeat memory leak #236
Bugfix for heartbeat memory leak #236
Conversation
Resolver code was causing memory leak during ping-pong
Pull Request Test Coverage Report for Build 445
💛 - Coveralls |
Thanks again for tracking this down @tsightler. I see now the precise source of the leak, Line 396 in c72cda1
We're never calling our |
Makes sense, I probably should have seen that. I'll make that change in my local tree and let it run and report back. |
Took a quick stab at this, but honestly, the resolver code seems totally broken to me. It immediately failed with errors that _resolver wasn't a function. I added the following console.log outputs to the _packetHandler() to see what was going on:
I then fired up the script and started my init script, which adds a total of 4 Tuya devices, at 2 second intervals. This was what was logged on the console:
And the number of entries in resolvers just keeps increasing. Sometimes I see one and then it is deleted, but mostly they just keep incrementing. This seems completely broken and adding a delete doesn't seem likely to work. |
Just a little more digging, it seems obvious to me now that _sendPing increments and adds a sequence number to the packet and _send adds a resolver with the current sequence number. However, in the received pong, the sequence number is always zero, and I can't see any way to associate that response with an existing sequence number to know which resolver to resolve and delete. To me, bypassing the resolver for heartbeat, as this patch does, is simplest solution that doesn't seem to have any chance to break anything. It doesn't fix the resolver code, which still appears to be totally broken (issuing commands constantly increases the number of entries in _resolver), so there will still be some leak, but commands are generally issued far less frequently that every 10 seconds per device and the leak is so small it would likely take months for most users to see this. I see two options for a complete fix:
In the interim, I've worked around this issue in my own code by disconnecting and reconnecting all devices every hour. Crude but effective. |
OK @tsightler, good points. In light of this I agree, we should do the quick fix, and then we'll work on removing or rewriting resolvers. |
Just to clear a few points up:
The above is a feature, not a necessity. I'll merge this in for now (CI is failing but it looks like an issue with Travis, not the code), though if we continue to have similar problems I'm perfectly happy to completely strip out the resolver code and revert Moving forward, this should be completely fixed in v6 if we're storing the device state internally as @kueblc has suggested. |
Also, please open PRs against the development branch in the future @tsightler. I think there's a note about that in CONTRIBUTING.md. |
Published in v5.1.2. |
Great job guys! I updated my adapter too and will receive user feedback soon |
This small patch effectively bypasses the resolver code for heartbeat packets and appears to correct the significant memory leak seen, especially when there are a lot of devices.
First contribution to this project so apologize if it's wrong.