I can't say for certain since the backtraces are incomplete, but I believe cherry-picking the following patch will resolve the issue. It should be in the next point release, 0.6.5.5.
zfs snapshot got stuck again on the same machine. The snapshot runs every hour, and the one that is stuck is from 16:00. I have not applied any patch yet (I may do so now; I don't want my snapshots to fail every ~5 days).
Hello,
I had a strange issue on a server: during a resilver, zfs snapshot started to get stuck and never finished. It took us some time to notice, and by the end there were 73 blocked zfs snapshot processes.
The datasets stayed available until the very end, when every process accessing any dataset on the zpool finally became blocked (apache httpd, postgresql, etc.). I tried to stop them, but they were impossible to kill. I also tried to kill the zfs snapshot processes, with no luck either.
In the kernel log, I found what I think is the origin of the issue, from several days earlier: http://apaste.info/ue5
Result of useful commands:
zpool status: http://apaste.info/1yL
zpool get all: http://apaste.info/PGG
List of installed ZFS Debian packages (the server runs Wheezy): http://apaste.info/OFN
Stack of a blocked zfs snapshot process: http://apaste.info/JoO
I had to hard reboot the server to get it back (it was impossible to shut it down otherwise).
I'm not sure what other information I can provide.
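Since it took a while to notice the stuck snapshots, a small check for `zfs` processes in uninterruptible sleep (state `D` in `/proc/<pid>/stat`) might help catch this earlier. This is only a minimal sketch, assuming Linux procfs. The function names `parse_stat` and `blocked_processes` are made up for illustration, and state `D` alone does not prove a hang, since processes pass through it briefly during normal I/O.

```python
import os


def parse_stat(text):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the last ')' instead of splitting on whitespace.
    head, _, tail = text.rpartition(")")
    pid_str, _, comm = head.partition("(")
    state = tail.split()[0]
    return int(pid_str), comm, state


def blocked_processes(proc_root="/proc", name="zfs"):
    """List (pid, cmdline) for processes named `name` that are in state D."""
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "self" or "meminfo"
        try:
            with open(os.path.join(proc_root, entry, "stat")) as f:
                pid, comm, state = parse_stat(f.read())
            with open(os.path.join(proc_root, entry, "cmdline"), "rb") as f:
                raw = f.read()
        except (OSError, ValueError, IndexError):
            continue  # process exited mid-scan or the stat line was unreadable
        if comm == name and state == "D":
            # cmdline arguments are NUL-separated; join them with spaces.
            cmdline = raw.replace(b"\0", b" ").decode(errors="replace").strip()
            found.append((pid, cmdline))
    return sorted(found)
```

Calling `blocked_processes()` from an hourly cron job and alerting when the same PID shows up in two consecutive runs would be one way to tell a real hang from ordinary I/O wait.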