High Read Disk I/O recently??? #21809
Comments
Is your RPC open to the outside? Can you provide some logs? I see you have --metrics on - can you share them?
Hey @ligi, thanks for looking into this. Also, one thing I've noticed: every time I restart geth, I get a massive spike of disk I/O for 2 minutes (write ~130MB/s and read 70MB/s).
That's geth flushing a lot of in-memory data structures (cached trie nodes) to disk; that's to be expected.
That's.. not to be expected... :/
Do you guys know if those numbers are expected?
I'm investigating an archive node where the leveldb database has, for some reason, placed all files in level0, thus totally failing to do compaction. This causes massive IO overhead. I would be interested to find out if your node has a similar issue. I'll send a link to a PR which has a command for checking the db stats.
Could you please give this one a test-run: #21887? It should be quick to run, and will tell us if the leveldb database is in a somewhat 'sane' state.
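For readers who want to judge such db stats themselves, here is a minimal sketch of the level0 check described above. It is not the command from #21887. It assumes the stats are printed as a pipe-separated table whose first column is the level number and whose second column is the table count; that layout, and the function names, are assumptions made for illustration.

```python
def tables_per_level(stats_text):
    """Map level number -> table count from a pipe-separated stats table.

    Assumed row layout: `level | tables | ...`. Header, separator and
    total rows are skipped because their first column is not a number.
    """
    counts = {}
    for line in stats_text.splitlines():
        cols = [c.strip() for c in line.split("|")]
        if len(cols) >= 2 and cols[0].isdigit() and cols[1].isdigit():
            counts[int(cols[0])] = int(cols[1])
    return counts


def level0_share(counts):
    """Fraction of all tables that sit in level 0 (0.0 if there are none)."""
    total = sum(counts.values())
    return counts.get(0, 0) / total if total else 0.0
```

A share close to 1.0 would match the situation described for the archive node, where nearly everything sits in level0 and compaction is not keeping up.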
@holiman thanks for looking into it. Here are some results:
EDIT: Unfortunately, while I was running those commands, my disk I/O went back to normal.
Hey @holiman, I just ran the command, as my disk I/O is again averaging 20-30MB/s.
@holiman we are experiencing this issue as well across all of our full nodes, regardless of whether there are active client connections. We are regularly seeing sustained read throughput of around 100 MB/s. This is happening for client versions 1.9.21 and 1.9.24, both compiled with go1.15.5 and running on Linux.
We're always looking into how to reduce the disk IO, but since this bug is fairly old, I don't see any reason to keep it open any longer.
System information
Geth version: 1.9.23-stable
OS & Version: Arch Linux
Expected behaviour
For the past month, I have been seeing an average of under 1MB/s for both read and write disk I/O.
Actual behaviour
Recently (over the last 4 days or so), I have been seeing an average of 20MB/s of read disk I/O.
Running
sudo iotop
clearly shows that geth is responsible for this. The weird thing is that I've tried restarting geth with different parameters (such as turning off --metrics --pprof), and the same thing happens. However, about 1 restart out of 15-20, the geth node does normal read and write I/O (under 1MB/s), and as long as I don't restart the process, it keeps working fine.
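To put a number on what iotop shows, one option on Linux is to sample the kernel's per-process counters in /proc/<pid>/io twice and take the difference. The sketch below is only an illustration: the function names are made up here, and reading another process's counters generally requires running as the same user or as root.

```python
import time
from pathlib import Path


def parse_proc_io(text):
    """Parse the 'key: value' lines of /proc/<pid>/io into a dict of ints."""
    counters = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip().isdigit():
            counters[key.strip()] = int(value)
    return counters


def throughput_mb_s(before, after, seconds):
    """Return (read, write) rates in MB/s between two counter snapshots."""
    mb = 1_000_000
    read = (after["read_bytes"] - before["read_bytes"]) / seconds / mb
    write = (after["write_bytes"] - before["write_bytes"]) / seconds / mb
    return read, write


def sample(pid, interval=5.0):
    """Sample one process twice, `interval` seconds apart (Linux only)."""
    path = Path(f"/proc/{pid}/io")
    before = parse_proc_io(path.read_text())
    time.sleep(interval)
    after = parse_proc_io(path.read_text())
    return throughput_mb_s(before, after, interval)

# Example (12345 is a placeholder for the geth PID):
#   read, write = sample(12345)
```

The read_bytes and write_bytes counters track I/O that actually reached the storage layer, so they are the closest match to the disk figures quoted in this report.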