
Feature request - multi-threaded indexing and search #227

Open
denis-bogdanas opened this issue Jun 19, 2018 · 10 comments

@denis-bogdanas

Please consider implementing indexing/search on multiple threads to take advantage of SSD speed. This would greatly speed up operations on 10 GB+ files.

My benchmarks: I have a Samsung 850 EVO SSD, and both indexing and search are CPU-limited on a single thread. SSD read speed on a large file is ~340 MB/s during indexing and ~210 MB/s when searching for a short word. The benchmarked sequential read speed of my SSD is 450 MB/s, and there are NVMe SSDs on the market with read speeds on the order of 2-3 GB/s.

EmEditor, by the way, is faster at both operations, but is still CPU-limited on a single thread.

@variar
Contributor

variar commented Jun 19, 2018

@denis-bogdanas could you benchmark klogg under the same conditions? In this fork I've added parallel search and tuned the indexing code a bit. Precompiled builds are available on bintray.

Also "matches overview" is updated on UI thread, turning it off should affect benchmark results.

@denis-bogdanas
Author

@variar What version? Is klogg-17.12.0.245-setup.exe OK? I would rather not mess with compiling from source.

@variar
Contributor

variar commented Jun 19, 2018

Either klogg-17.12.0.245-setup.exe or klogg-17.12.0.245-portable.zip. The portable version does not require any installation; just unpack the zip archive.

These precompiled binaries are only for 64-bit systems.

@denis-bogdanas
Author

Results for klogg:
File loading: 320-325 MB/s.
klogg CPU usage: 24-25%, i.e. one full core.
My PC: quad-core i5 2500K @ 4.4 GHz, single-channel 16 GB DDR3-1600 RAM, CAS 11.

Searching for the word "aziza": 264-286 MB/s, average ~270 MB/s.
klogg CPU usage: 20%, less than one core.

Summary: 9% slower indexing and 30% faster searching than glogg, but still no sign of parallelism.

@variar
Contributor

variar commented Jun 19, 2018

Thanks for the feedback. Several things come to mind:

  1. The QFile::open API does not allow passing flags for sequential access, so the OS can't optimize caching and prefetching. I think some platform-specific functions for opening and reading files should be used (see the sketch at the end of this comment). This affects both glogg and klogg.

  2. For klogg, the default parameters for indexing and parallel search may be suboptimal. During index generation, file reading is done with a 5 MiB buffer, which might be too small on an SSD. When searching, klogg reads 5000 lines at a time and splits them between all available cores for regex matching; this might also be too small for a quad-core 2500K and an SSD. The defaults keep memory usage low. These parameters can be changed in the Tools->Options->Advanced tab.

I'll try to hack together some benchmarking code and make a PR at least for the QFile replacement.
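
For illustration, here is a minimal sketch of how a sequential-access hint could be passed when opening the file, which is what QFile::open does not expose. The helper name openForSequentialRead is made up, and this is not the actual glogg/klogg code:

```cpp
#ifdef _WIN32
#include <windows.h>
#else
#include <fcntl.h>
#endif

// Hypothetical helper: open a file and hint the OS that it will be
// read sequentially, so it can prefetch more aggressively.
#ifdef _WIN32
HANDLE openForSequentialRead( const wchar_t* path )
{
    // FILE_FLAG_SEQUENTIAL_SCAN tells the Windows cache manager to
    // read ahead and drop pages behind the current position.
    return CreateFileW( path, GENERIC_READ, FILE_SHARE_READ, nullptr,
                        OPEN_EXISTING, FILE_FLAG_SEQUENTIAL_SCAN, nullptr );
}
#else
int openForSequentialRead( const char* path )
{
    int fd = open( path, O_RDONLY );
    if ( fd >= 0 ) {
        // Advise the kernel that reads will be sequential, so it can
        // enlarge its readahead window for this file descriptor.
        posix_fadvise( fd, 0, 0, POSIX_FADV_SEQUENTIAL );
    }
    return fd;
}
#endif
```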

@denis-bogdanas
Author

I'm using Process Explorer to get the numbers, by the way.

Data for EmEditor: indexing ~425 MB/s, search ~309 MB/s, one core used.

@denis-bogdanas
Author

I tried a 50K-line buffer for search. It's even a bit slower.

From your description it looks like reading and processing are done sequentially: you first read a buffer, then process it, and while processing is going on the SSD sits idle. The way I'd do it is to have one thread that reads buffers into a synchronized queue, several worker threads that take buffers from the queue and process them, and another thread that combines the results (see the sketch below).
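
A minimal sketch of that reader/worker split, using a mutex-protected chunk queue. This only illustrates the idea, not klogg's actual pipeline; the file name, chunk size, and newline-counting work are placeholders:

```cpp
#include <algorithm>
#include <condition_variable>
#include <fstream>
#include <mutex>
#include <queue>
#include <string>
#include <thread>
#include <vector>

// Minimal synchronized queue of raw file chunks.
struct ChunkQueue {
    std::queue<std::string> chunks;
    std::mutex mutex;
    std::condition_variable cv;
    bool done = false;

    void push( std::string chunk )
    {
        {
            std::lock_guard<std::mutex> lock( mutex );
            chunks.push( std::move( chunk ) );
        }
        cv.notify_one();
    }

    // Returns false once the reader is done and the queue is drained.
    bool pop( std::string& chunk )
    {
        std::unique_lock<std::mutex> lock( mutex );
        cv.wait( lock, [this] { return done || !chunks.empty(); } );
        if ( chunks.empty() )
            return false;
        chunk = std::move( chunks.front() );
        chunks.pop();
        return true;
    }

    void finish()
    {
        {
            std::lock_guard<std::mutex> lock( mutex );
            done = true;
        }
        cv.notify_all();
    }
};

int main()
{
    ChunkQueue queue;
    constexpr size_t kChunkSize = 8 * 1024 * 1024; // 8 MiB readahead chunks

    // Reader thread: keeps the SSD busy regardless of how fast processing is.
    std::thread reader( [&queue] {
        std::ifstream file( "big.log", std::ios::binary ); // placeholder file name
        std::string chunk( kChunkSize, '\0' );
        while ( file.read( &chunk[0], kChunkSize ) || file.gcount() > 0 ) {
            chunk.resize( static_cast<size_t>( file.gcount() ) );
            queue.push( chunk );
            chunk.resize( kChunkSize );
        }
        queue.finish();
    } );

    // Worker threads: count newlines in each chunk as a stand-in for
    // real indexing or regex-matching work.
    const size_t numWorkers =
        std::max<size_t>( 1, std::thread::hardware_concurrency() );
    std::vector<size_t> counts( numWorkers, 0 );
    std::vector<std::thread> workers;
    for ( size_t i = 0; i < numWorkers; ++i ) {
        workers.emplace_back( [&queue, &counts, i] {
            std::string chunk;
            while ( queue.pop( chunk ) )
                counts[i] += static_cast<size_t>(
                    std::count( chunk.begin(), chunk.end(), '\n' ) );
        } );
    }

    reader.join();
    for ( auto& w : workers )
        w.join();
    // A final step would merge the per-worker results here.
}
```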

@variar
Contributor

variar commented Jun 21, 2018

I've done some research. Adding posix_fadvise(..., POSIX_FADV_SEQUENTIAL) actually makes file reading slower on my PC. I'll test FILE_FLAG_SEQUENTIAL_SCAN on Windows later.

Adding a separate thread to read data from the file and pass it on for indexing improves file loading time. You can try the new portable build. Use something like 8-16 MiB for the file loading buffer in the settings (this now sets the "readahead" buffer, not the size of the chunk read from disk at a time).

The index is still built in one thread, but I'll try to make it parallel.

@denis-bogdanas
Author

Thanks! I'm actually using EmEditor, at least until the trial expires. But superior performance is something that might attract many other users. Hell, it might become a real-life benchmarking tool for SSD performance.

@variar
Contributor

variar commented Jan 22, 2019

FYI, switching from a naive loop to std::memchr when searching for line breaks and tabs makes the initial file load IO-bound on my 850 EVO. The changes are needed in only one loop; I'll try to cherry-pick them to glogg without all my multi-threaded reading/processing experiments.
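
For context, the difference between the two approaches is roughly the following. This is a simplified illustration (newlines only), not the actual glogg/klogg indexing loop:

```cpp
#include <cstring>
#include <vector>

// Naive scan: a scalar byte-by-byte loop that compilers often don't
// vectorize well, so it can become the CPU bottleneck.
std::vector<size_t> findLineBreaksNaive( const char* data, size_t size )
{
    std::vector<size_t> positions;
    for ( size_t i = 0; i < size; ++i )
        if ( data[i] == '\n' )
            positions.push_back( i );
    return positions;
}

// memchr-based scan: libc's memchr is typically SIMD-optimized, so the
// search runs close to memory bandwidth and the load becomes IO-bound.
std::vector<size_t> findLineBreaksMemchr( const char* data, size_t size )
{
    std::vector<size_t> positions;
    const char* pos = data;
    const char* end = data + size;
    while ( ( pos = static_cast<const char*>(
                  std::memchr( pos, '\n', static_cast<size_t>( end - pos ) ) ) ) ) {
        positions.push_back( static_cast<size_t>( pos - data ) );
        ++pos;
    }
    return positions;
}
```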
