Improve Analyze Performance and Stability #41930
Labels
component/statistics
sig/planner
type/enhancement
Enhancement
Currently, when we use the analyze command to collect statistics, we run into several problems, especially for large tables:
We can raise the concurrency (`tidb_build_stats_concurrency` and `tidb_distsql_scan_concurrency`) to speed up analyze. However, that may consume a lot of CPU/memory/IO on TiKV (when scanning the table and sampling) and a lot of CPU/memory on TiDB (when merging samples and building stats). Here is the related issue in the tikv repo:
tikv/tikv#14231
Tasks
Use faster murmur3 hash function for FMSketch calculation
Reduce encoding cost
Avoid FMSketch calculation for single-column index
Sample-based NDV calculation
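
For readers unfamiliar with the term, several of the tasks above concern FMSketch, a structure for estimating the number of distinct values (NDV) in a column. The following is only a rough Go sketch of the general Flajolet-Martin-style idea, not TiDB's actual code. The names `FMSketch`, `NewFMSketch`, `Insert`, `NDV`, and `maxSize` are hypothetical, and FNV-1a from the standard library stands in for murmur3 so the example has no external dependencies.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// FMSketch is a simplified, illustrative distinct-value sketch.
// It keeps only hashes whose low bits (selected by mask) are all zero.
type FMSketch struct {
	mask    uint64
	maxSize int // hypothetical capacity that triggers a mask widening
	hashes  map[uint64]struct{}
}

// NewFMSketch returns an empty sketch with the given capacity.
func NewFMSketch(maxSize int) *FMSketch {
	return &FMSketch{maxSize: maxSize, hashes: make(map[uint64]struct{})}
}

// hash64 uses FNV-1a as a stand-in; the issue proposes murmur3 instead.
func hash64(b []byte) uint64 {
	h := fnv.New64a()
	h.Write(b)
	return h.Sum64()
}

// insertHash records a hash if it passes the current mask. When the set
// grows past maxSize, the mask is widened by one bit and hashes that no
// longer pass are dropped (about half of them, for a uniform hash).
func (s *FMSketch) insertHash(h uint64) {
	if h&s.mask != 0 {
		return
	}
	s.hashes[h] = struct{}{}
	if len(s.hashes) > s.maxSize {
		s.mask = s.mask*2 + 1
		for k := range s.hashes {
			if k&s.mask != 0 {
				delete(s.hashes, k)
			}
		}
	}
}

// Insert hashes a value and feeds it to the sketch.
func (s *FMSketch) Insert(value []byte) {
	s.insertHash(hash64(value))
}

// NDV estimates the distinct count: each retained hash represents
// (mask+1) values, since only 1 in (mask+1) hashes passes the mask.
func (s *FMSketch) NDV() uint64 {
	return (s.mask + 1) * uint64(len(s.hashes))
}

func main() {
	s := NewFMSketch(1000)
	for i := 0; i < 10000; i++ {
		s.Insert([]byte(fmt.Sprintf("row-%d", i)))
	}
	fmt.Println("estimated NDV:", s.NDV())
}
```

In a structure like this, every inserted value is hashed once, which is why the speed of the hash function and the cost of encoding values before hashing both matter for large tables. While the number of distinct values stays at or below `maxSize`, the estimate is exact; beyond that it becomes an approximation.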