You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Ideally we would optimize indexes with the "best_compression" codec. This uses less disk space (~53% of the space for apache log data, and should improve to ~43% in the future) but the tradeoff is that access to the _source and lucene stored fields are slower.
note, as the option currently works, space savings only apply to stored fields data (not postings lists or term dictionaries for indexed fields), but today this is still typically the largest portion of the index.
So overall, we can think of current space savings of something like ~20-25% on average. But for many use-cases this is still a good trade-off.
Alternatively, we expect many users will want to utilize a “hot/warm” architecture using the shard allocation filtering feature. In this scenario, time-based indexes on hot nodes can be configured to be created with the default LZ4 compression; when they’re migrated to the warm nodes (configured with index.codec: best_compression in elasticsearch.yml), the indexes can be compressed by optimizing that index. It may be preferable to pay the CPU penalty of compression during this optimize process (which we often recommend executing when the cluster is known to be less utilized) than at the time of initial indexing.
With this being the case, it seems that Curator's existing shard allocation functionality will address this need.
Ideally we would optimize indexes with the "best_compression" codec. This uses less disk space (~53% of the space for apache log data, and should improve to ~43% in the future) but the tradeoff is that access to the _source and lucene stored fields are slower.
Maybe this can somehow be an optional part of the optimize process? Optimizing in this way should not be noticeably longer (my guestimate: ~ 5%-10% on average) as the cost of merging the stored fields from lz4 to deflate takes only 19% longer after https://issues.apache.org/jira/browse/LUCENE-6115 and https://issues.apache.org/jira/browse/LUCENE-6141.
See elastic/elasticsearch#8863 for more information. This only applies to elasticsearch 2.0 or higher.
The text was updated successfully, but these errors were encountered: