-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[new feature] introduce Coprocessor #8559
Comments
Nice. I think this calls for a LID though. Would you mind creating a pull request and add the information from this issue in the template? |
Ok, I'll try to write a LID. |
LID: #8616 |
FYI, VictoriaLogs uses bloom filters for speeding up queries over high-cardinality phrases inside log messages such as |
For reference, here is another issue discussing high-cardinality labels: #91 |
Is your feature request related to a problem? Please describe.
ref issue : High cardinality labels
{log_type="service_metrics"} |= "ee74f4ee-3059-4473-8ba6-94d8bfe03272"
We have counted the source distribution of our logql, and 85% of the grafana log explore queries are traceID queries.
Generally, the log time range of traceID is about 10 minutes(trace time= start~end), but because users do not know traceID start time and end time , they usually search for 7 day log. In fact having a time range of "7d-10m" is an invalid search.
So we hope to introduce some auxiliary abilities to solve this "7d-10m" invalid search.
We have checked that in the database field, such feature have been implemented very maturely.
And our team tried to implement the
preQuery Coprocessor
, and achieved great success. Through this feature, we solved the problem of "loki + traceID search is very slow".Thanks Google’s BigTable coprocessor and HBase coprocessor.
HBase coprocessor_introduction link: https://blogs.apache.org/hbase/entry/coprocessor_introduction
The idea of HBase Coprocessors was inspired by Google’s BigTable coprocessors. Jeff Dean gave a talk at LADIS ’09 (http://www.scribd.com/doc/21631448/Dean-Keynote-Ladis2009, page 66-67)
Describe the solution you'd like
Describe alternatives you've considered
The problem of slow traceID search In the past six months, we tried to introduce kv system / reverse index text system/ bloomfilter to speed up logql return, but the machine cost was too high and finally gave up.
Additional context
data:image/s3,"s3://crabby-images/34093/34093ae67ba66adf4e677017149a0c485fb1de33" alt="image"
data:image/s3,"s3://crabby-images/d481f/d481ffbc9526415d9a28d75865742ec167a505ff" alt="image"
"traceID Coprocessor 1 simple text analysis" makes loki return traceID search results in 0.9s.
The text was updated successfully, but these errors were encountered: