What happened
Metrics exported by the EMF exporter were lost for 1-3 minutes at a time during long runs, causing gaps in the dashboard. Each gap was followed by a false burst of metrics, after which the collector recovered from the stall on its own.
Metrics were usually reported at a constant rate, but while the collector was stuck, the rate was unsteady.
After adding more debug logs, we found that it took 30 seconds on average to process one chunk of metrics (e.g. the metrics from one Pod when using the prometheus receiver).
At the times when metrics were lost, the same process took 3 minutes.
Root cause
Although metrics are grouped before they are pushed, every logEventBatch is sent one at a time. The network latency of each request adds up, which drags down performance.
How to solve it
Push EMF logs in batches.
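To illustrate the idea, here is a minimal, language-neutral sketch in Python of grouping log events so that many events share one request. It is not the exporter's actual code: `batch_events` and `event_size` are hypothetical names. The limits are the documented CloudWatch Logs `PutLogEvents` quotas (1,048,576 bytes and 10,000 events per request, with 26 bytes of overhead counted per event). The sketch ignores other constraints such as timestamp ordering.

```python
from typing import Iterable, Iterator, List

# CloudWatch Logs PutLogEvents quotas (documented service limits).
MAX_BATCH_BYTES = 1_048_576   # max total payload per request
MAX_BATCH_EVENTS = 10_000     # max events per request
PER_EVENT_OVERHEAD = 26       # bytes counted per event on top of the message


def event_size(message: str) -> int:
    """Size counted for one event: UTF-8 bytes plus fixed overhead."""
    return len(message.encode("utf-8")) + PER_EVENT_OVERHEAD


def batch_events(
    messages: Iterable[str],
    max_bytes: int = MAX_BATCH_BYTES,
    max_events: int = MAX_BATCH_EVENTS,
) -> Iterator[List[str]]:
    """Group messages into batches that each fit in a single request.

    Hypothetical helper for illustration only.
    """
    batch: List[str] = []
    batch_bytes = 0
    for message in messages:
        size = event_size(message)
        if size > max_bytes:
            raise ValueError("single event exceeds the batch byte limit")
        # Flush the current batch if adding this event would break a limit.
        if batch and (batch_bytes + size > max_bytes or len(batch) >= max_events):
            yield batch
            batch, batch_bytes = [], 0
        batch.append(message)
        batch_bytes += size
    if batch:
        yield batch  # flush the final, partially filled batch


# 25 small EMF log lines, with a deliberately low event limit for the demo.
emf_logs = ['{"pod":"a"}'] * 25
requests = list(batch_events(emf_logs, max_events=10))
print(len(requests))  # 3 requests instead of 25
```

With one request per batch instead of one per event, the number of network round trips drops roughly in proportion to the batch size, which is where the latency saving would come from.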