Improve encoding performance #81

yaauie · 2020-04-06T15:10:32Z

This change-set replaces character-by-character input sanitizers with equivalent `gsub` expressions, improving performance, especially when many fields are referenced.

For example, when used with the TCP Output and simple field values, I was able to achieve the following maximum throughputs on a 16-worker pipeline:

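| fields | char-by-char | gsub inline | gsub constantized |
|--------|--------------|-------------|-------------------|
| 1      | 45k          | 47k         | 49k               |
| 12     | 18k          | 28k         | 30k               |
| 100    | 3.5k         | 7.5k        | 8.0k              |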
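
For readers unfamiliar with the three variants named in the table, here is a minimal sketch of what each approach might look like. It is not the plugin's actual code: the method names, the constant name, and the choice of the alphanumeric pattern (borrowed from the diff under review) are assumptions made for illustration only.

```ruby
# Illustrative sketch only; all names below are hypothetical and do not
# come from lib/logstash/codecs/cef.rb.

# Pattern held in a constant so every call reuses the same Regexp object.
NON_ALNUM_PATTERN = /[^a-zA-Z0-9]/.freeze

# Variant 1: character-by-character sanitizer.
# Walks the string and appends each allowed character to a new buffer.
def sanitize_char_by_char(value)
  output = String.new
  value.to_s.each_char do |char|
    output << char if char =~ /[a-zA-Z0-9]/
  end
  output
end

# Variant 2: gsub with the pattern written inline at the call site.
def sanitize_gsub_inline(value)
  value.to_s.gsub(/[^a-zA-Z0-9]/, "")
end

# Variant 3: gsub with the pattern taken from a constant.
def sanitize_gsub_constantized(value)
  value.to_s.gsub(NON_ALNUM_PATTERN, "")
end
```

All three return the same string for the same input; they differ only in how much per-character work happens in Ruby code versus inside the regex engine, which is the kind of difference the throughput numbers above would reflect.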

colinsurprenant

LGTM

lib/logstash/codecs/cef.rb

yaauie · 2020-04-07T19:34:03Z

@colinsurprenant can you take one more look? I refactored since your LGTM to get even better performance.

colinsurprenant · 2020-04-07T20:06:15Z

lib/logstash/codecs/cef.rb

-    value = value.to_s.gsub(/[^a-zA-Z0-9]/, "")
-    return value
+    value.to_s
+         .gsub(/[^a-zA-Z0-9]/, "")


cosmetic: newline might not be necessary here.

colinsurprenant · 2020-04-07T20:09:08Z

LGTM²

yaauie assigned elasticsearch-bot Apr 6, 2020

colinsurprenant unassigned elasticsearch-bot Apr 6, 2020

colinsurprenant self-requested a review April 6, 2020 16:51

colinsurprenant approved these changes Apr 6, 2020


kares reviewed Apr 6, 2020

lib/logstash/codecs/cef.rb (outdated, resolved)

lib/logstash/codecs/cef.rb (outdated, resolved)

yaauie added 2 commits April 6, 2020 22:27

perf: replace char-by-char sanitizer with targeted gsub

2bd2439

noop/style: skip useless rebinding of local variable

6e37f4c

yaauie force-pushed the encode-performance branch from 56f7d6a to 6e37f4c Compare April 6, 2020 22:28

colinsurprenant reviewed Apr 7, 2020


changelog & version bump for 6.1.1 release

9b74838

yaauie merged commit 2a8edcd into logstash-plugins:master Apr 9, 2020

yaauie deleted the encode-performance branch April 9, 2020 01:27
