pkg/trace/agent: improve NormalizeTag #2951

gbbr · 2019-01-24T15:41:47Z

Improves performance of NormalizeTag

name                      old time/op    new time/op    delta
NormalizeTag/ok-4            129ns ± 0%      69ns ± 0%  -46.74%
NormalizeTag/trim-4          167ns ± 0%      92ns ± 0%  -44.97%
NormalizeTag/trim-both-4     197ns ± 0%     160ns ± 0%  -18.78%
NormalizeTag/plenty-4        193ns ± 0%     147ns ± 0%  -23.83%
NormalizeTag/more-4          261ns ± 0%     264ns ± 0%   +1.15%

name                      old alloc/op   new alloc/op   delta
NormalizeTag/ok-4             136B ± 0%       24B ± 0%  -82.35%
NormalizeTag/trim-4           160B ± 0%       32B ± 0%  -80.00%
NormalizeTag/trim-both-4      176B ± 0%       64B ± 0%  -63.64%
NormalizeTag/plenty-4         160B ± 0%       64B ± 0%  -60.00%
NormalizeTag/more-4           176B ± 0%      128B ± 0%  -27.27%

name                      old allocs/op  new allocs/op  delta
NormalizeTag/ok-4             3.00 ± 0%      2.00 ± 0%  -33.33%
NormalizeTag/trim-4           3.00 ± 0%      2.00 ± 0%  -33.33%
NormalizeTag/trim-both-4      3.00 ± 0%      3.00 ± 0%    0.00%
NormalizeTag/plenty-4         3.00 ± 0%      3.00 ± 0%    0.00%
NormalizeTag/more-4           3.00 ± 0%      4.00 ± 0%  +33.33%

Instead of creating a buffer, create a series of tuples representing cuts that need to happen inside the tag string, and apply them later. If no changes need to happen, the same string is returned without any extra allocations.

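As the benchmarks show, the algorithm gets slower as more cuts are needed: the "more" case is about 1% slower and makes one extra allocation. For the common scenarios with no or few changes (the "ok" and "trim" benchmarks), allocated bytes drop by about 80%, roughly a 5x reduction in memory.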
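To illustrate the cut-based approach described above, here is a minimal sketch. It is not the code from this PR: the `cut` type, `findCuts`, `normalize` and the simplified ASCII-only validity rule are all assumptions made for the example, and it skips trimming and lower-casing.

```go
package main

import (
	"fmt"
	"strings"
)

// cut is a hypothetical half-open byte range [start, end) of the tag that
// should be replaced by a single underscore.
type cut struct{ start, end int }

// isValid reports whether b may appear in a tag. This rule set is a
// simplified assumption for the sketch, not the agent's actual rules.
func isValid(b byte) bool {
	return (b >= 'a' && b <= 'z') || (b >= '0' && b <= '9') || b == ':' || b == '_'
}

// findCuts scans the tag once and records every run of invalid bytes.
func findCuts(tag string) []cut {
	var cuts []cut
	for i := 0; i < len(tag); i++ {
		if isValid(tag[i]) {
			continue
		}
		if n := len(cuts); n > 0 && cuts[n-1].end == i {
			cuts[n-1].end = i + 1 // extend the run that ends right here
			continue
		}
		cuts = append(cuts, cut{i, i + 1})
	}
	return cuts
}

// normalize applies the recorded cuts. When there are none, it returns the
// input string itself, so nothing is copied.
func normalize(tag string) string {
	cuts := findCuts(tag)
	if len(cuts) == 0 {
		return tag
	}
	var sb strings.Builder
	sb.Grow(len(tag))
	prev := 0
	for _, c := range cuts {
		sb.WriteString(tag[prev:c.start])
		sb.WriteByte('_')
		prev = c.end
	}
	sb.WriteString(tag[prev:])
	return sb.String()
}

func main() {
	fmt.Println(normalize("env:prod")) // no cuts: returned unchanged
	fmt.Println(normalize("a b  c"))   // two cuts: prints a_b_c
}
```

In this sketch the list of cuts grows with the number of invalid runs, which is consistent with the trade-off noted above: inputs that need many cuts pay for the bookkeeping, while clean inputs pay almost nothing.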

codecov-io · 2019-01-24T16:04:05Z

Codecov Report

Merging #2951 into master will increase coverage by 0.06%.
The diff coverage is 96.77%.

@@            Coverage Diff             @@
##           master    #2951      +/-   ##
==========================================
+ Coverage   56.45%   56.51%   +0.06%     
==========================================
  Files         482      482              
  Lines       34185    34223      +38     
==========================================
+ Hits        19298    19340      +42     
+ Misses      13728    13724       -4     
  Partials     1159     1159

Impacted Files	Coverage Δ
pkg/trace/agent/tags.go	`72.27% <96.77%> (+7.98%)`	⬆️
pkg/trace/writer/stats.go	`89.69% <0%> (-1.04%)`	⬇️
pkg/trace/writer/trace.go	`91.66% <0%> (+0.49%)`	⬆️
pkg/logs/auditor/auditor.go	`71.83% <0%> (+0.7%)`	⬆️

LotharSee · 2019-01-24T17:33:46Z

pkg/trace/agent/tags.go

 			continue
 		}

 		c = unicode.ToLower(c)
 		switch {
-		// handle always valid cases
 		case unicode.IsLetter(c) || c == ':':


Could we merge the 1st and 3rd cases together?

gbbr · 2019-01-28

Actually, there is a catch here: switch cases are evaluated top to bottom, and the idea is that you cannot get past the second case (length = 0) if the character isn't a letter or a colon. In other words, only letters or colons can start the tag name.

If we merge the two, it will alter this rule.
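The ordering argument can be sketched as follows. The helper names and the contents of the third case (digits and a few punctuation characters) are assumptions for illustration; only the first case and the "length = 0" guard come from the discussion above.

```go
package main

import (
	"fmt"
	"unicode"
)

// classify is a hypothetical helper that mirrors the case order discussed
// above. n is the number of characters already accepted into the tag.
func classify(c rune, n int) string {
	switch {
	case unicode.IsLetter(c) || c == ':':
		return "keep" // valid anywhere, including as the first character
	case n == 0:
		return "skip" // nothing accepted yet, so c cannot start the tag
	case unicode.IsDigit(c) || c == '.' || c == '/' || c == '-':
		return "keep" // assumed set: valid only after the first character
	default:
		return "replace"
	}
}

// classifyMerged folds the third case into the first. The n == 0 guard is
// now evaluated too late, so a digit is allowed to start the tag.
func classifyMerged(c rune, n int) string {
	switch {
	case unicode.IsLetter(c) || c == ':' ||
		unicode.IsDigit(c) || c == '.' || c == '/' || c == '-':
		return "keep"
	case n == 0:
		return "skip"
	default:
		return "replace"
	}
}

func main() {
	fmt.Println(classify('1', 0))       // skip
	fmt.Println(classifyMerged('1', 0)) // keep
}
```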

LotharSee · 2019-01-24T17:39:33Z

pkg/trace/agent/tags.go

-			buf.WriteRune(c)
-			lastWasUnderscore = false
+			// lower-case
+			norm[i] += 'a' - 'A'


Nit: we can put +32 directly, noting that it moves an ASCII upper-case letter to its ASCII lower-case equivalent.

erichgess · 2019-01-24

The compiler should compute this expression at compile time, so we can leave it as is, which I think reads better.

gbbr · 2019-01-28

I agree with @erichgess here. I liked it this way and I even checked how the standard library does it, so I'd rather keep it if that's ok: https://github.com/golang/go/blob/05f8b44d5edc2960eff106e5e780cf83535d0533/src/unicode/letter.go#L267
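A small sketch of the point made in this thread: `'a' - 'A'` is a constant expression, so it costs nothing at run time compared with writing 32. The names `asciiDelta` and `lowerASCII` are hypothetical and exist only for this example.

```go
package main

import "fmt"

// asciiDelta is an untyped constant evaluated at compile time; its value is 32.
const asciiDelta = 'a' - 'A'

// lowerASCII lower-cases ASCII upper-case letters in place and leaves every
// other byte untouched.
func lowerASCII(b []byte) {
	for i := range b {
		if b[i] >= 'A' && b[i] <= 'Z' {
			b[i] += asciiDelta
		}
	}
}

func main() {
	b := []byte("Env:PROD_1")
	lowerASCII(b)
	fmt.Println(string(b), asciiDelta) // prints: env:prod_1 32
}
```

Note that this only handles single-byte ASCII letters; multi-byte UTF-8 characters pass through unchanged.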

gbbr · 2019-01-28T09:05:39Z

I'd say this is good to review if you'd like to take another look @LotharSee

gbbr · 2019-01-29T09:30:39Z

This PR has a bug that was fixed in #2957.

gbbr added do-not-merge/WIP team/agent-apm trace-agent labels Jan 24, 2019

gbbr added this to the 6.10.0 milestone Jan 24, 2019

gbbr requested a review from a team as a code owner January 24, 2019 15:41

gbbr force-pushed the gbbr/normalize-tag branch from 3fffb4a to c10955b Compare January 24, 2019 15:53

LotharSee reviewed Jan 24, 2019

LotharSee reviewed Jan 24, 2019

gbbr added 2 commits January 28, 2019 10:36

pkg/trace/agent: improve NormalizeTag

01258f0

Address comment.

3c3fe1d

gbbr force-pushed the gbbr/normalize-tag branch from c10955b to 3c3fe1d Compare January 28, 2019 08:38

gbbr removed the do-not-merge/WIP label Jan 28, 2019

LotharSee approved these changes Jan 28, 2019

gbbr merged commit c6012c3 into master Jan 28, 2019

gbbr deleted the gbbr/normalize-tag branch January 28, 2019 10:24

gbbr mentioned this pull request Jan 29, 2019

pkg/trace/agent: consider UTF-8 characters in tags #2957

Merged

gbbr mentioned this pull request Apr 11, 2019

pkg/trace/api: improve normalizeTag algorithm #3289

Merged
