[processor/traces] Action/sampling on resources attributes based on span attributes #20294

yehaotian · 2023-03-23T18:52:29Z

Component(s)

traces related processors

Is your feature request related to a problem? Please describe.

Usecase:
We have some debug span attributes and would like to 100% sample it in tailsamplingprocessor policy. At this moment tail sampling processor does not support policy on span attributes but only on resource and record string_attribute.

Currently we do not have status code on span, so that the workaround is using spanprocessor to set ERROR status when match the debug span attribute and leverage status_code policy for 100% sampling.
However, we have a new case need to send the trace(debug trace) contains the debug span attribute to a different destination instead sending all ERROR cases, how can we use the current attribute/filter/span/resources/tailsampling processors to achieve that?

Note: The debug span attribute only exist in the root span, not all spans in the trace.

Describe the solution you'd like

If the current processors cannot make it happen, several solutions can be applied:

Have spanprocessor able to set resource and record attributes instead of just status code.
Have resourcesprocessor able to insert/set resource and record attributes when match span attributes.
Have attributesprocessor able to insert/set resource and record attributes when match span attributes.
Have tailsamplingprocessor policy on span attributes (unlikely due to performance?)
...

Describe alternatives you've considered

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

yehaotian · 2023-03-23T18:53:52Z

cc: @TylerHelmuth

github-actions · 2023-03-23T20:03:41Z

Pinging code owners for processor/tailsampling: @jpkrohling. See Adding Labels via Comments if you do not have permissions to add labels yourself.

TylerHelmuth · 2023-03-23T20:05:44Z

@jpkrohling I'm not familiar with the tailsamplingprocessor config, but when I hear "I wish I had access to this piece of telemetry" I think of OTTL. Is there a place in tailsamplingprocessor where we could utilize OTTL conditions?

jpkrohling · 2023-03-29T19:05:25Z

That's an interesting thought. Perhaps we could do a brainstorming session, looking at the current policies and seeing which ones could be replaced by an "OTTL policy".

yehaotian · 2023-04-04T17:06:25Z

Hi @jpkrohling @TylerHelmuth any updates on this? Are we expecting OTTL policy for tail sampling processor soon?

TylerHelmuth · 2023-04-04T17:36:13Z

I think OTTL can help here, but I won't be able to work on it soon.

jpkrohling · 2023-04-05T13:49:32Z

Same. I'm happy to review your PR, @yehaotian, in case you want to send one, but I don't have time to work on it myself.

jiekun · 2023-04-11T08:26:53Z

@yehaotian CMIIW, the expected configuration should be something like this?

processors:
  tail_sampling:
    ...
    policies:
      [
          {
            name: ottl_policy_1,
            type: ottl_policy,
            query: sample(attributes["your.custom.attributes"] = "/health")  # or something else meets OTTL standard
          },

I feel like we could replace status_code / string_attribute / numeric_attribute / boolean_attribute / span_count / trace_state with OTTL or let them co-exist for a certain time period first, to get more feedback from users.

@jpkrohling I'm pretty interested in implementing this. May I have your suggestion, like:

should we further discuss the OTTL way to provide user convenience? (Not sure if it was discussed at the SIG meeting)
should I have a detailed tech design or just create a pull request with codes and related descriptions?

TylerHelmuth · 2023-04-11T15:52:55Z

Couple quick suggestions:

replace query with statement.
Drop the Invocation (sample) and right the statement as if it were an OTTL condition. We don't have a condition-only parser yet for OTTL ([pkg/ottl] expose a parser explicitly for parsing conditions #13545) but the condition can be appended to a noop invocation during startup. This is the pattern the filterprocessor and countconnector use.

Hopefully the tailsamplingprocesor would be able to take advantage of internal/filter/filterottl to do the matching.

@jpkrohling I haven't read through the tailsamplingprocessor code myself, but if internal/filter/filterottl slots in nicely to some boolean check somewhere then I think this is a relatively straightforward approach.

jpkrohling · 2023-04-12T20:02:47Z

some boolean check somewhere then I think this is a relatively straightforward approach

Yes, it boils down to that. We have more states than just booleans, but a policy can decide to return only a "sample" or "not sampled" if it wants.

jiekun · 2023-05-30T06:43:07Z

@yehaotian Please check the OTTL condition policy and see if it mets your requirement :P

TylerHelmuth · 2023-05-30T17:55:39Z

Closing this for now as the OTTL condition policy will allow conditions based any telemetry field. Please ping me if you think it should be reopened.

yehaotian · 2023-05-30T21:24:28Z

@TylerHelmuth Thanks for the change!!
I have following policy and it seems not working:

        {
          name: debug-policy,
          type: ottl_condition,
          ottl_condition: {
            error_mode: ignore,
            span: [
              "attributes[\"debug-id\"] != nil",
            ]
          }
        },

btw no error messages.

yehaotian · 2023-05-30T21:29:34Z

Debug log:

{"level":"debug","ts":1685482015.4306045,"caller":"sampling/ottl.go:59","msg":"Evaluating with OTTL conditions filter","kind":"processor","name":"tail_sampling","pipeline":"traces","traceID":"061c3f37e4c300ec350e3f9d9bd75e67"}
{"level":"debug","ts":1685482015.4307067,"caller":"tailsamplingprocessor@v0.78.0/processor.go:199","msg":"Sampling policy evaluation completed","kind":"processor","name":"tail_sampling","pipeline":"traces","batch.len":1,"sampled":0,"notSampled":4,"droppedPriorToEvaluation":0,"policyEvaluationErrors":0}

The span does have debug-id attribute

TylerHelmuth · 2023-05-30T22:02:27Z

@yehaotian is debug-id definitely an attribute and not a resource attribute?

yehaotian · 2023-05-30T22:13:13Z

Yes, this is something can be added in the business logic for debug.
Also it is listed in Attribute not Resource category

TylerHelmuth · 2023-05-30T22:33:26Z

@yehaotian Out of curiosity what happens if you do == instead? I want to make sure we didn't accidentally inverse the condition.

jiekun · 2023-05-31T00:59:02Z

@TylerHelmuth @yehaotian I believe this should work. test case as below:
https://github.com/jiekun/opentelemetry-collector-contrib/blob/1a42bceb9a1cf10bc11e427628a4042843514847/processor/tailsamplingprocessor/internal/sampling/ottl_test.go#L66

		{
			"OTTL conditions inverse match(!=) span attributes 2",
			[]string{"attributes[\"attr_k_1\"] != \"attr_v_1\""},  // this is my ottl condition
			[]string{},
			[]spanWithAttributes{{SpanAttributes: map[string]string{"attr_k_1": "attr_v_2"}}},  // span attributes
			false,
			Sampled,  // it should be sampled
		},

I will check it again today with " != nil" condition.

yehaotian · 2023-05-31T19:38:06Z

After enabling debug logging exporter, I notice sometimes the attributes fail to be reported to otel collector which seems super wired to me.
FYI, we are using Jaeger opentracing instrumentation with otel collector Jaeger receiver, perhaps there are issues during schema transformation

TylerHelmuth · 2023-05-31T20:58:17Z

For our testing purposes we need to reduce variability. Can you use the attributeprocessor or transformprocessor to add the attribute to the telemetry before the tailsamplingprocessor to ensure that it is always present?

jiekun · 2023-06-01T03:47:37Z

After enabling debug logging exporter, I notice sometimes the attributes fail to be reported to otel collector which seems super wired to me. FYI, we are using Jaeger opentracing instrumentation with otel collector Jaeger receiver, perhaps there are issues during schema transformation

Thanks for clarifying this. I checked the != nil condition with unit test as well, which works correctly. I am putting those test cases here FYI. Feel free to provide more info / feedback.

		{
			Desc:                "OTTL conditions 1",
			SpanConditions:      []string{"attributes[\"attr_k_1\"] == \"attr_v_1\""},
			SpanEventConditions: []string{},
			Spans:               []spanWithAttributes{{SpanAttributes: map[string]string{"attr_k_1": "attr_v_1"}}},
			WantErr:             false,
			Decision:            Sampled,
		},
		{
			Desc:                "OTTL conditions 2",
			SpanConditions:      []string{"attributes[\"attr_k_1\"] != \"attr_v_1\""},
			SpanEventConditions: []string{},
			Spans:               []spanWithAttributes{{SpanAttributes: map[string]string{"attr_k_1": "attr_v_1"}}},
			WantErr:             false,
			Decision:            NotSampled,
		},
		{
			Desc:                "OTTL conditions 3",
			SpanConditions:      []string{"attributes[\"attr_k_1\"] != nil"},
			SpanEventConditions: []string{},
			Spans:               []spanWithAttributes{{SpanAttributes: map[string]string{"attr_k_1": "attr_v_1"}}},
			WantErr:             false,
			Decision:            Sampled,
		},

yehaotian added enhancement New feature or request needs triage New item requiring triage labels Mar 23, 2023

TylerHelmuth added processor/tailsampling Tail sampling processor and removed needs triage New item requiring triage labels Mar 23, 2023

jiekun mentioned this issue Apr 13, 2023

[processor/tailsampling] add OTTL Condition policy #20890

Merged

TylerHelmuth closed this as completed May 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[processor/traces] Action/sampling on resources attributes based on span attributes #20294

[processor/traces] Action/sampling on resources attributes based on span attributes #20294

yehaotian commented Mar 23, 2023 •

edited

Loading

yehaotian commented Mar 23, 2023

github-actions bot commented Mar 23, 2023

TylerHelmuth commented Mar 23, 2023

jpkrohling commented Mar 29, 2023

yehaotian commented Apr 4, 2023

TylerHelmuth commented Apr 4, 2023

jpkrohling commented Apr 5, 2023

jiekun commented Apr 11, 2023 •

edited

Loading

TylerHelmuth commented Apr 11, 2023

jpkrohling commented Apr 12, 2023

jiekun commented May 30, 2023

TylerHelmuth commented May 30, 2023

yehaotian commented May 30, 2023 •

edited

Loading

yehaotian commented May 30, 2023

TylerHelmuth commented May 30, 2023

yehaotian commented May 30, 2023

TylerHelmuth commented May 30, 2023

jiekun commented May 31, 2023

yehaotian commented May 31, 2023

TylerHelmuth commented May 31, 2023

jiekun commented Jun 1, 2023 •

edited

Loading

[processor/traces] Action/sampling on resources attributes based on span attributes #20294

[processor/traces] Action/sampling on resources attributes based on span attributes #20294

Comments

yehaotian commented Mar 23, 2023 • edited Loading

Component(s)

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

yehaotian commented Mar 23, 2023

github-actions bot commented Mar 23, 2023

TylerHelmuth commented Mar 23, 2023

jpkrohling commented Mar 29, 2023

yehaotian commented Apr 4, 2023

TylerHelmuth commented Apr 4, 2023

jpkrohling commented Apr 5, 2023

jiekun commented Apr 11, 2023 • edited Loading

TylerHelmuth commented Apr 11, 2023

jpkrohling commented Apr 12, 2023

jiekun commented May 30, 2023

TylerHelmuth commented May 30, 2023

yehaotian commented May 30, 2023 • edited Loading

yehaotian commented May 30, 2023

TylerHelmuth commented May 30, 2023

yehaotian commented May 30, 2023

TylerHelmuth commented May 30, 2023

jiekun commented May 31, 2023

yehaotian commented May 31, 2023

TylerHelmuth commented May 31, 2023

jiekun commented Jun 1, 2023 • edited Loading

yehaotian commented Mar 23, 2023 •

edited

Loading

jiekun commented Apr 11, 2023 •

edited

Loading

yehaotian commented May 30, 2023 •

edited

Loading

jiekun commented Jun 1, 2023 •

edited

Loading