Redesign of Asynchronous Instrument Registration API via the Meter #3519

MrAlias · 2022-12-06T19:27:26Z

Currently the metric API provides the following registration API for asynchronous instruments via a meter:

Line 58 in 4146bd1

    
RegisterCallback(insts []instrument.Asynchronous, function func(context.Context)) error

Also, the OpenTelemetry specification states ...

Multiple-instrument Callbacks MUST be associated at the time of registration with a declared set of asynchronous instruments from the same Meter instance. This requirement that Instruments be declaratively associated with Callbacks allows an SDK to execute only those Callbacks that are necessary to evaluate instruments that are in use by a configured View.

The current API does not enforce this: a callback can observe values for instruments it is not registered for, and it can be associated with instruments from different Meter instances.
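To make the gap concrete, here is a minimal sketch using toy stand-ins rather than the real metric package. The `meter`, `gauge`, `registerCallback`, and `demo` names are hypothetical. It only illustrates that an opaque `func(context.Context)` callback can update any instrument its closure can reach, whatever was declared at registration.

```go
package main

import (
	"context"
	"fmt"
)

// gauge is a toy stand-in for an asynchronous instrument (hypothetical).
type gauge struct {
	name  string
	meter string // name of the meter that created it
	value int64
}

// Observe records a value. Nothing ties this call to a registration.
func (g *gauge) Observe(_ context.Context, v int64) { g.value = v }

// meter is a toy stand-in for a Meter with the current registration shape.
type meter struct {
	name      string
	callbacks []func(context.Context)
}

func (m *meter) newGauge(name string) *gauge {
	return &gauge{name: name, meter: m.name}
}

// registerCallback mirrors the shape of the current API: the instrument list
// is declared, but the callback itself is an opaque function, so the list
// cannot constrain what the callback does.
func (m *meter) registerCallback(insts []*gauge, f func(context.Context)) {
	m.callbacks = append(m.callbacks, f)
}

func (m *meter) collect(ctx context.Context) {
	for _, f := range m.callbacks {
		f(ctx)
	}
}

// demo registers one instrument but observes two, one from another meter.
func demo() (registeredValue, strayValue int64) {
	a, b := &meter{name: "a"}, &meter{name: "b"}
	registered := a.newGauge("registered")
	stray := b.newGauge("stray") // belongs to a different meter

	a.registerCallback([]*gauge{registered}, func(ctx context.Context) {
		registered.Observe(ctx, 1)
		stray.Observe(ctx, 2) // never declared, still updated
	})
	a.collect(context.Background())
	return registered.value, stray.value
}

func main() {
	r, s := demo()
	fmt.Println("registered:", r, "stray:", s)
}
```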

The SDK tries to prevent this by embedding a unique token in the passed context.

opentelemetry-go/sdk/metric/pipeline.go

Line 118 in 4146bd1

ctx = context.WithValue(ctx, produceKey, struct{}{})

That key (produceKey) is not unique to a Meter, so it does not prevent instruments from different meters from being updated. Furthermore, it means a user has to pass that context to the Observe methods they invoke. This latter requirement is not explicitly stated and is a potential point of frustration for users who pass a different context here.
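Both consequences can be reproduced with a small sketch. It assumes a single package-level context key shared by every meter, modeled on the `produceKey` line quoted above. The `gauge`, `collect`, and `demo` names are hypothetical and the real SDK code differs.

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

// produceKeyType mimics one package-level context key shared by all meters.
type produceKeyType struct{}

var produceKey produceKeyType

var errNotInCallback = errors.New("observe called without the collection context")

// gauge is a toy asynchronous instrument (hypothetical).
type gauge struct {
	meter string
	value int64
}

// Observe only checks that the token is present, not which meter set it.
func (g *gauge) Observe(ctx context.Context, v int64) error {
	if ctx.Value(produceKey) == nil {
		return errNotInCallback
	}
	g.value = v
	return nil
}

// collect runs a callback the way a toy SDK might, tagging the context first.
func collect(f func(context.Context)) {
	ctx := context.WithValue(context.Background(), produceKey, struct{}{})
	f(ctx)
}

// demo returns the error for an instrument from another meter and the error
// for an observation made with a context the user created themselves.
func demo() (strayErr, freshCtxErr error) {
	mine := &gauge{meter: "a"}
	stray := &gauge{meter: "b"}
	collect(func(ctx context.Context) {
		_ = mine.Observe(ctx, 1)
		// Accepted: the token says nothing about the owning meter.
		strayErr = stray.Observe(ctx, 2)
		// Rejected: a user-created context lacks the token.
		freshCtxErr = mine.Observe(context.Background(), 3)
	})
	return strayErr, freshCtxErr
}

func main() {
	strayErr, freshCtxErr := demo()
	fmt.Println("instrument from another meter:", strayErr)
	fmt.Println("own instrument, fresh context:", freshCtxErr)
}
```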

Proposal

Similar to the callback design proposed in #3507, having the callback return a set of observations for the instruments it is registered with will ensure only those instruments are updated and that they are valid.

type Meter interface {
	// [...]
	RegisterCallback(f Callback, insts ...instrument.Asynchronous) error
}

type Callback func(context.Context) ([]Int64Observation, []Float64Observation, error)

type Int64Observation struct {
	Instrument instrument.Asynchronous
	Measurement Int64Measurement
}

type Int64Measurement struct {
	Attributes []attribute.KeyValue
	Value int64
}

type Float64Observation struct {
	Instrument instrument.Asynchronous
	Measurement Float64Measurement
}

type Float64Measurement struct {
	Attributes []attribute.KeyValue
	Value float64
}
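As a rough illustration of how an SDK could consume such a callback, the sketch below keeps the set of instruments declared at registration and drops any returned observation outside it. It is a simplified toy under stated assumptions, not the proposed API itself: only int64 observations are modeled, attributes are omitted, and `Instrument`, `registration`, `Collect`, and `demo` are hypothetical names.

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

// Instrument is a toy stand-in for instrument.Asynchronous.
type Instrument struct{ Name string }

// Int64Observation pairs an instrument with a value, as in the proposal
// (attributes are left out to keep the sketch short).
type Int64Observation struct {
	Instrument *Instrument
	Value      int64
}

// Callback returns its observations instead of calling Observe itself.
type Callback func(context.Context) ([]Int64Observation, error)

type registration struct {
	callback Callback
	allowed  map[*Instrument]struct{}
}

type Meter struct{ regs []registration }

// RegisterCallback records which instruments the callback may report on.
func (m *Meter) RegisterCallback(f Callback, insts ...*Instrument) {
	allowed := make(map[*Instrument]struct{}, len(insts))
	for _, i := range insts {
		allowed[i] = struct{}{}
	}
	m.regs = append(m.regs, registration{callback: f, allowed: allowed})
}

var errNotRegistered = errors.New("observation for an instrument not registered with this callback")

// Collect runs every callback and keeps only observations for instruments
// declared at registration. Anything else is reported as an error.
func (m *Meter) Collect(ctx context.Context) (map[string]int64, []error) {
	out := make(map[string]int64)
	var errs []error
	for _, r := range m.regs {
		obs, err := r.callback(ctx)
		if err != nil {
			errs = append(errs, err)
			continue
		}
		for _, o := range obs {
			if _, ok := r.allowed[o.Instrument]; !ok {
				errs = append(errs, errNotRegistered)
				continue
			}
			out[o.Instrument.Name] = o.Value
		}
	}
	return out, errs
}

// demo registers "hits" only, while the callback also reports "stray".
func demo() (map[string]int64, []error) {
	m := &Meter{}
	hits, stray := &Instrument{Name: "hits"}, &Instrument{Name: "stray"}
	m.RegisterCallback(func(context.Context) ([]Int64Observation, error) {
		return []Int64Observation{{hits, 42}, {stray, 7}}, nil
	}, hits)
	return m.Collect(context.Background())
}

func main() {
	values, errs := demo()
	fmt.Println(values, errs)
}
```

Because the SDK performs the recording, users need no special context and the validation lives in one place. The cost is that each callback allocates a slice for its results.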


MadVikingGod · 2022-12-09T15:08:57Z

So in this approach, I see two things that are potential problems.

1. The user is not supposed to use the Observe() within the callbacks. This will cause confusion, and I think is counter to the intent of the API.
2. This will require ALL callbacks to allocate additional slices.

This additionally doesn't prevent users from using an instrument from a different meter.

MrAlias · 2022-12-09T16:11:31Z

> The user is not supposed to use the Observe() within the callbacks. This will cause confusion, and I think is counter to the intent of the API.

The API specification explicitly lists this as a valid callback design:

Return a list (or tuple, generator, enumerator, etc.) of individual Measurement values.

It is also how other implementations, e.g. Python, are designed.

I don't follow how this isn't the intent of the API. Can you elaborate?

> This additionally doesn't prevent users from using an instrument from a different meter.

If the SDK is the one doing the observation, why can it not verify the instrument belongs to the appropriate meter in the process?

MrAlias · 2022-12-10T00:31:26Z

I'm looking into an alternate proposal that uses the runtime.Caller function to ensure the callback is in the call-stack when Observe is called.

MrAlias · 2022-12-10T16:05:59Z

Alternate proposal to use the runtime.Caller function to ensure the Observe method is called from a Meter:

https://go.dev/play/p/uF-Utm0S5h6

import (
	"errors"
	"fmt"
	"runtime"
)

type Meter struct {
	name          string
	registrations []registration
}

func NewMeter(name string) *Meter { return &Meter{name: name} }

func (m *Meter) Instrument() *Instrument {
	return &Instrument{
		meter:   m.name,
		callers: make([]uintptr, 8),
	}
}

func (m *Meter) Register(f Callback, instruments ...*Instrument) error {
	for _, i := range instruments {
		if i.meter != m.name {
			return fmt.Errorf("instrument from another meter: %s", i.meter)
		}
	}
	m.registrations = append(m.registrations, registration{
		Instrument: instruments,
		Callback:   f,
	})
	return nil
}

func (m *Meter) Collect() error {
	for _, reg := range m.registrations {
		if err := reg.collect(); err != nil {
			return err
		}
	}
	return nil
}

type Callback func() error

type registration struct {
	Instrument []*Instrument
	Callback   Callback
}

func (r registration) collect() error {
	pc, _, _, ok := runtime.Caller(1)
	if !ok {
		return errors.New("failed to get program counter")
	}
	for _, i := range r.Instrument {
		i.caller = pc
	}
	err := r.Callback()
	for _, i := range r.Instrument {
		i.caller = 0
	}
	return err
}

func contains(pc uintptr, frames *runtime.Frames) bool {
	for {
		frame, more := frames.Next()
		if pc == frame.PC {
			return true
		}
		if !more {
			break
		}
	}
	return false
}

type Instrument struct {
	meter string

	caller  uintptr // TODO: locking and multiple callers.
	callers []uintptr
}

func (i *Instrument) Observe() error {
	if i.caller == 0 {
		return errors.New("Observe must be called from a registered callback")
	}
	var stackIdx int
	for {
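		n := runtime.Callers(1+stackIdx, i.callers)
		// Only inspect the n entries written by this call; the rest of the
		// buffer may hold stale program counters from an earlier pass.
		if contains(i.caller, runtime.CallersFrames(i.callers[:n])) {
			// Called from within a registered callback.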
			fmt.Println("Instrument observed")
			return nil
		}
		if n != len(i.callers) {
			break
		}
		stackIdx += n
	}
	return errors.New("Observe must be called from the callback it is registered with")
}

MadVikingGod · 2022-12-13T14:48:49Z

There seems to be a very high overhead cost to this approach.

I think we can expect users to pass the callback's context to their Observe calls. Could we do something simpler, like generating a random number when the meter is created and passing that in the context?
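A minimal sketch of that suggestion follows, assuming the token is a random `uint64` stored on the meter and carried in the callback's context. All names here (`tokenKey`, `NewMeter`, `Collect`, `demo`) are hypothetical and not part of any existing API.

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"math/rand"
)

// tokenKey is the context key under which a meter stores its token.
type tokenKey struct{}

type Meter struct {
	token uint64 // chosen once, when the meter is created
}

func NewMeter() *Meter { return &Meter{token: rand.Uint64()} }

type Instrument struct {
	meter *Meter
	value int64
}

func (m *Meter) Instrument() *Instrument { return &Instrument{meter: m} }

// Collect hands the callback a context carrying this meter's token.
func (m *Meter) Collect(f func(context.Context) error) error {
	return f(context.WithValue(context.Background(), tokenKey{}, m.token))
}

var errWrongContext = errors.New("observe requires the context passed to this meter's callback")

// Observe accepts the value only if the context carries the token of the
// meter that created this instrument.
func (i *Instrument) Observe(ctx context.Context, v int64) error {
	tok, ok := ctx.Value(tokenKey{}).(uint64)
	if !ok || tok != i.meter.token {
		return errWrongContext
	}
	i.value = v
	return nil
}

// demo observes one instrument from the collecting meter and one from
// another meter, returning the error from each call.
func demo() (own, other error) {
	a, b := NewMeter(), NewMeter()
	mine, stray := a.Instrument(), b.Instrument()
	_ = a.Collect(func(ctx context.Context) error {
		own = mine.Observe(ctx, 1)
		other = stray.Observe(ctx, 2)
		return nil
	})
	return own, other
}

func main() {
	own, other := demo()
	fmt.Println("own meter:", own)
	fmt.Println("other meter:", other)
}
```

This distinguishes meters, but not individual registrations within a meter, and it still depends on users passing the callback's context through to Observe. Two meters could in principle draw the same random number. Using the meter pointer itself as the identity would avoid that.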

MrAlias · 2022-12-15T16:34:26Z

> There seems to be a very high overhead cost to this approach.

Can you elaborate on this? What were your findings?
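One way to put numbers on the overhead question is a micro-benchmark comparing a call that captures the stack with `runtime.Callers` against a call that does not. The harness below is only a sketch of such a measurement. The function names are hypothetical, it measures the stack capture alone rather than the full proposal, and results will vary with machine, Go version, and stack depth.

```go
package main

import (
	"fmt"
	"runtime"
	"testing"
)

// sink keeps the compiler from optimizing the measured calls away.
var sink int

// observeWithStackWalk pays for a runtime.Callers capture on every call,
// which the stack-inspection proposal needs at least once per Observe.
func observeWithStackWalk(buf []uintptr) int {
	return runtime.Callers(1, buf)
}

// observePlain is the baseline: no stack inspection at all.
func observePlain(buf []uintptr) int { return len(buf) }

// bench runs f in a standard benchmark loop and returns the result.
func bench(f func([]uintptr) int) testing.BenchmarkResult {
	buf := make([]uintptr, 8)
	return testing.Benchmark(func(b *testing.B) {
		for n := 0; n < b.N; n++ {
			sink = f(buf)
		}
	})
}

func main() {
	fmt.Println("stack walk:", bench(observeWithStackWalk))
	fmt.Println("baseline:  ", bench(observePlain))
}
```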

MrAlias · 2023-01-05T00:57:04Z

From #3373

Idiomatic APIs for multiple-instrument Callbacks MUST distinguish the instrument associated with each observed Measurement value.

Our API currently does not do this. It assumes the association is correctly done within a Callback by a user.

This seems to motivate resolving the redesign here so that we comply with this requirement.

MrAlias · 2023-01-07T00:40:42Z

From #3380

It is RECOMMENDED that the API authors use one of the following forms for the callback function:

The list (or tuple, etc.) returned by the callback function contains (Instrument, Measurement) pairs.

the Observable Result parameter receives an additional (Instrument, Measurement) pairs

We do not follow this recommendation from the specification.
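For comparison with the list-returning form, the second recommended form can be sketched with an observer value that the SDK passes to the callback. This is an illustrative toy, not a statement of what the API does or will do. `Observer`, `ObserveInt64`, `collect`, and `demo` are hypothetical names, and only int64 values without attributes are modeled.

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

type Instrument struct{ Name string }

// Observer plays the role of the "Observable Result": the callback reports
// (Instrument, Measurement) pairs through it instead of calling instruments.
type Observer interface {
	ObserveInt64(inst *Instrument, value int64)
}

type Callback func(context.Context, Observer) error

// observer is scoped to one registration, so it knows which instruments
// are allowed without inspecting the context or the call stack.
type observer struct {
	allowed map[*Instrument]struct{}
	values  map[string]int64
	errs    []error
}

func (o *observer) ObserveInt64(inst *Instrument, value int64) {
	if _, ok := o.allowed[inst]; !ok {
		o.errs = append(o.errs, errors.New("instrument not registered with this callback"))
		return
	}
	o.values[inst.Name] = value
}

// collect runs one callback with an observer limited to insts.
func collect(ctx context.Context, f Callback, insts ...*Instrument) (map[string]int64, []error) {
	o := &observer{
		allowed: make(map[*Instrument]struct{}, len(insts)),
		values:  make(map[string]int64),
	}
	for _, i := range insts {
		o.allowed[i] = struct{}{}
	}
	if err := f(ctx, o); err != nil {
		o.errs = append(o.errs, err)
	}
	return o.values, o.errs
}

// demo declares "hits" only, while the callback also reports "stray".
func demo() (map[string]int64, []error) {
	hits, stray := &Instrument{Name: "hits"}, &Instrument{Name: "stray"}
	return collect(context.Background(), func(_ context.Context, o Observer) error {
		o.ObserveInt64(hits, 5)
		o.ObserveInt64(stray, 9) // not declared below, so it is rejected
		return nil
	}, hits)
}

func main() {
	values, errs := demo()
	fmt.Println(values, errs)
}
```

Compared with returning slices, this shape lets the callback report values one at a time, so the callback itself does not need to build a result list.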

MrAlias added the bug, pkg:API, and area:metrics labels Dec 6, 2022

MrAlias added this to Go: Metric SDK (Beta) and Go: Metric API (GA) Dec 6, 2022

MrAlias moved this to Triage Needed in Go: Metric API (GA) Dec 6, 2022

MrAlias moved this from Triage Needed to Todo in Go: Metric API (GA) Dec 7, 2022

MrAlias moved this to Todo in Go: Metric SDK (Beta) Dec 7, 2022

MrAlias mentioned this issue Jan 5, 2023

Verify compliant metric API specification implementation: Instrument General Characteristics #3373

Closed

3 tasks

MrAlias mentioned this issue Jan 7, 2023

Verify compliant metric API specification implementation: Measurement #3380

Closed

2 tasks

MrAlias self-assigned this Jan 10, 2023

This was referenced Jan 11, 2023

Multiple-instrument Callbacks must only be associated with instruments from the same Meter #3583

Closed

Redesign RegisterCallback API #3584

Merged

Restructure RegisterCallback method #3587

Merged

MrAlias moved this from Todo to In Progress in Go: Metric SDK (Beta) Jan 12, 2023

MrAlias moved this from Todo to In Progress in Go: Metric API (GA) Jan 12, 2023

MadVikingGod closed this as completed in #3584 Jan 19, 2023

github-project-automation bot moved this from In Progress to Done in Go: Metric SDK (Beta) Jan 19, 2023

github-project-automation bot moved this from In Progress to Done in Go: Metric API (GA) Jan 19, 2023

XSAM added this to the untracked milestone Nov 7, 2024
