-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[service/extensions] extension lifecycle order #8733
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
a82289a
to
312876d
Compare
…s according to configuration
312876d
to
24dfe76
Compare
Codecov ReportAll modified and coverable lines are covered by tests ✅
... and 6 files with indirect coverage changes 📢 Thoughts on this report? Let us know!. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just to hold off until discussion in #8732 is resolved
I received a comment that I should stop extensions in reverse order that they are started, I'll push that. After that, we can discuss the merits of this PR. |
Another implementation option discussed on the issue is to build support for extension dependencies in the graph/ module. Then an extension can programmatically declare a dependency on another extension and the graph would guarantee correct execution order. This would be a larger change, but preferable over explicit config order where users need to worry about listing extensions in the right order. |
Makes sense, but it would require another predefined way to specify dependencies on the extensions in the configuration interface. The current dependencies topology is constructed directly from the pipeline YAML definition. Introducing an additional source for building the graph seems like an excessive complication of the user interface. And some extensions naturally do not provide any capabilities to other components, only to the end-user. I think it's ok to keep it simple like implemented in this PR going forward. |
Don't get me wrong, I am happy with this PR. But when we talk about complicating user interface, I think it's important to remember which users are exposed to such interface. With dependencies expressed programmatically, a very small number of users who write their own extensions are affected. With dependencies expressed via config order, all users of the collector are exposed to the notion that there are dependencies between extensions, even though there's very little they can actually do about it. E.g. |
@yurishkuro, how do you envision a programmatic way to define the dependencies? Something like an optional interface for extensions to define a dependency on another extension interface? I believe that doesn't require integration with the pipeline graph. We would still start all the extensions before the pipelines. |
I'd add an optional argument opentelemetry-collector/extension/extension.go Lines 103 to 107 in 6405e15
I'm not sure why these constructor APIs are currently not made extensible by taking args as a struct instead of fixed positional args. |
In that case, you can only define a hardcoded dependency on a particular component. There should be a way to define a dependency on a class of extensions like storage or encoding, which is defined by an interface an extension implements. |
I personally would avoid duck-typing dependencies, it's better to have explicit ones by ID. There is only one pool of extensions, so what if I have some components depending on one kind of encoding extension and other components depending on another? When they refer to those by ID there is no ambiguity, but duck-typing only works when there is exactly one instance defined for a given interface. |
Extensions should provide the functionality to components via interfaces like storage. Components dependent on that interface can choose any storage extension even if it's implemented outside of OTel core/contrib repos. Choosing a particular storage implementation is exposed to the user with a config option. Having a hardcoded set of IDs means that custom extensions of particular classes cannot be implemented outside, and every extension dependent on a particular class of extension has to maintain a set of all instances, e.g., each time a new storage/encoding extension is added, all dependent components have to be updated.
They should be two classes of extensions that implement two different (or additional) interfaces. |
@dmitryax I didn't mean that the dependency has to be hardcoded: for some extensions it may be sufficient, others may need to express their dependencies after reading their own config. That means the dependencies should not be exposed via |
service/extensions/extensions.go
Outdated
@@ -77,7 +82,8 @@ func (bes *Extensions) NotifyPipelineReady() error { | |||
func (bes *Extensions) NotifyPipelineNotReady() error { | |||
// Notify extensions in reverse order. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this accurate?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like it's not. @atoulme can you please address this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How would you like me to address this? It feels like this comment needs to be removed.
@yurishkuro, I think we can merge this one and rebase #8768 on top of it since it won't cause any breaking changes, right? |
@dmitryax my PR is already based on this one (it includes the original commits). I don't have a strong objection to merging each one separately, but it means the 1st PR will introduce one way of ordering (based purely on config order) and 2nd will override that in favor of the order defined by the DependentExtension interface (with other order still being random). In theory it's possible to support both at the same time: by default start with config-based order as documentation implies, but if DependentExtension interfaces are detected in some extensions move them to the end and apply topological sort (seems a bit complicated). I think just the 2nd approach is cleaner - no order guarantees unless explicitly requested via interface. |
Ok, makes sense to me. Especially if that simplifies the implementation |
**Description**: Enforce order of start and shutdown of extensions according to their internally declared dependencies **Link to tracking Issue**: Resolves #8732 **Motivation**: This is an alternative approach to #8733 which uses declaration order in the config to start extensions. That approach (a) enforces order when it's not always necessary to enforce, and (b) exposes unnecessary complexity to the user by making them responsible for the order. This PR instead derives the desired order of extensions based on the dependencies they declare by implementing a `DependentExtension` interface. That means that extensions that must depend on others can expose this interface and be guaranteed to start after their dependencies, while other extensions can be started in arbitrary order (same as happens today because of iterating over a map). The extensions that have dependencies have two options to expose them: 1. if the dependency is always static (e.g. `jaeger_query` extension depending on `jaeger_storage` as in the OP), the extension can express this statically as well, by returning a predefined ID of the dependent extension 2. in cases where dependencies are dynamic, the extension can read the names of the dependencies from its configuration. The 2nd scenario is illustrated by the following configuration. Here each complex extension knows that it needs dependencies that implement `storage` and `encoding` interfaces (both existing APIs in collector & contrib), but does not know statically which instances of those, the actual names are supplied by the user in the configuration. ```yaml extensions: complex_extension_1: storage: filestorage encoding: otlpencoding complex_extension_2: storage: dbstorage encoding: jsonencoding filestorage: ... dbstorage: ... otlpencoding: jsonencoding: ``` **Changes**: * Introduce `DependentExtension` optional interface * Change `Extensions` constructor to derive the required order using a directed graph (similar to pipelines) * Inherited from #8733 - use new ordered list of IDs to start/stop/notify extensions in the desired order (previously a map was used to iterate over, which resulted in random order). * Tests **Testing**: Unit tests --------- Signed-off-by: Yuri Shkuro <github@ysh.us> Co-authored-by: Antoine Toulme <antoine@lunar-ocean.com>
Superseded by #8768 |
Description:
Enforce order of start and shutdown of extensions according to configuration
Link to tracking Issue:
Fixes issue #8732
Testing:
Unit tests