-
Notifications
You must be signed in to change notification settings - Fork 449
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Decide on requirements for C++ library #4
Comments
I'd recommend having the first version be based on C++11, but with support for older compilers (for example, gcc 4.8, which would be a subset of C++11) Gcc 4.8 support is something that's frequently been asked for with OpenTracing and some of their related projects. Example issues C++98 and C could be supported via a separate C API if there's enough demand for it. And the C API could bind to a C++11 implementation via a bridge so that it's not necessary to have an entire C implementation.
It might be possible to make most of the API usable as a header-only file. But if you want support for a global tracer concept (where a tracer can be registered to a global variable), you'll need a source file. I would expect an implementation of the API to, of course, require source files.
Given that errors in a tracer should should be non-critical for an application, I'd recommend having most of the API The exception would be cases where user-provided callback code gets invoked and the user might throw exceptions in their own code that gets propagated out of the API function. For example, the OpenTracing API allowed methods like this to throw exceptions if they were initiated by the user's code
I think the API should have no external dependencies and dependencies from the implementations should be optional depending on which one you use. |
I think that we should consider what is most important: (C-libraries that want to adopt open-telemetry would need a separate C-implementation that does not depend on a C++-runtime. Thus, I think we should at least provide a C++ interface, although we could provide a more API-stable C-interface as well)
Multiple SDKs might be a possibility. Requirements like no use of the C++ Standard Library make the SDK harder/more time consuming to develop - perhaps this could be done in more specialized versions of the SDK?
|
C++11 or newer, std, cmake |
While you are consider C++ versions, this might be useful: https://www.jetbrains.com/lp/devecosystem-2019/cpp/ Consider also that Protobuf is C++11, and so is gRPC. |
May want to compare C++98 and C++03 in previous years: https://www.jetbrains.com/research/devecosystem-2018/cpp/ |
Note that gRPC decided to depend on https://github.com/grpc/proposal/blob/master/L59-core-allow-cppstdlib.md I think you should use |
That is probably not what you want. Declaring a function |
@coryan Yes, I'm aware of what it does; and yes, that is what I want. Given that tracing isn't critical to an app, a tracer should handle errors by logging and dropping spans, data, etc rather than exposing an exception to the end user. So it makes a lot of sense to mark most of the API as noexcept and that's what we did with the OpenTracing API (see, for example, the span interface). It's is also consistent with recommended guidelines. See Core Guidelines and Effective C++. Could you explain why you don't think it should be used? |
Because it is trivially easy to make function calls that do raise exceptions and then you get a crash. For example in your span interface:
Assuming the implementation does some memory allocation (I think that is reasonable, but you tell me), then this can throw a I am aware that http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p1404r1.html I might be wrong about all this, and if so I am happy to learn otherwise. |
That function returns a nullptr on failure -- it's documented in the interface. While the most resilient implementations will turn bad_alloc's into noops (and that's what I did when I implemented it, it's also perfectly accept terminate on bad_alloc. In fact, it's likely that the default standard allocator will change to terminate on bad_alloc (See Herb Sutter's talk on error handling |
I was hoping to add a viewpoint regarding decisions around ABI Compatibility, std:: and versions. Early vs Late adopters argumentAll the following points are based on the viewpoint that: early adopters are more likely to be using latest technologies (versions) whereas those using say C++03 (this includes parts of my org too...) will be less likely to adopt OpenTelemetry until it has been more widely adopted/stable. (I don't have data to back up this opinion 😞 other than personal experience) Therefore I'd suggest that the best delivery and adoption would be achieved by offering a library to the likely early adopters using latest technologies and deferring the overhead of support for older platforms (and hence late adopters). ABI CompatibilityHow about a hybrid approach of an ABI stable C interface and a header-only C++ interface. This not only avoids the ABI compatibility issue but enables C-only users to use the library, albeit with more difficulty. As per the argument above, this can be deferred and added later, e.g. the initial release can be for say C++17 only, and offer no ABI compatibility guarantees when compiled with older/newer standards. Standard LibraryPresuming ABI issues are solved using the mechanism above, there should be no issue of using std:: in a header-only implementation (presuming care is taken to avoid this). However, if OpenTelemetry is used in a library that is then distributed, then it will inherit this problem which I consider beyond the responsibility of the OpenTelemtry library. C++ VersionGiven the goal of catering to the early adopters who are likely using later versions, I suggest starting with C++17. This version comes with Support for older versions can be added incrementally through version guards ( ConclusionBased on the above I am suggesting:
|
Right now we're supporting gcc-4.8 which includes most of C++11. There's still a lot of demand for older compilers and I think targeting something as late as C++17 would be too restrictive. If you look through the issues, you'll find lots of people using opentracing-cpp with older compilers. A few examples: And we've already gotten most of the benefits of the later versions of C++ by backporting the relevant parts of the standard library (e.g. std::string_view, std::span)
I'm not opposed to having an additional C API that can wrap the opentelemetry-cpp API. However, a C API would be less convenient to use, have a larger surface area, and have fewer people willing to maintain it. This came up regularly with opentracing and someone started opentracing-c, but it's unmaintained and I doubt anyone is using it: https://github.com/opentracing/opentracing-c I'd rather focus on making the C++ API as portable and broadly usable as we can. |
I appreciate the appeal of broader support, my comment was more about the "when" rather than the "if", but I do understand if the answer is "we wish to support C++11 from day 1". If that's the case, might I suggest that we catalogue a list of key projects that we're tackling that require these constraints, e.g. Nginx. It would be a great way to both validate, maintain a list of supported projects/platforms and also provide some marketing of what is being targeted end hence will hopefully offer OpenTelemetry support. Another thought, for further down-the-line, would be to produce and document an engineering strategy for supporting older platforms, e.g. how many years a certain platform is likely to be supported and the exit strategy. It may be prohibitive to maintain, say C++03 support.
While this does provide an ABI portable C API, the C++ API will not be ABI portable, e.g. it would be hard to guarantee that a version of OpenTelemetry-cpp built with one toolchain would be link-able in a project built in another toolchain. It raises the question of whether this project wants to provide Binary or Source distribution artefacts as this problem does not exist with Source distribution as long as a single toolchain is mandated. I'm not sure if this is covered by the statement in
Maybe worth while clarifying the stance on this. Sorry if this comes across critical, my intention is to get clarity and align with the goals. |
It's absolutely possible to get a portable C++ ABI. You only need a of handful basic vocabulary types to describe the virtual methods of tracers and meters. We provide those (for example nostd::string_view, nostd::span) so that we fix their ABI and provide implicit conversion operators to their corresponding standard c++ types. While the C++ standard doesn't officially describe things like vtable layout, in practice, nearly all the compilers we care about will have ABI compatible vtable layouts for a target architecture (it's specified as part of the Itanium C++ ABI). To get a portable tracer plugin, you then
This will give you a tracer plugin that's usable across a wide range of environments and isn't tied to a particular version of the standard C++ library. |
Following up from last weeks call, here's my attempt at breaking out the different deployment scenarios that are relevant for ABI stability. Case 1: Early-binding linking. Both the OpenTelemetry API and implementation are known and linked into the application at compile-time. This might be one of the most common scenarios. But I don't see it posing any challenges for ABI stability since everything is fixed to the same version. Case 2: Early-binding linking for the OpenTelemetry API. Late-binding linking for an Implementation. The OpenTelemetry API is known and linked into the application at compile-time. An implementation is loaded as a plugin at runtime. This approach has the advantage that the application isn't tethered to a particular OpenTelemetry implementation. We are free to use alternative implementations without rebuilding the app, allowing vendors to support target applications without integrating into the OpenTelemetry repo or changing the target application's source code. Some examples of this pattern: It's desirable with this approach to support pre-build portable OpenTelemetry plugin binaries that can be easily deployed. Ensuring that an application can work together with a plugin is possible, but takes some care (See my post above or the plugin PR) as the plugin and application may not have been linked against the same standard C++ library or OpenTelemetry API. Case 3: Late-binding for instrumented code. In this case, (possibly multiple) instrumented DSOs are loaded into an application at runtime. If we want the DSOs to share global OpenTelemetry objects (tracers, meters, span-contexts), then we need to be careful that 1) the ABIs for global OpenTelemetry objects are stable for different standard C++ libraries and 2) the DSOs are dlopen-ed and dlclose-ed in a way that allows them to share global symbols. Examples:
I haven't thought through this scenario as thoroughly, but I put together a quick example of how global symbol can be shared across loaded DSOs. A few notes on the linking
Only S::Get1 will be shared across DSOs; other symbols like S::Get2 would only have local visibility.
Python, for example, uses RTLD_LOCAL by default; so for this to work, you'd need to tweak some options.
|
And to clarify on ABI stability requirements, what's important is to have stable ABI types in the virtual method signatures. Provided you set up linking correctly, it's fine to use STL types with non-stable ABIs in non-virtual methods and as data members. Here's an example that sets up two DSOs that interface with a virtual Cont class but have different definitions for the derived class ContImpl (to simulate using incompatible STLs). If you build it and run, you can see that each DSO can still interact with a ContImpl constructed from the other DSO
|
@rnburn this was a great write up in #4 (comment), as per my comments during previous week's SIG meeting, I think it would be worthwhile broadening the options further to consider: Binding Options:
Components:
We can then explore each scenario/example, e.g. the Nginx example becomes:
Now, in this case, the 3rd step, whether statically or dynamically early bound has some implications on how an OTel singleton would be loaded. This would also highlight the challenges of some combinations, such as: I suggest documenting the "library binary distribution" policy, including both the ABI compatibility implications, the different binding scenarios, and recommendations. |
Origin/propagators
enable SSL for curl in bazel build environment
A few topics that need decisions:
std
or no.Opinions are welcome here in the form of comments and we will also discuss this in SIG meetings.
The text was updated successfully, but these errors were encountered: