-
Notifications
You must be signed in to change notification settings - Fork 13k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tracking Issue: Procedural Macro Diagnostics (RFC 1566) #54140
Comments
I'd argue that support for associating warnings with lints should be a separate RFC, and shouldn't block moving forward with unsilenceable warnings (with the expectation that anything to associate warnings with lints would need to be an additional API) |
Similarly, I'm not sure we actually need to address multi-span support before this API can be stabilized. The proposed change there involves changing some methods to be generic, which is considered a minor change under RFC #1105. It could also be done by changing |
@sgrif I suppose the question is: should unsilenceable warnings be allowed at all? I can't think of a reason to remove this control from the end-user. And, if we agree that they shouldn't be allowed, then we should fix the API before stabilizing anything to account for this. I'd really rather not have four warning methods:
Sure, but the change is so minor, so why not do it now? What's more, as much as I want this to stabilize as soon as possible, I don't think enough experience has been had with the current API to merit its stabilization. I think we should implement these two features, announce them broadly so others can play with them, gather feedback, and then stabilize.
Right, that works too. |
I think "having a thing we can ship" is a decent reason, but I also think an API that only supports
Because we have a perfectly workable API that's being used in the wild right now that we could focus on stabilizing instead. Typically we always trend towards the more conservative option on this sort of thing, shipping an MVP that's forward compatible with extensions we might want in the future.
So what needs to happen for that? Should we do a public call for testing? Definitely adding more docs is huge. I suppose it'd be good to see what serde looks like using this API as well. |
Yeah, that's exactly what I was thinking.
I don't think this is an eccentric proposal in any way. When folks play with this, they should have this feature. In any case, I'll be implementing this soon, unless someone beats me to it, as Rocket needs it. |
Should we be going through the RFC process for additions to the API here?
(I noticed that `Diagnostic` itself actually never got an RFC)
…On Tue, Sep 11, 2018 at 3:26 PM Sergio Benitez ***@***.***> wrote:
So what needs to happen for that? Should we do a public call for testing?
Definitely adding more docs is huge. I suppose it'd be good to see what
serde looks like using this API as well.
Yeah, that's exactly what I was thinking.
Because we have a perfectly workable API that's being used in the wild
right now that we could focus on stabilizing instead. Typically we always
trend towards the more conservative option on this sort of thing, shipping
an MVP that's forward compatible with extensions we might want in the
future.
I don't think this is an eccentric proposal in any way. When folks play
with this, they should have this feature. In any case, I'll be implementing
this soon, unless someone beats me to it, as Rocket needs it.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#54140 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ABdWK2Gpoc6vTWz8fKpJmVaGysf9w7-Uks5uaCpsgaJpZM4WkOKO>
.
|
Maybe it's too late (I'm lacking context here), but is there any hope of unifying proc-macro diagnostics with those emitted by the compiler itself? It seems sad and unmotivated to have two parallel implementations of diagnostics. (Rustc's diagnostics also have a suggestions API (albeit still somewhat in flux) that harbors a lot of promise given the new |
@zackmdavis The API being exposed by the diagnostics API in In general, |
Maud also uses the diagnostic API. It would benefit from both features described in the summary:
|
Is there any way to emit warning for arbitrary file? It could be usefull for macros that read additional data from external files (like derive(Template) in https://github.com/djc/askama ) . If it's not possible, how problematic it is to add to |
Something I find confusing about the current nightly API: does Currently I must use In my opinion:
|
@dhardy |
Okay, fair enough; not sure why I some panic messages sometimes and not others.
I guess |
Personally I'd prefer not exposing a The compiler will not proceed with type checking if your macro emits at least one error, so you can get away with returning nonsense (like |
@macpp It's not possible today. I agree that something like this should exist, but much thought needs to be given to the API that's exposed. The API you propose, for instance, puts the onus of tracking line and column information on the user, and makes it possible to produce a The API that I've considered is having a mechanism by which a impl TokenStream {
fn from_source_file<P: AsRef<Path>>(path: P) -> Result<TokenStream>;
} This would make it possible to get arbitrary (but "well-formed") spans anywhere in the file. As a bonus, it means you can reuse libraries that work with |
Well, of course if you deal with .rs files, then panic("bad configuration in config.toml file: line X column Y") Unfortunately, i forgot about |
@macpp No, that would work for almost any kind of text file. A |
And that strings inside single quotes must contain a single codepoint, which eliminates a lot more things (including TOML files) that have arbitrary strings in single quotes. Even the "delimiters must be balanced" requires that the file use the same idea of where a delimiter is and isn't valid. For example the source file must agree that a In short, I don't have a problem with having a |
Marking this as libs-api nominated to evaluate the current state of this API and consider stabilizing some or all of it. Currently (to the best of my knowledge), proc macros don't have any way to emit warnings. They can kinda emit errors, via panic, but they can't emit multiple errors that way. I do understand that However, I think it's safe to say in general that the concept of "emit an error" or "emit a warning" is to some degree something we'll always want in some form, for as long as we have proc macros. We may wish to improve it in the future, but the core concept is fundamental to a compiler. That doesn't mean we need to stabilize all of this API, exactly as it currently is. But, by way of example, I think the following bits could be safely stabilized as a very simple subset:
That would be just enough to emit errors and warnings, using a single Span. There's a lot more that proc macros may want in the future, but I think the above subset should be safe to commit to. Thoughts? |
cc @rust-lang/wg-macros ^^
It is possible to emit As to the API, I think it would be worth revisiting the proposal from a few years back that I had a PR open to implement (that languished). Even if not stabilizing a large API all at once, there should at least be a general vision rather than simply stabilizing what happened to be submitted as a PR first. That's not to say that the API is bad, just that it should have more thought put into it. |
Fair enough. It sounds like there'd still be value in having a simple error/warning mechanism though.
Do you have a summary of that proposal's API surface? |
Buried in this issue. See #54140 (comment) and the subsequent few comments. What Sergio proposed is what was implemented (but not merged) in #83363. As far as I am aware, there hasn't been any meaningful discussion about the diagnostics API since then (the early 2023 comments). |
@jhpratt Sergio's proposal seems reasonable to me, but it does have a substantial API surface. The followup version from Mara has a much smaller surface area, though I agree with Sergio's comment that we'd need to at least specify what order things get emitted in. (I would propose "in order of creation", for simplicity.) I do think that deferring the I also think it may make sense to make these functions on That said, I'd happily sign off on either approach, in the same spirit of starting with an MVP. |
A copy of Mara's proposed API surface, for reference: pub fn error(span: impl Spans, message: impl Message) -> DiagnosticHandle;
pub fn warning(span: impl Spans, message: impl Message) -> DiagnosticHandle;
pub fn note(span: impl Spans, message: impl Message) -> DiagnosticHandle;
impl Message for String;
impl Message for &str;
impl Spans for Span;
impl<I: IntoIterator<Item = Span>> Spans for I;
impl Copy for DiagnosticHandle;
impl !Send for DiagnosticHandle;
impl !Sync for DiagnosticHandle; I'd personally propose |
Personally, I agree with @SergioBenitez's comments on Mara's proposal, and prefer the API @SergioBenitez proposal (although I care less about whether or not it is required to call There is one point of @SergioBenitez's proposal I dislike however: his // alternative name: ToSpan
pub trait Spanned {
fn span(&self) -> Span;
}
impl Diagnostic {
pub fn mark_all(self, item: impl Iterator<Item = impl Spanned>) -> Self;
} This leaves one gap in the API — joining spans. So lets address that separately: impl Span {
pub fn join(&self, other: impl Spanned) -> Option<Span>; // existing unstable method with modified arg type
pub fn join_all(iter: impl Iterator<Item = impl Spanned>) -> Option<Span>;
pub fn extend(&self, iter: impl Iterator<Item = impl Spanned>) -> Option<Span>; // optional extra
} Note: the new Mostly though, I agree with @joshtriplett that it would be good to get some traction on this. The plan to stabilise only |
/bikeshedding
|
I would be really happy to see process on this, as the current way to report errors and warnings for proc-macros is really not good. I would like to add a few points as input from user of proc-macros:
That's currently not true. You can generate a warning by using That brings me to another point: Generating warnings from proc macros is likely very use-full for a lot of use-cases. Nevertheless people likely want to use
An important point to keep in mind here is that most crates are built on top of |
When I suggested deferring Lints, lint groups, and allow/deny/forbid/etc for proc macros would be useful as well, but very much something we can defer from the initial implementation. And yes, there are ways to get output to the user. I mean that there's no current way to get a well-integrated warning that's associated with the user's code. |
I don't immediately recall what my concerns might have been then, but I'd guess it is about stabilizing an overly constrictive API that doesn't provide as many natural points of expansion (what would the api look like for using a |
We discussed this in today's @rust-lang/libs-api meeting. Several people in the meeting said that they didn't want to ship a design without a lint-style mechanism to associate warnings with a name (and namespace that name), allowing for the possibility of suppressing it in the future. (The suppression mechanism doesn't have to be finished before shipping, but the namespacing mechanism does.) Some in the meeting felt that warnings should require such a name or ID, rather than having it be optional. We discussed two possible designs for naming and namespacing, in the meeting.
Either way, the only path forward that had consensus in the meeting was to have such an ID mechanism for lints, with a namespacing mechanism that ensured two different proc macros wouldn't have conflicting names. |
@rustbot labels +T-lang +I-lang-nominated In the libs-api call today, it was discussed how this feature may overlap with matters that concern lang. Certainly some particular designs, e.g. those that would add namespacing within But more broadly, we've been thinking about a number of seemingly-related namespacing concerns, e.g. how to namespace attributes applied to fields for derive macros, the tooling namespace, etc. We may want to think holistically about this, or to encourage designs that fall within whatever direction we take here. And of course, many matters of the proc macro API are inherently lang concerns, as they affect the specification of the language and fall outside of what could otherwise be written in stable Rust. In the meeting, @dtolnay in particular mentioned too that he'd like to see lang involved here, and that the interesting question is perhaps who would be driving this to then present something to both teams. So let's tag this (and any follow-on stabilization or other significant PRs) for lang along with libs-api, and then let's nominate it so we can discuss briefly to build context on this. cc @rust-lang/lang |
Here is a skeleton of design 1 and design 2 from #54140 (comment). (The semantics are more interesting than the exact spelling used.) Static at compile-time of proc macro crateRe-exportable in macro namespace (or a fourth one?), similar to the name resolution of macros. Documentable by rustdoc. // foo_macros
#[proc_macro_warning]
static ambiguous_thing;
#[proc_macro_warning]
static ambiguities = [crate::ambiguous_thing];
#[proc_macro_derive(Foo)]
pub fn derive_foo(input: TokenStream) -> TokenStream {
if ... {
proc_macro::warning(crate::ambiguous_thing, span, "...");
}
}
// foo
pub use foo_macros::{ambiguities, ambiguous_thing, Foo};
pub trait Foo {...}
// downstream code
#![allow(foo::ambiguous_thing)]
use foo::Foo;
#[derive(Foo)]
... Static at compile-time of downstream crateNot re-exportable, similar to the name resolution of inert attributes. Documented manually in crate-level markdown. // foo_macros
#[proc_macro_derive(Foo)]
pub fn derive_foo(input: TokenStream) -> TokenStream {
setup_warnings();
if ... {
proc_macro::warning("foo::ambiguous_thing", span, "...");
}
}
fn setup_warnings() {
proc_macro::register_warning("foo::ambiguous_thing");
proc_macro::register_warning_group("foo::ambiguities", ["foo::ambiguous_thing"]);
}
// foo
pub use foo_macros::Foo;
pub trait Foo {...}
// downstream code (same as above)
#![allow(foo::ambiguous_thing)]
use foo::Foo;
#[derive(Foo)]
... |
#[proc_macro_warning]
static ambiguous_thing;
#[proc_macro_warning]
static ambiguities = [crate::ambiguous_thing]; This Or is this a type-level construction? But then Or something else entirely distinct from Rust's usual type model? This gets confusing. fn setup_warnings() {
proc_macro::register_warning("foo::ambiguous_thing");
proc_macro::register_warning_group("foo::ambiguities", ["foo::ambiguous_thing"]);
} Why bother with this — why not auto-detect lint names? Is it only for lint groups? There could be an automatic lint group for the proc-macro crate. This seems like a considerable amount of complexity for a feature I would not expect to have much usage. Proc-macro warnings are only applicable to macro input which is somehow valid yet incorrect, which surely doesn't occur all that often. Further, I wouldn't expect such lints to have frequent false positives (like, say, Clippy lints). |
Just a few quick reflections... If someone adds So I tend to feel that having runtime lint names and no sense of namespace identity or collision mitigation (behind string matching) is likely the simplest for the user, the smallest API and probably the most flexible option. I also don't immediately see any practical problems with allowing different crates to have a namespace collision:
This could give some very simple APIs: // (A) Simplest
proc_macro::warn(
"foo::bar",
span,
"My warning message",
);
// (B) Slightly more type-safe, but effectively still just a string.
// The lints could be defined centrally in a crate as const or static to avoid repetition.
proc_macro::warn(
proc_macro::LintId::new("foo::bar"),
span,
"My warning message",
); As for grouping, it wouldn't really make sense to define groups statically in this model, because they could clash... But you could possibly support adding groups at emit time to this specific lint, by saying "emit foo::bar which should also considered in groups foo::group1 and foo::group2": // (C) Option supporting groups
proc_macro::warn(
proc_macro::LintId::new("foo::specific").in_group("foo::group_1").in_group("foo::group_2"),
span,
"My warning message",
); And in terms of use-cases: I'm currently working on a library called preinterpret with a toolkit of simple, composable commands for code-generation (intended for use on its own or inside a declarative macro, and supports e.g. quote and paste functionality)... eventually I have aspirations that it could even be a go-to tool for writing declarative macros, but that's a little way off. In any case, I have been really interested in this topic - and also the span join API - for better error diagnostics. For what it's worth, runtime lint name resolution would also work better with preinterpret, as users using preinterpret to build their declarative macros could use their own lint names... Something like this: // This is just example preinterpret syntax, not anything I'm proposing for procedural macro APIs
macro_rules! emit_my_app_warning {
($message:literal) => {preinterpret!{
[!warning! { lint: "my_app::warning", message: $message, spans: [$message], sub_diagnostics: [] }]
}};
} |
Simultaneously responding to the previous 2 comments:
The issue with the approach @dhedey has shown, and the reason #54140 (comment) looks different than that, is you've only informed the compiler about a particular lint name if that lint is ever triggered. Imagine the caller writes: #![deny(foo::ambigous_thing)]
use foo::Foo;
#[derive(Foo)]
... Did you catch the typo? The compiler needs to know whether an arbitrary scoped lint name refers to a valid thing without it being triggered, in order to report In my 2 designs (#54140 (comment)), the first one conveys perfect information about what lint names exist. The second one conveys perfect information (that's the reason for
That's right. You would need to tell me more about why this can't work. This: #[proc_macro_warning]
static ambiguous_thing: LintId;
#[proc_macro_derive(Foo)]
pub fn derive_foo(input: TokenStream) -> TokenStream {...} would expand to something like: static ambiguous_thing: LintId = proc_macro::LintId::__new("ambiguous_thing");
const _: () = {
#[rustc_proc_macro_decls]
#[used]
static _DECLS: &[proc_macro::bridge::client::ProcMacro] = &[
proc_macro::bridge::client::ProcMacro::custom_derive("Foo", crate::derive_foo), // same as today
proc_macro::bridge::client::ProcMacro::warning(crate::ambiguous_thing),
];
}; |
Resolving Resolving the contents of @dtolnay mentioned above that the semantics were more important than "the exact spelling", but I would have found the example a little more comprehensible had more effort been paid to typing above. Thus, I'd propose the following to keep within Rust's existing type models: // The attribute macro here supplies a definition (and, if necessary, registers the lint identifier with the compiler).
#[proc_macro_warning]
static ambiguous_thing: LintId;
// The attribute macro *might* be required to register the lint group identifier, but it might also be completely unnecessary here.
#[proc_macro_warning]
static ambiguities: &[LintId] = &[crate::ambiguous_thing]; The group might be better typed as follows if we had support for #[proc_macro_warning]
static ambiguities: [LintId; _] = [crate::ambiguous_thing]; In particular, I don't think |
I think we should omit the name of the proc_macro from the lint id (so |
@bjorn3 I do not necessarily agree with that. It works that way in design 1 from #54140 (comment) (you refer to the warning through your local name of its crate) but not in design 2 (you refer to the warning through some hardcoded path declared by its crate). Consider that inert attributes also do not get renamed depending on how you import a crate locally. // [dependencies]
// my-serde = { package = "serde" }
#[derive(my_serde::Serialize)]
#[serde(deny_unknown_fields)] // not #[my_serde(...)]
struct Example {} I do prefer design 1 but both are good IMO. For the purpose of unblocking a first round of stabilization (#54140 (comment)), as soon as someone sends a PR to implement the following API then this can move forward, even if LintId is currently just ignored and is not usable downstream in allow/warn/deny. The important thing is that we do not allow the creation of a macro-generated warning with no id attached. use proc_macro::{TokenStream, WarningId};
#[proc_macro_warning]
static ambiguous_thing: WarningId; // LintId?
#[proc_macro_derive(Foo)]
pub fn derive_foo(input: TokenStream) -> TokenStream {
if ... {
// pass crate::ambiguous_thing as an argument to proc_macro::warning
// or Diagnostic::warning or whatever other entry point
}
} |
That is because inert attributes are not namespaced and the attribute name doesn't have any relation with the proc_macro name. Under the proposed interface proc macro lints would be namespaced however. And as such I expect the namespace to be renamed based on the crate name, but not the lint name to be renamed. So in a lint called |
Most of the recent discussion seems to focus around how to declare lint groups for the warning. I've not attended the meeting @joshtriplett wrote about but I would like to ask if it's really required to have a mechanism to declare these groups in such fine way from the beginning. After all the easiest possible solution would be to put all proc-macro emitted warnings in the same lint group, which would users allow to suppress these warnings, although not at the finest possible level. Another solution would be to derive these lint groups automatically based on the proc macro that emitted this lint. So a warning coming from Both proposals should be future proof for extending them with an explicit way to declare lint groups, but going with either of the proposals would hopefully unblock the underlying rather important feature of error handling and emitting warnings at all from proc-macros. I would allow to bikesheed the exact syntax of declaring these lint groups later on. |
This is a tracking issue for diagnostics for procedural macros spawned off from #38356.
Overview
Current Status
feature(proc_macro_diagnostic)
Next Steps
Summary
The initial API was implemented in #44125 and is being used by crates like Rocket and Diesel to emit user-friendly diagnostics. Apart from thorough documentation, I see two blockers for stabilization:
Multi-Span Support
At present, it is not possible to create/emit a diagnostic via
proc_macro
that points to more than oneSpan
. The internal diagnostics API makes this possible, and we should expose this as well.The changes necessary to support this are fairly minor: a
Diagnostic
should encapsulate aVec<Span>
as opposed to aSpan
, and thespan_
methods should be made generic such that either aSpan
or aVec<Span>
(ideally also a&[Vec]
) can be passed in. This makes it possible for a user to pass in an emptyVec
, but this case can be handled as if noSpan
was explicitly set.Lint-Associated Warnings
At present, if a
proc_macro
emits a warning, it is unconditional as it is not associated with a lint: the user can never silence the warning. I propose that we require proc-macro authors to associate every warning with a lint-level so that the consumer can turn it off.No API has been formally proposed for this feature. I informally proposed that we allow proc-macros to create lint-levels in an ad-hoc manner; this differs from what happens internally, where all lint-levels have to be known apriori. In code, such an API might look lIke:
The
lint!
macro might check for uniqueness and generate a (hidden) structure for internal use. Alternatively, the proc-macro author could simply pass in a string:"unknown_media_type"
.The text was updated successfully, but these errors were encountered: