Fix handling of escape codes in paths #118

ahl · 2021-06-22T21:44:47Z

Previously we had no special handling for HTTP path escape codes. This implicitly required consumers to do all their own decoding, which they surely were not doing. This PR adds proper handling for input paths, i.e. paths that arrive as the result of a query, and it enforces stricter requirements for the paths that consumers specify for endpoints. For example, we strip out "//" for a query, but panic!() if a consumer specifies an endpoint path that contains "//" (note that we also panic if a consumer tries to register two handlers for the same path).
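
To make the lenient/strict split concrete, here is a minimal sketch of the two behaviors described above. It is not the code from this PR: the function names are borrowed from the review discussion, but the bodies, return types, and the hand-rolled `percent_decode` helper are assumptions chosen to keep the example dependency-free.

```rust
/// Hypothetical helper: decode `%XX` escapes in a single path segment.
/// Returns None for a malformed escape or if the result is not valid UTF-8.
fn percent_decode(segment: &str) -> Option<String> {
    let bytes = segment.as_bytes();
    let mut out = Vec::with_capacity(bytes.len());
    let mut i = 0;
    while i < bytes.len() {
        if bytes[i] == b'%' {
            // Exactly two hex digits must follow the '%'.
            let hex = bytes.get(i + 1..i + 3)?;
            if !hex.iter().all(|b| b.is_ascii_hexdigit()) {
                return None;
            }
            let hex = std::str::from_utf8(hex).ok()?;
            out.push(u8::from_str_radix(hex, 16).ok()?);
            i += 3;
        } else {
            out.push(bytes[i]);
            i += 1;
        }
    }
    String::from_utf8(out).ok()
}

/// Lenient handling for request paths (assumed behavior): empty segments
/// produced by "//" are dropped, then each segment is decoded. Splitting
/// happens before decoding so that "%2F" never acts as a separator.
fn input_path_to_segments(path: &str) -> Result<Vec<String>, String> {
    path.split('/')
        .filter(|s| !s.is_empty())
        .map(|s| percent_decode(s).ok_or_else(|| format!("bad escape in {:?}", s)))
        .collect()
}

/// Strict handling for consumer-specified route paths (assumed behavior):
/// a "//" is a programming error, so we panic instead of repairing it.
fn route_path_to_segments(path: &str) -> Vec<&str> {
    assert!(!path.contains("//"), "route path {:?} contains \"//\"", path);
    path.split('/').filter(|s| !s.is_empty()).collect()
}
```

Under these assumptions, `input_path_to_segments("/a//b%20c/")` yields the segments `a` and `b c`, while `route_path_to_segments("/a//b")` panics.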

smklein

One of those patches where I really appreciate having all the tests - thank you!

Looks good, just a handful of comments.

dropshot/src/api_description.rs

dropshot/src/router.rs

smklein · 2021-06-22T23:52:01Z

dropshot/src/router.rs

+ * We use this type to avoid confusion with paths used to define routes.
+ */
+#[derive(Debug)]
+pub struct InputPath<'a>(&'a str);


I have no issue with this current use of typing, but I feel like I've usually seen the strong types used as an "output" of parsing functions, rather than input. IMO that seems more useful, since it propagates the notion that "this &str has been validated", whereas I think InputPath is currently truly indistinguishable from an arbitrary &str.

(I'm kinda eyeing route_path_to_segments and input_path_to_segments below - it does seem like both could take an arbitrary &str as input, but the results kinda have different implications)

ahl · 2021-06-23

I don't have a strong feeling about what would be most idiomatic here. I wanted to "quarantine" the input type to avoid accidental use of it. I agree that it's all a little meh. I'd love any input regarding how to improve it.

smklein · 2021-06-23

I wanna give the caveat - I think this patch is decidedly a net improvement, so this comment shouldn't be a blocker!

The reason I'm a little skeptical of the current usage is that I don't think the quarantining is actually happening very clearly (aside from requiring a bunch of .into() calls). Any function that operates on an InputPath may as well operate on a &str, since there is (seemingly?) no check or consideration to distinguish them.
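
A small sketch of the concern, assuming `InputPath` is converted from `&str` through a `From` impl (suggested by the `.into()` calls mentioned above, but not shown in the quoted diff). The `segment_count` function is hypothetical and exists only to have something that consumes the wrapper.

```rust
#[derive(Debug)]
pub struct InputPath<'a>(&'a str);

// Assumed conversion: nothing is checked, so every &str becomes an InputPath.
impl<'a> From<&'a str> for InputPath<'a> {
    fn from(s: &'a str) -> Self {
        InputPath(s)
    }
}

/// Hypothetical consumer of the wrapper type.
fn segment_count(path: InputPath<'_>) -> usize {
    path.0.split('/').filter(|s| !s.is_empty()).count()
}
```

Because the conversion is infallible, `segment_count("not even a path".into())` compiles and runs just like `segment_count("/a/b".into())`; the type documents intent at the call site but guarantees nothing about the contents.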

IMO, the path that's the result of a "user input" actually is a str. If we want to distinguish that from a route-defining path, which has a higher burden of validation, perhaps that is the type we should abstract more cautiously.

As an example, route_path_to_segments returns a Vec<&str>, but each element carries the implicit guarantee that "this is actually a non-empty &str, without / characters".

We could make that more explicit by returning a Vec<Segment>, where Segment is a &str, but it can be only constructed post-validation.

Similarly, input_path_to_segments could return a Vec<Segment>, to indicate that each element is a percent-decoded str that is neither "." nor "..".

(This actually raises the question - should these functions actually be returning distinct Segment types? They appear to be performing slightly different validations)
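
One way the suggestion could look, as a sketch rather than a proposal for the actual dropshot code: the type names `RouteSegment` and `InputSegment`, the constructors, and the exact checks are all hypothetical. The fields are private to the module, so a value of either type can only be obtained by passing the corresponding validation.

```rust
mod segments {
    /// Segment of a route-defining path: non-empty and free of '/'.
    #[derive(Debug, PartialEq)]
    pub struct RouteSegment<'a>(&'a str);

    impl<'a> RouteSegment<'a> {
        /// The only way to build a RouteSegment outside this module.
        pub fn parse(s: &'a str) -> Result<Self, String> {
            if s.is_empty() || s.contains('/') {
                return Err(format!("invalid route segment {:?}", s));
            }
            Ok(RouteSegment(s))
        }

        pub fn as_str(&self) -> &'a str {
            self.0
        }
    }

    /// Segment of a request path that has already been percent-decoded.
    /// It is never empty, ".", or "..", but it may contain a '/' that
    /// was encoded as "%2F" in the original request.
    #[derive(Debug, PartialEq)]
    pub struct InputSegment(String);

    impl InputSegment {
        /// The only way to build an InputSegment outside this module.
        pub fn from_decoded(s: String) -> Result<Self, String> {
            if s.is_empty() || s == "." || s == ".." {
                return Err(format!("invalid input segment {:?}", s));
            }
            Ok(InputSegment(s))
        }

        pub fn as_str(&self) -> &str {
            &self.0
        }
    }
}
```

The two invariants differ in this sketch (a decoded input segment may legitimately contain '/', a route segment may not), which is one argument for keeping the types distinct instead of sharing a single `Segment`.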

FWIW: I think this pattern is generally the one Cliff calls "the typestate pattern".

http://cliffle.com/blog/rust-typestate/

https://lexi-lambda.github.io/blog/2019/11/05/parse-don-t-validate/

ahl · 2021-06-23T06:22:09Z

@smklein thanks for the thorough review. I've pushed a new commit that I think addresses your feedback.

smklein · 2021-06-23T15:16:22Z

> @smklein thanks for the thorough review. I've pushed a new commit that I think addresses your feedback.

One last comment on the InputPath type, but LGTM!

ahl · 2021-06-23T18:06:57Z

I'm going to attempt to revisit the discussion of InputPath in #110

ahl added 2 commits June 22, 2021 14:36

fix handling of escape codes in paths

2486f79

update changelog

9fc4900

smklein approved these changes Jun 22, 2021

feedback from @smklein

489c7bc

more nits

d3efcbc

ahl merged commit feea258 into main Jun 23, 2021

ahl deleted the escapism branch June 23, 2021 18:06

iliana mentioned this pull request Apr 23, 2024

set content-security-policy, x-content-type-options, and x-frame-options headers for console assets oxidecomputer/omicron#5545

Merged
