Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Properly escape multi-segment path parameters #869

Merged
merged 8 commits into from
Mar 28, 2024

Conversation

mgyucht
Copy link
Contributor

@mgyucht mgyucht commented Mar 26, 2024

Changes

Currently, path parameters are directly interpolated into the request URL without escaping. This means that characters like /, ? and # will not be percent-encoded and will affect the semantics of the URL, starting a new path segment, query parameters, or fragment, respectively. This means that it is impossible for users of the Files API to upload/download objects that contain ? or # in their name. / is allowed in the path of the Files API, so it does not need to be escaped.

The Files API is currently marked with x-databricks-multi-segment, indicating that it should be permitted to have / characters but other characters need to be percent-encoded. This PR implements this.

This PR builds on #868, correcting the test names for UC files.

Tests

  • Unit test for multi-segment path escaping behavior.
  • Updated integration test to use # and ? symbols in the file name. This failed on main but succeeded on this PR.

@mgyucht mgyucht changed the title Properly escape multi-segment paths Properly escape multi-segment path parameters Mar 26, 2024
@codecov-commenter
Copy link

Codecov Report

Attention: Patch coverage is 66.66667% with 8 lines in your changes are missing coverage. Please review.

Project coverage is 8.01%. Comparing base (a823ca3) to head (1af59d4).

Files Patch % Lines
service/files/impl.go 0.00% 8 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff            @@
##            main    #869      +/-   ##
========================================
+ Coverage   8.00%   8.01%   +0.01%     
========================================
  Files        258     258              
  Lines      64416   64427      +11     
========================================
+ Hits        5155    5166      +11     
  Misses     58966   58966              
  Partials     295     295              

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@@ -109,6 +109,18 @@ func makeQueryString(data interface{}) (string, error) {
return "", fmt.Errorf("unsupported query string data: %#v", data)
}

func EncodeMultiSegmentPathParameter(p string) string {
segments := strings.Split(p, "/")
b := strings.Builder{}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This can be preallocated to at least len(p).

@mgyucht mgyucht added this pull request to the merge queue Mar 28, 2024
Merged via the queue into main with commit bfdcd43 Mar 28, 2024
5 checks passed
@mgyucht mgyucht deleted the support-multi-segment-paths branch March 28, 2024 08:48
github-merge-queue bot pushed a commit to databricks/databricks-sdk-py that referenced this pull request Mar 28, 2024
## Changes
Ports databricks/databricks-sdk-go#869 to Python
SDK. Subsumes #592.

Currently, path parameters are directly interpolated into the request
URL without escaping. This means that characters like /, ? and # will
not be percent-encoded and will affect the semantics of the URL,
starting a new path segment, query parameters, or fragment,
respectively. This means that it is impossible for users of the Files
API to upload/download objects that contain ? or # in their name. / is
allowed in the path of the Files API, so it does not need to be escaped.

The Files API is currently marked with x-databricks-multi-segment,
indicating that it should be permitted to have / characters but other
characters need to be percent-encoded. This PR implements this.

## Tests

- [x] Unit test for multi-segment path escaping behavior.
- [x] Updated integration test to use # and ? symbols in the file name.
This failed on main but succeeded on this PR.
github-merge-queue bot pushed a commit to databricks/databricks-sdk-java that referenced this pull request Mar 28, 2024
## Changes
Ports databricks/databricks-sdk-go#869 to Java
SDK.

Currently, path parameters are directly interpolated into the request
URL without escaping. This means that characters like `/`, `?` and `#`
will not be percent-encoded and will affect the semantics of the URL,
starting a new path segment, query parameters, or fragment,
respectively. This means that it is impossible for users of the Files
API to upload/download objects that contain `?` or `#` in their name.
`/` is allowed in the path of the Files API, so it does not need to be
escaped.

The Files API is currently marked with `x-databricks-multi-segment`,
indicating that it should be permitted to have `/` characters but other
characters need to be percent-encoded. This PR implements this.

## Tests
- [x] Unit test for multi-segment path escaping behavior.
- [x] Updated integration test to use # and ? symbols in the file name.
vikrantpuppala pushed a commit to vikrantpuppala/databricks-sdk-java that referenced this pull request Apr 23, 2024
## Changes
Ports databricks/databricks-sdk-go#869 to Java
SDK.

Currently, path parameters are directly interpolated into the request
URL without escaping. This means that characters like `/`, `?` and `#`
will not be percent-encoded and will affect the semantics of the URL,
starting a new path segment, query parameters, or fragment,
respectively. This means that it is impossible for users of the Files
API to upload/download objects that contain `?` or `#` in their name.
`/` is allowed in the path of the Files API, so it does not need to be
escaped.

The Files API is currently marked with `x-databricks-multi-segment`,
indicating that it should be permitted to have `/` characters but other
characters need to be percent-encoded. This PR implements this.

## Tests
- [x] Unit test for multi-segment path escaping behavior.
- [x] Updated integration test to use # and ? symbols in the file name.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants