fix(unpivot): _field should not be added as a group key by default #5272

skartikey · 2022-10-11T14:05:02Z

if _field is in otherColumns parameter, _field should not be made group key by default
The issue was discovered during the iox group push down work

Checklist

Dear Author 👋, the following checks should be completed (or explicitly dismissed) before merging.

✏️ Write a PR description, regardless of triviality, to include the value of this PR
🔗 Reference related issues
🏃 Test cases are included to exercise the new code
🧪 If new packages are being introduced to stdlib, link to Working Group discussion notes and ensure it lands under experimental/
📖 If language features are changing, ensure docs/Spec.md has been updated

Dear Reviewer(s) 👋, you are responsible (among others) for ensuring the completeness and quality of the above before approval.

wolffcm

I think this is pretty close, but the logic is not quite right.

wolffcm · 2022-10-11T19:04:05Z

stdlib/experimental/experimental_test.flux

+                    f0: 20.1,
+                    f1: 20.2,
+                    _time: 2018-12-01T00:00:10Z,
+                    _field: "load1",


For this to be a representative test case, there shouldn't be a _field column in the input to unpivot, since pivoted data does generally not have a _field column.

Thinking it through, if there is a _field column in the input an error seems to be appropriate? Since there is no way to produce output that does not overwrite the data that already exists in _field. Thoughts?

I think it makes sense to show an error if the input to unpivot contains _field in it. wondering if we have to do the same for _value column?

The reason I have to put up this PR is that the pushdown group test was failing because it contains _field in the input.

The fix would be to throw a user error when there is _field or _value column in the input to unpivot.

At the moment, if a column is present in the otherColumns and missing in the input column, the code throws an error.

import "array" import "experimental" import "testing" array.from( rows: [ { _measurement: "m", tag: "t1", f0: 10.1, f1: 10.2, _time: 2018-12-01T00:00:00Z, }, { _measurement: "m", tag: "t1", f0: 20.1, f1: 20.2, _time: 2018-12-01T00:00:10Z, }, ], ) |> group(columns: ["_measurement"]) |> experimental.unpivot(otherColumns: ["_time", "tag", "_field"])

Result: _result Error: runtime error @24:8-24:70: unpivot: unpivot could not find column named "_field"

wolffcm · 2022-10-11T19:11:26Z

stdlib/experimental/unpivot.go

-		columns = append(columns, flux.ColMeta{Label: "_field", Type: flux.TString})
+		if defaultFieldColLen != 0 {
+			columns = append(columns, flux.ColMeta{Label: influxdb.DefaultFieldColLabel, Type: flux.TString})
+		}


This logic doesn't seem quite right to me. If there is no _field column in the input, then the _field column will never get created. We want to create it regardless, we just want to omit the column from groupCols and groupValues below if _field is in the otherCols parameter.

No, you’ve got it wrong. _field will only 'not' get created when it is present in the otherColumns, rest of the cases it will always get created.

import "array" import "experimental" import "testing" array.from( rows: [ { _measurement: "m", tag: "t1", f0: 10.1, f1: 10.2, _time: 2018-12-01T00:00:00Z, }, { _measurement: "m", tag: "t1", f0: 20.1, f1: 20.2, _time: 2018-12-01T00:00:10Z, }, ], ) |> group(columns: ["_measurement"]) |> experimental.unpivot(otherColumns: ["_time", "tag"])

Result: _result Table: keys: [_measurement, _field] _measurement:string _field:string _time:time tag:string _value:float ---------------------- ---------------------- ------------------------------ ---------------------- ---------------------------- m f0 2018-12-01T00:00:00.000000000Z t1 10.1 m f0 2018-12-01T00:00:10.000000000Z t1 20.1 Table: keys: [_measurement, _field] _measurement:string _field:string _time:time tag:string _value:float ---------------------- ---------------------- ------------------------------ ---------------------- ---------------------------- m f1 2018-12-01T00:00:00.000000000Z t1 10.2 m f1 2018-12-01T00:00:10.000000000Z t1 20.2

if _field is in otherColumns parameter, _field should not be made group key

wolffcm

Looks good, thanks.

skartikey requested a review from a team as a code owner October 11, 2022 14:05

skartikey requested review from wolffcm and removed request for a team October 11, 2022 14:05

wolffcm suggested changes Oct 11, 2022

View reviewed changes

skartikey added 2 commits October 13, 2022 14:04

fix(unpivot): _field should not be added as a group key by default

72685d6

if _field is in otherColumns parameter, _field should not be made group key

fix: unpivot can't have _value or _field as an input column

29c4d23

skartikey force-pushed the skartikey-flux-unpivot branch from ce7d364 to 29c4d23 Compare October 13, 2022 13:04

skartikey requested a review from wolffcm October 13, 2022 13:06

wolffcm approved these changes Oct 13, 2022

View reviewed changes

skartikey merged commit 29766af into master Oct 13, 2022

skartikey deleted the skartikey-flux-unpivot branch October 13, 2022 19:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(unpivot): _field should not be added as a group key by default #5272

fix(unpivot): _field should not be added as a group key by default #5272

skartikey commented Oct 11, 2022

wolffcm left a comment

wolffcm Oct 11, 2022

wolffcm Oct 11, 2022

skartikey Oct 12, 2022

wolffcm Oct 11, 2022

skartikey Oct 12, 2022 •

edited

Loading

wolffcm left a comment

fix(unpivot): _field should not be added as a group key by default #5272

fix(unpivot): _field should not be added as a group key by default #5272

Conversation

skartikey commented Oct 11, 2022

Checklist

wolffcm left a comment

Choose a reason for hiding this comment

wolffcm Oct 11, 2022

Choose a reason for hiding this comment

wolffcm Oct 11, 2022

Choose a reason for hiding this comment

skartikey Oct 12, 2022

Choose a reason for hiding this comment

wolffcm Oct 11, 2022

Choose a reason for hiding this comment

skartikey Oct 12, 2022 • edited Loading

Choose a reason for hiding this comment

wolffcm left a comment

Choose a reason for hiding this comment

skartikey Oct 12, 2022 •

edited

Loading