evalengine: Support built-in MySQL function CEIL() #11027

Weijun-H · 2022-08-17T08:35:05Z

Signed-off-by: Weijun-H huangweijun1001@gmail.com

Description

Implement support CEIL() to understand how the evaluation engine works

Related Issue(s)

evalengine: Support all built-in MySQL functions #9647

Checklist

"Backport me!" label has been added if this change should be backported
Tests were added or are not required
Documentation was added or is not required

Deployment Notes

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

vitess-bot · 2022-08-17T08:35:07Z

vmg

Hey Alex! Off to a good start! I left you some feedback on the things that need fixing.

vmg · 2022-08-17T16:07:11Z

go/vt/vtgate/evalengine/func.go

+	if sqltypes.IsIntegral(argtype) {
+		inarg.makeFloat()
+		result.setInt64(int64(math.Ceil(inarg.float64())))
+	} else if sqltypes.Decimal == argtype {


This is a small nit, but we don't have any reverse comparisons in the Vitess codebase. As you may be aware of, reverse comparisons have their origin in the C family of languages because if (argtype = sqltypes.Decimal) is valid syntax and does something very different than if (argtype == sqltypes.Decimal), so reversing the comparison may prevent a bad bug from being deployed.

However, Go does not allow assignments in if expressions, so we always write if argtype == sqltypes.Decimal { -- there's no point on doing otherwise except making the code harder to read!

vmg · 2022-08-17T16:11:47Z

go/vt/vtgate/evalengine/func.go

+
+	if sqltypes.IsIntegral(argtype) {
+		inarg.makeFloat()
+		result.setInt64(int64(math.Ceil(inarg.float64())))


I don't see how this could possibly do anything! The result of converting an integral to a float64 will always be a round number, so calling math.Ceil on it will be a no-op. The whole roundtrip from integral -> float -> integral seems to be superfluous.

Have you found any corner cases that that roundtrip handles properly? From testing in the MySQL CLI, CEIL on an integer looks like a noop -- it always returns the same integer.

vmg · 2022-08-17T16:21:24Z

go/vt/vtgate/evalengine/func.go

+		if num.Cmp(decimal.NewFromInt(math.MaxInt32)) == 1 || num.Cmp(decimal.NewFromInt(math.MinInt32)) == -1 {
+			if num.Sign() == 1 {
+				result.setDecimal(num.Add(decimal.NewFromInt(1)), 0)
+			} else {
+				result.setDecimal(num.Add(decimal.NewFromInt(-1)), 0)
+			}
+		} else {
+			floatpart, _ := num.Float64()
+			result.setInt64(int64(math.Ceil(floatpart)))
+		}


Can you explain a little bit what are you trying to do here? It seems like you're using different rounding modes for decimals that are larger than INT32_MAX or smaller than INT32_MIN -- otherwise you're just rounding the decimal as if it were a float...

I don't think this is exactly MySQL's behavior. Are you attempting an optimization here? I.e. doing the float rounding if it would always be safe to do so? I'm afraid this won't behave like you're expecting it to... Converting a decimal to a float64 is super expensive! The comparisons against math.MaxInt32 and math.MaxInt32 are also expensive, particularly because they allocate two temporary decimals, and the num.Float64 would tell you anyway if the conversion to float64 was safe to perform.

I think this is a case where the simplest code would would also be the safest: you can add (or subtract) one from the decimal and truncate the decimals, and that should give you a good ceil... but here's an even better tip: if you check the implementation of our decimal package, you'll see that we've adapted an existing arbitrary-precision decimal implementation to behave like MySQL's. Maybe the original implementation has a Ceil function you could port over? :)

I am confused with the mysql behaviors here. For example, when I test the value like 1.5

/home/hwj/cncf/vitess/go/vt/vtgate/evalengine/integration/comparison_test.go:710: different results: DECIMAL(2); mysql response: INT64(2) (local collation: binary; mysql collation: binary) query: SELECT CEIL(1.5) (SIMPLIFY=true)

Then I checked the type of the 1.5, it is not the FLOAT, but Demical. Do I misunderstand the process here?

From the MySQL docs:

For exact-value numeric arguments, the return value has an exact-value numeric type. For string or floating-point arguments, the return value has a floating-point type.

It seems like rounding a DECIMAL, which is an exact value numeric type, will return INT64 if the result is small enough to fit, otherwise a DECIMAL.

It it tricky to detect when to return INT, because:

"9223372036854775810.4", // bigger than MAXINT64, return DECIMAL "7223372036854775010.1", // bigger than MAXINT32, return DECIMAL "2147483648.1", // bigger than MAXINT32, return INT

All you have to check is whether the result of the rounding fits in int64 or not -- there are no intermediate types. If it fits, it is returned as INT64, if not as DECIMAL.

When the type of args is DECIMAL, but the result is INT64. How could we return the true sqltypes.Type in the type function. My current solution:

func (builtinCeil) call(env *ExpressionEnv, args []EvalResult, result *EvalResult) { inarg := &args[0] argtype := inarg.typeof() if inarg.isNull() { result.setNull() return } if sqltypes.IsIntegral(argtype) { result.setInt64(inarg.int64()) } else if sqltypes.Decimal == argtype { num := inarg.decimal() num = num.Ceil() intnum, isfit := num.Int64() if isfit { inarg.makeSignedIntegral() result.setInt64(intnum) } else { result.setDecimal(num, 0) } } else { inarg.makeFloat() result.setFloat(math.Ceil(inarg.float64())) } } func (builtinCeil) typeof(env *ExpressionEnv, args []Expr) (sqltypes.Type, flag) { if len(args) != 1 { throwArgError("CEIL") } t, f := args[0].typeof(env) if sqltypes.IsIntegral(t) { return sqltypes.Int64, f } else if sqltypes.Decimal == t { return sqltypes.Decimal, f } else { return sqltypes.Float64, f } }

How could we catch the new return type, like the test1.5?

Huh, that's a good question. MySQL's behavior here is really bad... Our typeof helpers cannot (should not, really) return variable types for the same input, so we're gonna have to hack it and do something different to MySQL... Let's assume for now that typeof returns Decimal for Decimal. That'll give some bad typeof results from the tests, and we'll have to ignore them, but I think that's the best we can do for now.

The inarg.makeSignedIntegral() line is superfluous by the way. You can remove it.

vmg · 2022-08-17T16:23:36Z

go/vt/vtgate/evalengine/integration/comparison_test.go

+		"7223372036854775010.1",
+		"2147483640.1",
+		"9223372036854775010.4",
+	}


Good start with the tests. Can you think of more corner cases that would be worth adding here? Larger negative floats? Larger negative decimals?

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

vmg · 2022-08-22T13:31:55Z

I don't think your Ceil implementation is quite correct. Here's a big hint: https://github.com/shopspring/decimal/blob/f55dd564545cec84cf84f7a53fb3025cdbec1c4f/decimal.go#L1288-L1305

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

Weijun-H · 2022-09-02T16:01:22Z

I don't think your Ceil implementation is quite correct. Here's a big hint: https://github.com/shopspring/decimal/blob/f55dd564545cec84cf84f7a53fb3025cdbec1c4f/decimal.go#L1288-L1305

Hi @vmg, I have modified Ceil function, could you have a look?

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

vmg · 2022-09-12T10:34:15Z

We'll need to ignore the bad typeof results from the tests. 👍

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

vmg · 2022-09-13T11:46:32Z

Great job on your first PR! It wasn't an easy one!

support CEIL()

a146d7c

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

Weijun-H marked this pull request as ready for review August 17, 2022 08:45

Weijun-H requested review from harshit-gangal, systay and frouioui as code owners August 17, 2022 08:45

vmg self-assigned this Aug 17, 2022

vmg requested changes Aug 17, 2022

View reviewed changes

Weijun-H added 2 commits August 19, 2022 15:31

redesign the ceil function

b249cde

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

update tests

7def15c

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

Weijun-H requested review from vmg and removed request for systay, harshit-gangal and frouioui August 19, 2022 15:35

fix bugs for integer

1ea15f6

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

update ceil function in decimal

9670a95

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

Weijun-H added 2 commits September 3, 2022 20:51

add the support for ceiling function

97ae0d1

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

add support for CEILING

1db4cf6

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

update test related to the bad typeof

1aed930

Signed-off-by: Weijun-H <huangweijun1001@gmail.com>

vmg added Type: Feature Component: Evalengine changes to the evaluation engine labels Sep 13, 2022

vmg approved these changes Sep 13, 2022

View reviewed changes

vmg merged commit babd1d1 into vitessio:main Sep 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

evalengine: Support built-in MySQL function CEIL() #11027

evalengine: Support built-in MySQL function CEIL() #11027

Weijun-H commented Aug 17, 2022

vitess-bot bot commented Aug 17, 2022

vmg left a comment

vmg Aug 17, 2022

vmg Aug 17, 2022

vmg Aug 17, 2022

Weijun-H Aug 18, 2022

vmg Aug 18, 2022

Weijun-H Aug 18, 2022

vmg Aug 18, 2022

Weijun-H Aug 18, 2022 •

edited

Loading

vmg Aug 18, 2022

vmg Aug 18, 2022

vmg Aug 17, 2022

vmg commented Aug 22, 2022

Weijun-H commented Sep 2, 2022 •

edited

Loading

vmg commented Sep 12, 2022

vmg commented Sep 13, 2022

evalengine: Support built-in MySQL function CEIL() #11027

evalengine: Support built-in MySQL function CEIL() #11027

Conversation

Weijun-H commented Aug 17, 2022

Description

Related Issue(s)

Checklist

Deployment Notes

vitess-bot bot commented Aug 17, 2022

Review Checklist

General

Bug fixes

Non-trivial changes

New/Existing features

Backward compatibility

vmg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Weijun-H Aug 18, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vmg commented Aug 22, 2022

Weijun-H commented Sep 2, 2022 • edited Loading

vmg commented Sep 12, 2022

vmg commented Sep 13, 2022

Weijun-H Aug 18, 2022 •

edited

Loading

Weijun-H commented Sep 2, 2022 •

edited

Loading