fix: Don't stackoverflow on deeply nested expressions #4876

Marwes · 2022-06-14T17:12:13Z

Added stacker just in case another environment has a smaller stack size than my local environment we also grow
the stack automatically. We still have the depth limit to prevent the stack from growing unbounded.

The depth limit was chosen such that the added test did not stack overflow prior to adding stacker. This limit should be a decent approximation that any previous script works and is under the limit or if it is above the limit it would have stack overflowed.

Closes #4712

Marwes · 2022-09-09T13:14:24Z

Added back the second, original stack overflow check since the one in parser did not trigger the second test.

jsternberg

Question about how stacker interacts with cgo. The code looks fine and the depth limits seem appropriate, but I'm worried that dynamic increasing of the stack may not be well documented when happening within a cgo context which is where this code ends up being used.

jsternberg · 2022-09-09T14:31:46Z

libflux/flux-core/src/parser/mod.rs

+            None
+        } else {
+            // Numbers are arbitrary and were just picked from the stack docs
+            Some(stacker::maybe_grow(32 * 1024, 1024 * 1024, || f(self)))


Do we know if this stacker thing is safe regarding cgo? It seems like this is intended to grow the stack, but that stack was allocated by cgo.

Ah damn, I just added it as an additional safety. Maybe I should just remove it and accept the risk of stack overflow and if we see another crash due to it we just continue to lower the depth limit

I'd really like to learn if we can expand the stack. Please don't throw out this code, but maybe we should separate it and come up with a plan to understand how this interacts with cgo?

cgo does some strange things when it comes to the stack and I'm not sure I understand it in detail enough to know how it interacts with this. In summary, Go runs with pretty small stacks because the go runtime has its own way of growing the stack interactively. Since C code generally doesn't, the Go code pre-allocates a larger stack before calling into cgo code. The Rust code runs in this context.

It might be safe to use this because the stack should mostly look exactly like a C program would see it, but I just don't know enough about it.

I also think that cgo calls run in a different pthread from the goroutine that called it. So I think there's pthreads that are "cgo" pthreads (with a C stack) and goroutines run in their own pthreads. The goroutines are managed by the Go runtime to run within one of the pthreads (goroutines do not spawn new pthreads). The cgo code runs in its own pthread so when you make a cgo call you're either using one of the existing pthreads for cgo or it autospins up a new one. It then runs the code in the pthread and waits for the response.

stacker seems to work as it should when I run the crashing script through go run ./cmd/flux. Googling a bit I can't find anything that would say otherwise. Looking at the docs for stacker https://docs.rs/stacker/latest/stacker/ it seems to allocate a new stacck and run the closure there, returning to the orignal stack once the closure returns.

So I think it should be fine, I can't find any reason that it shouldn't work (and I wouldn't expect the stacker path to really execute since we also have the depth to guard against these kinds of expressions)

Ok that seems relatively safe. I don't think we need to do any substantial testing if it works by extending the stack with an extra thread.

Marwes · 2022-09-12T16:44:32Z

Seems to be some issues with wasm might just disable it on wasm, though it seems like stacker should work with wasm

Just in case another environment has a smaller stack size than my local environment we also grow the stack automatically. We still have the depth limit to prevent the stack from growing unbounded

Since this test has nested binary expressions it does not overflow the stack or hit the depth checking path in the parser so we also needed the second check.

Not sure why

Marwes mentioned this pull request Jun 14, 2022

formatter will stack overflow when formatting a long chain of or #4712

Closed

Marwes force-pushed the stackoverflow branch from 62fdaa8 to bf35a73 Compare July 6, 2022 13:44

Marwes force-pushed the stackoverflow branch 2 times, most recently from 37a8f0d to 88a3799 Compare September 9, 2022 12:49

Marwes marked this pull request as ready for review September 9, 2022 12:49

Marwes requested review from a team as code owners September 9, 2022 12:49

Marwes requested review from jsternberg and sunbryely-influxdata and removed request for a team September 9, 2022 12:49

Marwes force-pushed the stackoverflow branch 2 times, most recently from 57bb371 to 324c53e Compare September 9, 2022 13:12

jsternberg reviewed Sep 9, 2022

View reviewed changes

jsternberg approved these changes Sep 12, 2022

View reviewed changes

sunbryely-influxdata approved these changes Sep 15, 2022

View reviewed changes

Marwes force-pushed the stackoverflow branch 6 times, most recently from b703ce4 to 24f77a3 Compare September 16, 2022 11:48

Markus Westerlind added 6 commits September 19, 2022 16:19

fix: Don't stackoverflow on deeply nested expressions

0367d70

refactor: Avoid checking the string message of a fatal error

197ac3c

refactor: Move depth checking to the parser

42eb103

fix: Add stacker to manage stack growth in the parser

c3f0107

Just in case another environment has a smaller stack size than my local environment we also grow the stack automatically. We still have the depth limit to prevent the stack from growing unbounded

test: Add a second stack overflow test

3b640c0

fix: Ensure the second stack overflow issue errors

450bfe7

Since this test has nested binary expressions it does not overflow the stack or hit the depth checking path in the parser so we also needed the second check.

Markus Westerlind added 5 commits September 19, 2022 16:19

chore: make generate

3df9985

fix: Don't try to use stacker on wasm

9f2d3b1

chore: Don't use stacker on wasm

8b46828

chore: make generate

c7781ac

chore: Don't use stacker on windows either

4a0580c

Not sure why

Marwes force-pushed the stackoverflow branch from 24f77a3 to 4a0580c Compare September 19, 2022 14:20

Marwes merged commit 642ca27 into master Sep 19, 2022

Marwes deleted the stackoverflow branch September 19, 2022 16:36

onelson mentioned this pull request Oct 13, 2022

Test around/under flux depth limit #5282

Closed

nathanielc mentioned this pull request Oct 24, 2022

EPIC: Safely parse/execute queries with large nested expressions #5305

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Don't stackoverflow on deeply nested expressions #4876

fix: Don't stackoverflow on deeply nested expressions #4876

Marwes commented Jun 14, 2022 •

edited

Loading

Marwes commented Sep 9, 2022

jsternberg left a comment

jsternberg Sep 9, 2022

Marwes Sep 9, 2022

jsternberg Sep 9, 2022

Marwes Sep 9, 2022

jsternberg Sep 12, 2022

Marwes commented Sep 12, 2022

fix: Don't stackoverflow on deeply nested expressions #4876

fix: Don't stackoverflow on deeply nested expressions #4876

Conversation

Marwes commented Jun 14, 2022 • edited Loading

Marwes commented Sep 9, 2022

jsternberg left a comment

Choose a reason for hiding this comment

jsternberg Sep 9, 2022

Choose a reason for hiding this comment

Marwes Sep 9, 2022

Choose a reason for hiding this comment

jsternberg Sep 9, 2022

Choose a reason for hiding this comment

Marwes Sep 9, 2022

Choose a reason for hiding this comment

jsternberg Sep 12, 2022

Choose a reason for hiding this comment

Marwes commented Sep 12, 2022

Marwes commented Jun 14, 2022 •

edited

Loading