Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test: shared lib build doesn't handle SIGPIPE #19211

Closed
wants to merge 1 commit into from

Conversation

yhwang
Copy link
Member

@yhwang yhwang commented Mar 7, 2018

For shared lib build, we leave the signal handling for embedding users.
In these two test cases:

  • parallel/test-process-external-stdio-close-spawn
  • parallel/test-process-external-stdio-close

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends. Need to modify these two
test cases to check the SIGPIPE instead.

Refs: #18535

Signed-off-by: Yihong Wang yh.wang@ibm.com

Checklist
  • make -j4 test (UNIX), or vcbuild test (Windows) passes
  • commit message follows commit guidelines

@nodejs-github-bot nodejs-github-bot added the test Issues and PRs related to the tests. label Mar 7, 2018
@gireeshpunathil
Copy link
Member

@yhwang : can you please clarify - which one is embedded here: the parent or the child, or both?

  • if parent, the child's exit code and signal are of no difference, right?
  • if child, the proposed assertions on child will form expectations from unknown right?
  • if both, how is it different from the existing code?

@gireeshpunathil gireeshpunathil added the embedding Issues and PRs related to embedding Node.js in another project. label Mar 8, 2018
@bnoordhuis
Copy link
Member

This doesn't look correct to me. The signal should be handled by the binary because otherwise it's practically useless, the first aborted connection would kill it.

@yhwang
Copy link
Member Author

yhwang commented Mar 8, 2018

The case here is both parent and child node executable are using shared lib. And the related code for signal handling is here: https://github.com/nodejs/node/blob/master/src/node.cc#L4103-L4117 and it's surrounded by #ifndef NODE_SHARED_MODE, which means shared lib users should handle that by their own.

For these two test cases, the parent create pipes for child's stdin, stdout and stderr. But after spawn, parent destroys the stdout. And in the console.js, it catches the error and ignore it here: https://github.com/nodejs/node/blob/master/lib/console.js#L110-L132 . So, in regular build (not using shared lib), when the child tries to write to stdout in line 13: console.log('logging should not cause a crash'); it hits the SIGPIPE and handle it (actually, ignores it) then the child process exits with status code 0 and null for term_signal.

Because the case here is both parent and child are shared lib build, the child doesn't have signal handler registration (code in node.cc L110-L132 that I mentioned above), so the child directly exits when it writes to the stdout. The exit code and term signal that parent receives are status code: 1 and term sign: SIGPIPE.

I think there are 2 ways to fix this, one is to modify the test case like what I did in this change. And another one is to modify the node_main.cc to handle the signal if it's shared lib build (#ifdef NODE_SHARED_MODE). Because I just borrow main() function from node_main.cc for the shared lib build, I don't think changing node_main.cc for this case is a good idea. So, I chose the first one :-).

jasnell
jasnell previously approved these changes Mar 8, 2018
@bnoordhuis
Copy link
Member

I understand what you did and why but IMO it's papering over the real issue. This PR fixes the tests but it's only a matter of time before it reappears. I suggest moving some of the signal handling code from node.cc to node_main.cc.

@yhwang
Copy link
Member Author

yhwang commented Mar 8, 2018

@bnoordhuis I agree with you totally. However, for now, the node_main.cc is used to build the executable for shared lib temporally. We should have an example node_shared_main.cc for the embedding users and also provide some tips about how to build the shared lib, link the shared lib and what things they need to handle when using shared lib.

If you think node_shared_main.cc is not necessary, I can refactor the signal handling code in node.cc as a public API and call it in node_main.cc for shared lib case.

@bnoordhuis
Copy link
Member

I'd start out simple and move the sigaction(SIGPIPE, ...) to node_main.cc, that fixes the immediate issue.

@yhwang
Copy link
Member Author

yhwang commented Mar 8, 2018

@bnoordhuis sure! For double check,

move the sigaction(SIGPIPE, ...) to node_main.cc

I guess you were saying copy the sigaction(SIGPIPE,...) to node_main.cc with #ifdef NODE_SHARED_MODE, right?

@bnoordhuis
Copy link
Member

I guess you were saying copy the sigaction(SIGPIPE,...) to node_main.cc with #ifdef NODE_SHARED_MODE, right?

No, because both builds need it. If you move it, you can drop the corresponding code from node.cc that's currently guarded by a #ifndef NODE_SHARED_MODE.

It's currently a loop that does this:

if (nr == SIGKILL || nr == SIGSTOP) continue;
act.sa_handler = (nr == SIGPIPE) ? SIG_IGN : SIG_DFL;
CHECK_EQ(0, sigaction(nr, &act, nullptr));

And you could change that to:

if (nr == SIGKILL || nr == SIGSTOP || nr == SIGPIPE) continue;
act.sa_handler = SIG_DFL;
CHECK_EQ(0, sigaction(nr, &act, nullptr));

But please leave a comment explaining why SIGPIPE can safely be skipped.

I suppose a case could be made that much of PlatformInit() properly belongs in node_main.cc rather than node.cc but let's start small.

@yhwang
Copy link
Member Author

yhwang commented Mar 8, 2018

I see. But only move SIGPIPE out of the #ifndef NODE_SHARED_MODE section seems not a good idea. In that case, it separates the PlatformInit() logic into two places and because of two test case hit the SIGPIPE in the shared lib build. Is it worthy?

I think only adding a section in node_main.cc would be more obvious and clear (because it would be surrounded by the #ifdef NODE_SHARED_MODE.

And in the future, if the PlatformInit() could be moved to node_main.cc we won't need to put SIGPIPE back, right?

Another approach is to suppress these 2 test cases for share lib build.

@yhwang yhwang force-pushed the sharedlib-test-child-fork-pipe branch from 24c5de4 to c89b462 Compare March 9, 2018 01:05
For shared lib build, we leave the signal handling for embedding users.
In these two test cases:
- `parallel/test-process-external-stdio-close-spawn`
- `parallel/test-process-external-stdio-close`

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends.

This change ignores the SIGPIPE in node_main.cc for shared lib case.

Refs: nodejs#18535

Signed-off-by: Yihong Wang <yh.wang@ibm.com>
@yhwang yhwang force-pushed the sharedlib-test-child-fork-pipe branch from c89b462 to 4a7be07 Compare March 9, 2018 01:10
@yhwang
Copy link
Member Author

yhwang commented Mar 9, 2018

@bnoordhuis I reworked on the change and added a section in node_main.cc to ignore SIGPIPE and put comment to explain about the change. Hope it's easier and clearer for the node::PlatformInit() move in the future.

@bnoordhuis bnoordhuis dismissed jasnell’s stale review March 9, 2018 08:07

PR changed completely

Copy link
Member

@mhdawson mhdawson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yhwang
Copy link
Member Author

yhwang commented Mar 9, 2018

Thank you for the review!
CI: https://ci.nodejs.org/job/node-test-commit/16816/

@yhwang
Copy link
Member Author

yhwang commented Mar 9, 2018

smartos-15-64 failed with these logs:

FATAL: java.io.IOException: Connection reset by peer
java.io.IOException: Connection reset by peer
.........
no workspace for node-test-commit-smartos/nodes=smartos15-64 #15952

But other smartos builds passed. I think the results are good.

@yhwang yhwang added the author ready PRs that have at least one approval, no pending requests for changes, and a CI started. label Mar 9, 2018
@gireeshpunathil
Copy link
Member

My understanding is that in SHLIB mode, absolutely no signal handling is carried out, and the responsibility is delegated to the embedder. Please confirm this.

@yhwang
Copy link
Member Author

yhwang commented Mar 12, 2018

@gireeshpunathil

in SHLIB mode, absolutely no signal handling is carried out, and the responsibility is delegated to the embedder

Yes, squashing signal handlers is only applied to non-shared lib build. Here is the commit: 0f0f3d3 .

This PR modifies node_main.cc to handle SIGPIPE and is not related shared lib. (shared lib doesn't include node_main.cc)

yhwang added a commit that referenced this pull request Mar 13, 2018
For shared lib build, we leave the signal handling for embedding users.
In these two test cases:
- `parallel/test-process-external-stdio-close-spawn`
- `parallel/test-process-external-stdio-close`

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends.

This change ignores the SIGPIPE in node_main.cc for shared lib case.

Refs: #18535

Signed-off-by: Yihong Wang <yh.wang@ibm.com>

PR-URL: #19211
Refs: #18535
Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl>
Reviewed-By: Colin Ihrig <cjihrig@gmail.com>
Reviewed-By: James M Snell <jasnell@gmail.com>
Reviewed-By: Michael Dawson <michael_dawson@ca.ibm.com>
Reviewed-By: Gireesh Punathil <gpunathi@in.ibm.com>
@yhwang
Copy link
Member Author

yhwang commented Mar 13, 2018

Landed in ffd618b and closed this PR.

@yhwang yhwang closed this Mar 13, 2018
@yhwang yhwang deleted the sharedlib-test-child-fork-pipe branch March 13, 2018 00:14
targos pushed a commit that referenced this pull request Mar 17, 2018
For shared lib build, we leave the signal handling for embedding users.
In these two test cases:
- `parallel/test-process-external-stdio-close-spawn`
- `parallel/test-process-external-stdio-close`

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends.

This change ignores the SIGPIPE in node_main.cc for shared lib case.

Refs: #18535

Signed-off-by: Yihong Wang <yh.wang@ibm.com>

PR-URL: #19211
Refs: #18535
Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl>
Reviewed-By: Colin Ihrig <cjihrig@gmail.com>
Reviewed-By: James M Snell <jasnell@gmail.com>
Reviewed-By: Michael Dawson <michael_dawson@ca.ibm.com>
Reviewed-By: Gireesh Punathil <gpunathi@in.ibm.com>
@targos targos mentioned this pull request Mar 18, 2018
MylesBorins pushed a commit that referenced this pull request Mar 20, 2018
For shared lib build, we leave the signal handling for embedding users.
In these two test cases:
- `parallel/test-process-external-stdio-close-spawn`
- `parallel/test-process-external-stdio-close`

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends.

This change ignores the SIGPIPE in node_main.cc for shared lib case.

Refs: #18535

Signed-off-by: Yihong Wang <yh.wang@ibm.com>

PR-URL: #19211
Refs: #18535
Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl>
Reviewed-By: Colin Ihrig <cjihrig@gmail.com>
Reviewed-By: James M Snell <jasnell@gmail.com>
Reviewed-By: Michael Dawson <michael_dawson@ca.ibm.com>
Reviewed-By: Gireesh Punathil <gpunathi@in.ibm.com>
MayaLekova pushed a commit to MayaLekova/node that referenced this pull request May 8, 2018
For shared lib build, we leave the signal handling for embedding users.
In these two test cases:
- `parallel/test-process-external-stdio-close-spawn`
- `parallel/test-process-external-stdio-close`

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends.

This change ignores the SIGPIPE in node_main.cc for shared lib case.

Refs: nodejs#18535

Signed-off-by: Yihong Wang <yh.wang@ibm.com>

PR-URL: nodejs#19211
Refs: nodejs#18535
Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl>
Reviewed-By: Colin Ihrig <cjihrig@gmail.com>
Reviewed-By: James M Snell <jasnell@gmail.com>
Reviewed-By: Michael Dawson <michael_dawson@ca.ibm.com>
Reviewed-By: Gireesh Punathil <gpunathi@in.ibm.com>
BethGriggs pushed a commit that referenced this pull request Mar 21, 2019
For shared lib build, we leave the signal handling for embedding users.
In these two test cases:
- `parallel/test-process-external-stdio-close-spawn`
- `parallel/test-process-external-stdio-close`

The pipe is used for stdout and is destroied before child process uses
it for logging. So the node executble that uses shared lib build
receives SIGPIPE and the child process ends.

This change ignores the SIGPIPE in node_main.cc for shared lib case.

Refs: #18535

Signed-off-by: Yihong Wang <yh.wang@ibm.com>

PR-URL: #19211
Refs: #18535
Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl>
Reviewed-By: Colin Ihrig <cjihrig@gmail.com>
Reviewed-By: James M Snell <jasnell@gmail.com>
Reviewed-By: Michael Dawson <michael_dawson@ca.ibm.com>
Reviewed-By: Gireesh Punathil <gpunathi@in.ibm.com>
Signed-off-by: Beth Griggs <Bethany.Griggs@uk.ibm.com>
@MylesBorins MylesBorins mentioned this pull request Mar 26, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
author ready PRs that have at least one approval, no pending requests for changes, and a CI started. embedding Issues and PRs related to embedding Node.js in another project. test Issues and PRs related to the tests.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants