- Sponsor
-
Notifications
You must be signed in to change notification settings - Fork 781
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix failures when ASGI app rejects a connection during handshake #704
Fix failures when ASGI app rejects a connection during handshake #704
Conversation
c2c78e5
to
f608081
Compare
I could use some help on this unit-test failure. I've tested 10 times in the venv on localhost and there are no failures. I also wonder: do other tests have un-awaited tasks? Is my only mistake the use of |
Any updates? This is a blocker for us in prod. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice one, did some local testing against this and this looks pretty good. Left a suggestion to simplify the implementation in the websockets_impl
case!
@matteing Mind giving a look at the Windows error we're seeing…? Occurs in the test modified here, apparently there's a coroutine that's not awaited on Windows but I can't tell which coroutine that is exactly, i.e. is it due to the test server, or due to the ASGI application…? I don't have a Windows machine so debugging this is pretty iffy, and I don't want to rush things in case this could be a hint that there's something deeper we need to take care of here. task: <Task pending name='Task-168' coro=<IocpProactor.accept.<locals>.accept_coro() running at c:\hostedtoolcache\windows\python\3.8.5\x64\lib\asyncio\windows_events.py:562> wait_for=<_OverlappedFuture cancelled>> |
@florimondmanca we're all having trouble reproducing this bug on Windows. But it seems to me this error is currently on Windows, and it has been for ages? It looks to me like this patch merely reveals the bug -- it doesn't cause it. |
my 2 cents on the CI issue, I didn't follow the issue at stake here... we dropped iocp support for windows a while ago and now we force uvicorn to run the SelectorEventLoop, which is not the default loop on windows > 3.8 iirc here the issue (I think) is that the CI uses the default windows loop, so I'd just force it in the |
dd2bcec
to
06e9d51
Compare
Thanks, @euri10! That made the test pass @florimondmanca I added a new function, |
ping? Is there anything I can do to help this along? |
Taking a look. :) |
@@ -236,6 +235,10 @@ async def asgi_receive(self): | |||
return {"type": "websocket.connect"} | |||
|
|||
await self.handshake_completed_event.wait() | |||
if self.closed_event.is_set(): | |||
# Handshake has failed. | |||
return {"type": "websocket.disconnect", "code": self.initial_response[0]} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That code
is tricky… We actually need to consider two situations: A/ the handshake failed / got rejected, in that case .initial_response
is available and we can use it like here, and B/ the connection was closed normally by the client sending a websocket.close
event, and in that case .initial_response
isn't available and we need to send the close code that was given by the client.
@adamhooper Sadly it seems the entire suite of websocket tests seems off. The So I'd say — forget about the test suite for now. Let's have it work on manual tests. If that works, let's ship it. For now I was able to confirm that your So… How about a new pull request, that makes those two changes only? As noted in #704 (comment) we'll need to introduce a new |
Here's the diff I got to, if that's helpful: diff --git a/uvicorn/protocols/websockets/websockets_impl.py b/uvicorn/protocols/websockets/websockets_impl.py
index 407674c..9c26306 100644
--- a/uvicorn/protocols/websockets/websockets_impl.py
+++ b/uvicorn/protocols/websockets/websockets_impl.py
@@ -49,6 +49,7 @@ class WebSocketProtocol(websockets.WebSocketServerProtocol):
self.closed_event = asyncio.Event()
self.initial_response = None
self.connect_sent = False
+ self.close_code = None
self.accepted_subprotocol = None
self.transfer_data_task = None
@@ -216,6 +217,7 @@ class WebSocketProtocol(websockets.WebSocketServerProtocol):
elif message_type == "websocket.close":
code = message.get("code", 1000)
+ self.close_code = code
await self.close(code)
self.closed_event.set()
@@ -236,6 +238,18 @@ class WebSocketProtocol(websockets.WebSocketServerProtocol):
return {"type": "websocket.connect"}
await self.handshake_completed_event.wait()
+
+ if self.closed_event.is_set():
+ if self.initial_response is None:
+ # Client has disconnected.
+ assert self.close_code is not None
+ code = self.close_code
+ else:
+ # Handshake has failed, or the connection was rejected.
+ code = self.initial_response[0]
+
+ return {"type": "websocket.disconnect", "code": code}
+
try:
data = await self.recv()
except websockets.ConnectionClosed as exc:
diff --git a/uvicorn/protocols/websockets/wsproto_impl.py b/uvicorn/protocols/websockets/wsproto_impl.py
index d712963..456af13 100644
--- a/uvicorn/protocols/websockets/wsproto_impl.py
+++ b/uvicorn/protocols/websockets/wsproto_impl.py
@@ -255,10 +255,8 @@ class WSProtocol(asyncio.Protocol):
)
self.handshake_complete = True
self.close_sent = True
- msg = h11.Response(status_code=403, headers=[])
+ msg = events.RejectConnection(status_code=403, headers=[])
output = self.conn.send(msg)
- msg = h11.EndOfMessage()
- output += self.conn.send(msg)
self.transport.write(output)
self.transport.close()
|
[closes encode#244] Previously, if the application tried to close the connection before the handshake completed, uvicorn would log: TypeError: An asyncio.Future, a coroutine or an awaitable is required. These TypeErrors happen in tests/protocols/test_websocket.py. pytest is hiding them. In my initial pull request, I added `caplog` to `test_websocket.py` to expose the bug: def test_close_connection(protocol_cls, caplog): caplog.set_level(logging.ERROR) # ... assert not [r.message for r in caplog.records] ... but this caused unrelated errors on Windows; so @florimondmanca suggests leaving the test suite alone, in encode#704 (comment)
06e9d51
to
45465c8
Compare
@florimondmanca I'm a bit confused as to what my role is here. I assume I am to copy/paste your diff into the pull request, reset to master.... ... and here's the irony: out of curiosity, I added my
The fix is to nix the Also, I changed the logic so the All that to say: I've force-pushed a very brief diff. I also recommend adding caplog to every test ... but I agree with you that we should merge this post-haste to tackle my thousands of spurious stack traces per day :). |
@adamhooper Thanks for following up, sorry if my comments looked patronizing — didn't mean to have it come across that way. Your approach for So is this PR fixing the two main issues that are overflowing your logs? I don't have a workstation handy just now, but I can do an extra-checkup, then I guess we can go ahead and release this? :-) |
@florimondmanca Not patronizing at all! I'm all eagerness here. Yes, I heartily expect this PR to fix #244, the spurious log messages in I don't have experience with the parallel bug in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Was able to confirm this fixes the following two issues:
- When running with
--ws websockets
and the ASGI app rejects the connection, aTypeError
is raised -> Fixed - When running with
--ws wsproto
and the ASGI app rejects the connection, aLocalProtocolError
is raised -> Fixed
@adamhooper Thanks! Seems like we can go ahead and get this in. We'll get to fixing the test suite later — I'll log an issue about that to track it. |
sorry to be late in the game but this feels like a wrong fix to me at least in the websocket case, if you read the comment here python-websockets/websockets#670 (comment) a simpler and cleaner imho fix would look like this :
|
This reverts commit 8581b34
See encode/uvicorn#704 Should silence 20,000 stack traces per day on production: Traceback (most recent call last): File "/usr/local/lib/python3.8/site-packages/uvicorn/protocols/websockets/websockets_impl.py", line 154, in run_asgi result = await self.app(self.scope, self.asgi_receive, self.asgi_send) File "/usr/local/lib/python3.8/site-packages/uvicorn/middleware/proxy_headers.py", line 45, in __call__ return await self.app(scope, receive, send) File "/usr/local/lib/python3.8/site-packages/uvicorn/middleware/asgi2.py", line 7, in __call__ await instance(receive, send) File "/usr/local/lib/python3.8/site-packages/channels/sessions.py", line 183, in __call__ return await self.inner(receive, self.send) File "/usr/local/lib/python3.8/site-packages/channels/middleware.py", line 41, in coroutine_call await inner_instance(receive, send) File "/usr/local/lib/python3.8/site-packages/channels/sessions.py", line 183, in __call__ return await self.inner(receive, self.send) File "/usr/local/lib/python3.8/site-packages/channels/middleware.py", line 41, in coroutine_call await inner_instance(receive, send) File "/usr/local/lib/python3.8/site-packages/channels/consumer.py", line 58, in __call__ await await_many_dispatch( File "/usr/local/lib/python3.8/site-packages/channels/utils.py", line 58, in await_many_dispatch await task File "/usr/local/lib/python3.8/site-packages/channels/utils.py", line 50, in await_many_dispatch result = task.result() File "/usr/local/lib/python3.8/site-packages/uvicorn/protocols/websockets/websockets_impl.py", line 240, in asgi_receive data = await self.recv() File "/usr/local/lib/python3.8/site-packages/websockets/protocol.py", line 492, in recv await asyncio.wait( File "/usr/local/lib/python3.8/asyncio/tasks.py", line 424, in wait fs = {ensure_future(f, loop=loop) for f in set(fs)} File "/usr/local/lib/python3.8/asyncio/tasks.py", line 424, in <setcomp> fs = {ensure_future(f, loop=loop) for f in set(fs)} File "/usr/local/lib/python3.8/asyncio/tasks.py", line 673, in ensure_future raise TypeError('An asyncio.Future, a coroutine or an awaitable is ' TypeError: An asyncio.Future, a coroutine or an awaitable is required
We can't call
The spec is clear on this: https://datatracker.ietf.org/doc/html/rfc6455#section-1.4
The current behavior in uvicorn is rather "a peer discards any further data received, and any data previously received" More detail in #1230 Also, python-websockets/websockets#670 was eventually closed without any changes being made; in other words, there was no bug in |
[closes #244] Previously, if the application tried to close the connection before the handshake completed, uvicorn would log: TypeError: An asyncio.Future, a coroutine or an awaitable is required. These TypeErrors happen in tests/protocols/test_websocket.py. pytest is hiding them. In my initial pull request, I added `caplog` to `test_websocket.py` to expose the bug: def test_close_connection(protocol_cls, caplog): caplog.set_level(logging.ERROR) # ... assert not [r.message for r in caplog.records] ... but this caused unrelated errors on Windows; so @florimondmanca suggests leaving the test suite alone, in encode/uvicorn#704 (comment)
[closes #244] Previously, if the application tried to close the connection before the handshake completed, uvicorn would log: TypeError: An asyncio.Future, a coroutine or an awaitable is required. These TypeErrors happen in tests/protocols/test_websocket.py. pytest is hiding them. In my initial pull request, I added `caplog` to `test_websocket.py` to expose the bug: def test_close_connection(protocol_cls, caplog): caplog.set_level(logging.ERROR) # ... assert not [r.message for r in caplog.records] ... but this caused unrelated errors on Windows; so @florimondmanca suggests leaving the test suite alone, in encode/uvicorn#704 (comment)
Fixes #244. The unit test already existed, but it logged exceptions that
weren't being noticed.
transfer_data_task
is unset if the handshake fails, so we can't call recv().h11.Reject
is invalid during handshake; sendRejectConnection
instead.