Use binary statemachine to parse websocket frames #35

theturtle32 · 2010-08-07T23:33:22Z

Right now, the websocket adapter in the server sets the connection encoding: with this.connection.setEncoding('utf8');

It thus receives the incoming data as utf8 encoded and splits on \ufffd -- the unicode U+FFFD REPLACEMENT CHARACTER. This is inappropriate as it violates the spec, which indicates that the protocol itself is binary and the individual frames should be interpreted as utf8.

Also, the current internet draft of the WebSocket protocol (after Draft 76) uses a completely different framing method which will mandate parsing in binary. It will no longer be possible to read the bytes as UTF-8. See the latest protocol at http://www.whatwg.org/specs/web-socket-protocol/

It may be acceptable for now to keep the current implementation for now and then switch out the frame parsing entirely to handle implementations of the next official draft. However, be aware that a legitimate message could theoretically legitimately contain \ufffd in a position that doesn't indicate a frame boundary, which would break today's parser.

rauchg · 2010-09-05T21:05:44Z

Will reopen for the next official draft.

Try different transports upon connect timeout (fixes #35)

darrachequesne pushed a commit that referenced this issue Jul 4, 2024

Connect timeout (fixes #34)

c2917c7

Try different transports upon connect timeout (fixes #35)

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use binary statemachine to parse websocket frames #35

Use binary statemachine to parse websocket frames #35

theturtle32 commented Aug 7, 2010

rauchg commented Sep 5, 2010

Use binary statemachine to parse websocket frames #35

Use binary statemachine to parse websocket frames #35

Comments

theturtle32 commented Aug 7, 2010

rauchg commented Sep 5, 2010