Define "tee"ing a stream #271

domenic · 2015-01-27T00:31:49Z

See yutakahirano/fetch-with-streams#14. Fetch has req.clone() and res.clone() which are currently ill-defined. They should be defined here, in a generic fashion, and probably there should be a user-exposed API for it (one way or another).

Our current experiments with this are in TeeStream.md, but those are probably very outdated.

The text was updated successfully, but these errors were encountered:

yutakahirano · 2015-01-29T05:18:34Z

From fetch API point of view it might be desirable that req.clone() does not change req.body although I don't know if it is feasible from Streams point of view.

domenic · 2015-01-29T05:36:20Z

Yeah, this might be tricky to get right... I don't immediately see how to do it, but I want to stare at it for a while and see if maybe there's a way to make it work.

annevk · 2015-02-18T10:28:45Z

Note that tee is also used in step 1 of https://fetch.spec.whatwg.org/#concept-http-network-or-cache-fetch (same reason). I should probably abstract the "copy a request" operation.

wanderview · 2015-02-18T14:28:19Z

How should clone() or tee() work when the resulting streams are read at wildly different rates?

Consider:

UA has an underlying push data source that it can provide back pressure on. The full data stream is quite large. To prove the point, lets say something ridiculous like 100 GB.
This data source is exposed as a ReadableStream or ReadableByteStream. We'll call this stream1.
Content script calls stream1.clone() or stream1.tee() to create stream2.
The script then reads stream1 rapidly, but does not read stream2 at all.

At this point, the UA must start buffering the underlying data source for all data in-between stream1's read position and stream2's read position.

My question is, can the UA provide back pressure on the underlying data source because stream2 is not being read?

This might be surprising since it will in effect stall stream1 until stream2 starts getting consumed. The alternative, though, is to buffer until OOM, but that does not seem desirable.

Can the spec clarify that the UA has the freedom to provide back pressure if one of the tee'd streams is read too slowly?

annevk · 2015-02-18T15:58:49Z

This might be surprising since it will in effect stall stream1 until stream2 starts getting consumed. The alternative, though, is to buffer until OOM, but that does not seem desirable.

That's our usual way of doing things... Alternatively, can we notify the streams in some way that this is happening?

domenic · 2015-02-18T16:33:13Z

Sorry, which is our usual way? OOM or stall?

I agree this is the hard problem with teeing. Maybe it even needs to be an option to pick between the two behaviors? That would still leave the question of what the default is.

annevk · 2015-02-18T16:36:53Z

OOM. Platform specifications hardly ever deal with limits.

wanderview · 2015-02-18T16:44:21Z

Our underlying gecko primitives default to providing back pressure in this particular scenario. I'd like to be able to default to back pressure to make implementation easier and more efficient.

Of course, an unfortunate side effect of using back pressure here is that it makes GC observable:

stream2 = stream1.tee();
Read stream1 until it stalls.
Throw away stream2.
When GC collects stream2, then stream1 unblocks.

wanderview · 2015-02-18T16:45:39Z

Or rather, I'd like the default to be weasel words to the effect that "the UA may provide back pressure to all clones if one of the peer streams is not being read".

domenic · 2015-02-18T16:47:44Z

Well, the plan was to define an actual TeeStream with a normative specification of how it operates on its single input and multiple output streams. (See old design here, but don't take it for anything serious.) Maybe the conceptual "tee a stream" could be more weasel-wordy.

wanderview · 2015-02-18T16:54:28Z

Understood. I agree that pure js stream behavior need to be exactly defined.

In the fetch Request/Response clone() case, gecko uses "infinite" buffer size already to match XHR behavior of not doing any back pressure to the network. So this won't come into play for us there.

I understand blink does do back pressure for fetch(), but I don't know how blink's Request/Response clone() works. Back pressure on peer streams might be an issue there.

yutakahirano · 2015-02-23T10:13:36Z

Currently Blink's fetch doesn't have the backpressure mechanism. When we implement it, I think the OOM behavior is the right way.

tyoshino · 2015-03-19T06:40:06Z

Domenic created PR #302 to define "teeing" clearly. Discussion happening there now.

Closes #271; supercedes #302. Includes an abstract operation, TeeReadableStream(stream, shouldClone) which is meant for use by other specs, plus a method ReadableStream.prototype.tee() (which does no cloning).

domenic added acknowledged missing feature implementer concern labels Jan 27, 2015

domenic added this to the Week of 2015-02-02 milestone Jan 27, 2015

yutakahirano mentioned this issue Jan 27, 2015

Backpressure on fetch integrated with Streams w3c/ServiceWorker#452

Closed

domenic removed this from the Week of 2015-02-02 milestone Mar 3, 2015

yutakahirano mentioned this issue Mar 17, 2015

Define "tee" yutakahirano/fetch-with-streams#14

Closed

domenic mentioned this issue Mar 18, 2015

First draft at tee algorithms, for critique #302

Closed

domenic modified the milestone: Week of 2015-03-30 Mar 30, 2015

domenic closed this as completed in 3ed32a5 Apr 6, 2015

yutakahirano mentioned this issue Oct 26, 2015

Custom tee function #401

Open

Peleg mentioned this issue Aug 21, 2016

Backpressure from tee-ing and slow/pending consumer #506

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define "tee"ing a stream #271

Define "tee"ing a stream #271

domenic commented Jan 27, 2015

yutakahirano commented Jan 29, 2015

domenic commented Jan 29, 2015

annevk commented Feb 18, 2015

wanderview commented Feb 18, 2015

annevk commented Feb 18, 2015

domenic commented Feb 18, 2015

annevk commented Feb 18, 2015

wanderview commented Feb 18, 2015

wanderview commented Feb 18, 2015

domenic commented Feb 18, 2015

wanderview commented Feb 18, 2015

yutakahirano commented Feb 23, 2015

tyoshino commented Mar 19, 2015

Define "tee"ing a stream #271

Define "tee"ing a stream #271

Comments

domenic commented Jan 27, 2015

yutakahirano commented Jan 29, 2015

domenic commented Jan 29, 2015

annevk commented Feb 18, 2015

wanderview commented Feb 18, 2015

annevk commented Feb 18, 2015

domenic commented Feb 18, 2015

annevk commented Feb 18, 2015

wanderview commented Feb 18, 2015

wanderview commented Feb 18, 2015

domenic commented Feb 18, 2015

wanderview commented Feb 18, 2015

yutakahirano commented Feb 23, 2015

tyoshino commented Mar 19, 2015