Tensorboard smoothing is biased to initial values #610
Comments
Hi Alex—if I recall correctly, @dandelionmane responded to this in the original Google-internal issue. Would you mind posting the response here?
What I wrote on the Google-internal bug:
Here's the discussion from the last time we changed the smoothing algorithm:
tensorflow/tensorflow#7891 (proposed)
tensorflow/tensorflow#8363 (merged)
@alexirpan would you take a stab at implementing the debiased smoothing, and show us how it would perform on a few realistic cases? I'm guessing that on the common case of rapid initial descent on loss curves it would look a lot better.
Closing this out since I believe it was fixed by #639.
The relevant code is in tensorboard/tensorboard/components/vz_line_chart/vz-line-chart.ts
The implementation of resmoothDataset computes an exponential moving average over the list of data points. However, for finite sequences the EMA is biased toward the initial values.
Let x_i be the raw data values, and y_i be the exponential moving average, defined as
y_0 = x_0
y_i = smooth * y_{i-1} + (1-smooth) * x_i.
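As a concrete illustration of this recurrence, here is a minimal TypeScript sketch (not the actual resmoothDataset code; the function name and signature are made up for the example):

```typescript
// Minimal sketch of the smoothing recurrence described above.
// Not the actual resmoothDataset implementation; names are illustrative.
// `smooth` is the smoothing weight from the slider, assumed to be in [0, 1).
function emaSmooth(values: number[], smooth: number): number[] {
  const smoothed: number[] = [];
  let last = 0;
  for (let i = 0; i < values.length; i++) {
    if (i === 0) {
      last = values[0];                                 // y_0 = x_0
    } else {
      last = smooth * last + (1 - smooth) * values[i];  // y_i = smooth*y_{i-1} + (1-smooth)*x_i
    }
    smoothed.push(last);
  }
  return smoothed;
}
```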
For y_n, expanding the EMA gives
y_n = (1-smooth) * x_n + smooth * (1-smooth) * x_{n-1} + smooth^2 * (1-smooth) * x_{n-2} + ...
and so on. The weight on x_1 is smooth^{n-1} * (1-smooth), and the weight on x_0 is smooth^n.
Going from each x_i to x_{i-1}, the weight on its contribution gets multiplied by smooth, except going from x_1 to x_0, where it gets multiplied by smooth / (1 - smooth). If smooth = 0.95, this effectively makes x_0 matter 19 times as much to the final average as x_1 does.
To undo the bias, I believe there should be a way to do something similar to https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/training/moving_averages.py#L155, where you apply a correction based on how many data points you've seen so far.
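For illustration, here is a rough sketch of that debiasing idea in TypeScript (an assumption of how it could look, not a drop-in patch for vz-line-chart.ts; the names are made up): start the accumulator at zero and divide by the total weight accumulated so far.

```typescript
// Sketch of a debiased EMA, in the spirit of the zero-debias correction
// linked above. Names are illustrative; `smooth` is assumed to be in [0, 1).
function emaSmoothDebiased(values: number[], smooth: number): number[] {
  const smoothed: number[] = [];
  let last = 0;        // accumulator started at 0 instead of x_0
  let numAccum = 0;    // how many points have been folded in so far
  for (const x of values) {
    last = smooth * last + (1 - smooth) * x;
    numAccum++;
    // Total weight accumulated so far:
    // (1-smooth) * (1 + smooth + ... + smooth^{numAccum-1}) = 1 - smooth^numAccum.
    // Dividing by it normalizes the weights to sum to 1.
    const debiasWeight = 1 - Math.pow(smooth, numAccum);
    smoothed.push(last / debiasWeight);
  }
  return smoothed;
}
```

With smooth = 0.95, the second smoothed value then weights x_0 and x_1 almost equally (0.0475 vs. 0.05 before normalization) instead of weighting x_0 nineteen times more, and the first smoothed value is still exactly x_0.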