Create a backend for the pr_curves plugin #387

chihuahua · 2017-08-18T07:58:22Z

Made PrCurvesPlugin, which serves data for the PR Curves dashboard. Made a metadata.py file that centralizes information such as the plugin name and the indices used to obtain precision and recall from tensor data.

This PR is a part of dividing #334 (which contains resourceful input from @wchargin) into smaller PRs.

wchargin

Looks good overall. Most important requested change is to the data format (use proto wire format instead of json). Then the docs and nondeterminism. Everything else is either minor or a suggestion.

wchargin · 2017-08-22T03:06:38Z

tensorboard/plugins/pr_curve/metadata.py

+  """
+  pr_curve_plugin_data = plugin_data_pb2.PrCurvePluginData(
+      version=PROTO_VERSION, num_thresholds=num_thresholds)
+  content = json_format.MessageToJson(pr_curve_plugin_data)


This should be content = pr_curve_plugin_data.SerializeToString(), for consistency with our stated best-practice and consistency with all other plugins.

wchargin · 2017-08-22T03:07:09Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      response = http_util.Respond(
+          request, self.pr_curves_impl(runs, tag), 'application/json')
+    except ValueError as e:
+      return http_util.Respond(request, '%s' % e, 'text/plain', 400)


'%s' % e is more easily written as str(e).

wchargin · 2017-08-22T03:08:51Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      The JSON object for the tags route response.
+    """
+    all_runs = self._multiplexer.PluginRunToTagToContent(
+        PrCurvesPlugin.plugin_name)


This should be metadata.PLUGIN_NAME. I know that they have the same value, but the argument to PluginRunToTagToContent corresponds to the string stored in the plugin_data of the summaries, not the route used to fetch data. If you changed PrCurvesPlugin.plugin_name to be something else, the value of this argument should still be the original metadata.PLUGIN_NAME.

Likewise elsewhere in this file.

wchargin · 2017-08-22T03:12:21Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      # within the same run. If the latter occurs, TensorBoard will show the
+      # actual step of each tag atop the card for the tag.
+      tensor_events = self._multiplexer.Tensors(
+          run, list(tag_to_content.keys())[0])


This is nondeterministic, which rubs me the wrong way. Consider: min(six.iterkeys(tag_to_content)).

(General note: If you were to not apply this comment, prefer next(six.iterkeys(tag_to_content)) to what's written here. No need to serialize to list, duplicate the list (in Python 2), and then grab the first element.)

True. I actually like next(six.iterkeys(tag_to_content)) because it does not involve looping through all tags like min. Indeed, both alternatives nicely avoid list serialization and duplication.

wchargin · 2017-08-22T03:14:12Z

tensorboard/plugins/pr_curve/pr_curves_plugin_test.py

+  def setUp(self):
+    super(PrCurvesPluginTest, self).setUp()
+    logdir = os.path.join(self.get_temp_dir(), 'logdir')
+    tf.reset_default_graph()


This is not needed; it's done in the superclass setup.

wchargin · 2017-08-22T03:14:47Z

tensorboard/plugins/pr_curve/pr_curves_plugin_test.py

+    """
+    self.assertEqual(expected_step, pr_curve_entry['step'])
+    # We use an absolute error instead of a relative one because the expected
+    # values are small. The default relative error (trol) of 1e-7 yields many


s/trol/rtol?

wchargin · 2017-08-22T03:16:54Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

@@ -0,0 +1,198 @@
+# Copyright 2017 Google Inc. All Rights Reserved.


Could you add an http_api.md? (Copy-paste and modify one of the other plugins'.)

wchargin · 2017-08-22T03:18:30Z

tensorboard/plugins/pr_curve/pr_curves_plugin_test.py

+    The handler should raise a ValueError when no PR curve data can be found
+    for a certain run-tag combination.
+    """
+    with self.assertRaises(ValueError):


You might prefer with six.assertRaisesRegex(self, ValueError, r'No PR curves could be fetched') to ensure that the error is the one that you're looking for.

wchargin · 2017-08-22T03:19:37Z

tensorboard/plugins/pr_curve/summary.py

@@ -58,7 +57,7 @@ def op(
    predictions: A float32 `Tensor` whose values are in the range `[0, 1]`.
        Dimensions must match those of `labels`.
    num_thresholds: Number of thresholds, evenly distributed in `[0, 1]`, to
-        compute PR metrics for. Should be `>= 2`. This value should be a 
+        compute PR metrics for. Should be `>= 2`. This value should be a


I liked the idea of handling these in #389. Perhaps you have a rogue commit in this PR?

Ideally, the "add backend" commit should make no changes to summary.py, and the "add frontend" commit (if separate) should make no changes to pr_curves_plugin.py/etc.

Indeed, I am going to let #389 handle these style changes.

chihuahua

I manually started a backend and tried routes. They seem to work.

chihuahua · 2017-08-23T00:28:52Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

@@ -0,0 +1,198 @@
+# Copyright 2017 Google Inc. All Rights Reserved.


chihuahua · 2017-08-23T00:30:36Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      response = http_util.Respond(
+          request, self.pr_curves_impl(runs, tag), 'application/json')
+    except ValueError as e:
+      return http_util.Respond(request, '%s' % e, 'text/plain', 400)


chihuahua · 2017-08-23T00:31:32Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      The JSON object for the tags route response.
+    """
+    all_runs = self._multiplexer.PluginRunToTagToContent(
+        PrCurvesPlugin.plugin_name)


chihuahua · 2017-08-23T00:33:12Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      # within the same run. If the latter occurs, TensorBoard will show the
+      # actual step of each tag atop the card for the tag.
+      tensor_events = self._multiplexer.Tensors(
+          run, list(tag_to_content.keys())[0])


True. I actually like next(six.iterkeys(tag_to_content)) because it does not involve looping through all tags like min. Indeed, both alternatives nicely avoid list serialization and duplication.

chihuahua · 2017-08-23T00:34:02Z

tensorboard/plugins/pr_curve/pr_curves_plugin_test.py

+  def setUp(self):
+    super(PrCurvesPluginTest, self).setUp()
+    logdir = os.path.join(self.get_temp_dir(), 'logdir')
+    tf.reset_default_graph()


chihuahua · 2017-08-23T00:34:33Z

tensorboard/plugins/pr_curve/pr_curves_plugin_test.py

+    """
+    self.assertEqual(expected_step, pr_curve_entry['step'])
+    # We use an absolute error instead of a relative one because the expected
+    # values are small. The default relative error (trol) of 1e-7 yields many


chihuahua · 2017-08-23T00:35:06Z

tensorboard/plugins/pr_curve/pr_curves_plugin_test.py

+    The handler should raise a ValueError when no PR curve data can be found
+    for a certain run-tag combination.
+    """
+    with self.assertRaises(ValueError):


wchargin · 2017-08-23T03:08:17Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      # within the same run. If the latter occurs, TensorBoard will show the
+      # actual step of each tag atop the card for the tag.
+      tensor_events = self._multiplexer.Tensors(
+          run, next(six.iterkeys(tag_to_content)))


Like you, I appreciate that next(six.iterkeys(tag_to_content)) avoids looking at all the data, but it is just this that means that the result is still nondeterministic. If you can find a simple deterministic way to not look at all the data, then go for it, but short of changing the storage type used by the event accumulator to a key-ordered dict I think min(six.iterkeys(tag_to_content)) is what you want.

Note also that looking at all the data here is in no way expensive.

It is indeed generally inexpensive. Used min(six.iterkeys(tag_to_content)) for easier debugging if issues arise.

wchargin · 2017-08-23T03:08:35Z

tensorboard/plugins/pr_curve/http_api.md

@@ -0,0 +1,97 @@
+# Precision—Recall Curve plugin HTTP API
+
+The scalar plugin name is `pr_curves`, so all its routes are under


s/scalar/PR curve/

wchargin · 2017-08-23T03:09:55Z

tensorboard/plugins/pr_curve/http_api.md

+
+Each PR data entry contains the following properties.
+
+* **wall_time**: The wall time (number) in seconds since the epoch at which data


Would you mind removing the hanging indent here? It should look like:

Each PR data entry contains the following properties. * **wall_time**: The wall time (number) in seconds since the epoch at which data for the PR curve was allocated.

It makes diffs, editing, etc. more difficult, and renders just the same.

(Arguably you want a <dl> here, but Markdown doesn't have those, and you shouldn't drop down into HTML for it, so.)

Done. A <dl> would seem apt if possible.

wchargin · 2017-08-23T03:10:58Z

tensorboard/plugins/pr_curve/http_api.md

+}
+```
+
+## `/data/plugin/scalars/pr_curves`


s/scalars/pr_curves

wchargin · 2017-08-23T03:11:07Z

tensorboard/plugins/pr_curve/http_api.md

+
+Used by the PR Curves dashboard to render plots.
+
+## `/data/plugin/scalars/tags`


s/scalars/pr_curves

wchargin · 2017-08-23T03:13:27Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+    Returns:
+      The JSON object for the tags route response.
+    """
+    all_runs = self._multiplexer.PluginRunToTagToContent(metadata.PLUGIN_NAME)


From #334 (comment):

You'll need to update this to return a run-to-tag-to-tag-info map, as in the other plugins. Then, you can pass this to tf-card-heading on the frontend. (The frontend part can be a TODO for now because of resolution, but we should get the backend right on the first go. Take a look at the runToTagInfo adaptations in the scalar dashboard, for instance.)

Best to resolve in this PR: there's no need for this backend to ever do the old thing.

Done. Thank you for noting that! I also updated http.md.

chihuahua

Manually verified that backend responses seem reasonable.

chihuahua · 2017-08-23T04:28:40Z

tensorboard/plugins/pr_curve/http_api.md

@@ -0,0 +1,97 @@
+# Precision—Recall Curve plugin HTTP API
+
+The scalar plugin name is `pr_curves`, so all its routes are under


chihuahua · 2017-08-23T04:28:44Z

tensorboard/plugins/pr_curve/http_api.md

+}
+```
+
+## `/data/plugin/scalars/pr_curves`


chihuahua · 2017-08-23T04:28:52Z

tensorboard/plugins/pr_curve/http_api.md

+
+Used by the PR Curves dashboard to render plots.
+
+## `/data/plugin/scalars/tags`


chihuahua · 2017-08-23T04:30:03Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      # within the same run. If the latter occurs, TensorBoard will show the
+      # actual step of each tag atop the card for the tag.
+      tensor_events = self._multiplexer.Tensors(
+          run, next(six.iterkeys(tag_to_content)))


It is indeed generally inexpensive. Used min(six.iterkeys(tag_to_content)) for easier debugging if issues arise.

chihuahua · 2017-08-23T05:21:38Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+    Returns:
+      The JSON object for the tags route response.
+    """
+    all_runs = self._multiplexer.PluginRunToTagToContent(metadata.PLUGIN_NAME)


Done. Thank you for noting that! I also updated http.md.

wchargin · 2017-08-23T21:29:32Z

tensorboard/plugins/pr_curve/http_api.md

+are objects with 2 keys: `displayName` and `description` (associated with the
+run-tag combination).
+
+The `displayName` is shown atop individual plots in TensorBoard. The description


Please note that the description contains sanitized HTML to be injected into the DOM, while the display name is simply an arbitrary string.

wchargin · 2017-08-24T05:58:15Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+      # within the same run. If the latter occurs, TensorBoard will show the
+      # actual step of each tag atop the card for the tag.
+      tensor_events = self._multiplexer.Tensors(
+          run, list(tag_to_content.keys())[0])


This was reverted here: 382b13d#diff-9b643340c1c8537af733696bb8c0c641R166

I'm pausing review on the suspicion that this is a rogue commit (at a quick scan, its contents don't seem to match its title). Let me know when the diff is correct.

Thank you for noticing! I undid the rogue changes in a recent commit. What happened was that I had copied some code from the PR with the demo, which did not include some of the recent changes in this particular PR. I need to be more cognizant of avoiding rogue changes and commits.

wchargin

Okay; just one API question.

wchargin · 2017-08-24T16:26:00Z

tensorboard/plugins/pr_curve/pr_curves_plugin.py

+    return {
+      'step': tensor_event.step,
+      'wall_time': tensor_event.wall_time,
+      'relative': tensor_event.wall_time - initial_wall_time,


Why compute the relative times on the server? No other plugin does this, as far as I know; we use ChartHelpers.relativeAccessor to do this on the frontend:

tensorboard/tensorboard/plugins/scalar/vz_line_chart/vz-chart-helpers.ts

Lines 155 to 169 in 542ac06

export let relativeAccessor =

// tslint:disable-next-line:no-any be quiet tsc

(d: any, index: number, dataset: Plottable.Dataset) => {

// We may be rendering the final-point datum for scatterplot.

// If so, we will have already provided the 'relative' property

if (d.relative != null) {

return d.relative;

}

let data = dataset.data();

// I can't imagine how this function would be called when the data is

// empty (after all, it iterates over the data), but lets guard just

// to be safe.

let first = data.length > 0 ? +data[0].wall_time : 0;

return (+d.wall_time - first) / (60 * 60 * 1000); // ms to hours

};

That is true. The recent commit does away with computing relative on the backend.

I think on the frontend, we could have pr-curve-card logic compute relative values. We could also generalize the logic within vz-chart-helpers, although the relative time in the PR curve UI changes the display in the sliders and doesn't modify the UI of the vz line chart.

wchargin

All right!

Made PrCurvesPlugin, which serves data for the PR Curves dashboard. Made a metadata.py file that centralizes information such as the plugin name and the indices used to obtain precision and recall from tensor data. Made some lint changes required for submission.

chihuahua added the type:feature (new plugin) label Aug 18, 2017

chihuahua requested review from wchargin and jart and removed request for wchargin August 18, 2017 07:58

wchargin suggested changes Aug 22, 2017

View reviewed changes

chihuahua commented Aug 23, 2017

View reviewed changes

wchargin reviewed Aug 23, 2017

View reviewed changes

chihuahua commented Aug 23, 2017

View reviewed changes

chihuahua mentioned this pull request Aug 24, 2017

Make a plugin to serve precision-recall curves #334

Closed

wchargin reviewed Aug 24, 2017

View reviewed changes

wchargin approved these changes Aug 25, 2017

View reviewed changes

chihuahua added 18 commits August 25, 2017 11:04

.

eae6bf9

Add __init__.py to fix test

3e0da7b

Document routes in http.md

74ccfa8

Rebase changes atop master to remove merge commit

f13dfe9

.

b1a49a6

Respond to comments.

19b1002

Update API docs

e65f6e0

Remove indentation

1b215ad

Correct API

bfd17a2

Capitalize Plugin

9f1c107

Fix test. Update http.md

507ef4b

Revert rogue changes.

2ab3fa8

Note that description is sanitized HTML.

6569076

Uppercase PLUGIN_NAME

3105733

Remove relative

d13f242

Pacify the angry linter

2d90ac1

Fix test

f8e6721

chihuahua force-pushed the chizeng-pr-curves-backend branch from 9ae9ca2 to f8e6721 Compare August 25, 2017 18:05

Remove lint errors I had inadvertently added back in a commit

b131dd5

chihuahua merged commit 1c7b8ce into master Aug 25, 2017

chihuahua deleted the chizeng-pr-curves-backend branch August 25, 2017 18:38

wchargin mentioned this pull request Sep 10, 2017

Write op and pb methods for text summaries #510

Merged

		@@ -0,0 +1,198 @@
		# Copyright 2017 Google Inc. All Rights Reserved.

		@@ -0,0 +1,97 @@
		# Precision—Recall Curve plugin HTTP API

		The scalar plugin name is `pr_curves`, so all its routes are under


		Each PR data entry contains the following properties.

		* wall_time: The wall time (number) in seconds since the epoch at which data


		Used by the PR Curves dashboard to render plots.

		## `/data/plugin/scalars/tags`

	export let relativeAccessor =
	// tslint:disable-next-line:no-any be quiet tsc
	(d: any, index: number, dataset: Plottable.Dataset) => {
	// We may be rendering the final-point datum for scatterplot.
	// If so, we will have already provided the 'relative' property
	if (d.relative != null) {
	return d.relative;
	}
	let data = dataset.data();
	// I can't imagine how this function would be called when the data is
	// empty (after all, it iterates over the data), but lets guard just
	// to be safe.
	let first = data.length > 0 ? +data[0].wall_time : 0;
	return (+d.wall_time - first) / (60 * 60 * 1000); // ms to hours
	};

Create a backend for the pr_curves plugin #387

Create a backend for the pr_curves plugin #387

Conversation

chihuahua commented Aug 18, 2017 • edited Loading

wchargin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chihuahua left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chihuahua left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wchargin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wchargin left a comment

Choose a reason for hiding this comment

chihuahua commented Aug 18, 2017 •

edited

Loading