Kafka add counters v1 uw2 #33503
base: master
Conversation
Force-pushed 5b28665 to 5f28d22
Run Java PreCommit
Run Java_IOs_Direct PreCommit
Run Java_Pulsar_IO_Direct PreCommit
Run Python_Coverage PreCommit
Run Java PreCommit
2 similar comments
Run Java PreCommit
Run Java PreCommit
tests failing due to #28957
Run Java PreCommit
Checks are failing. Will not request review until checks are succeeding. If you'd like to override that behavior, comment assign set of reviewers.
@johnjcasey @sjvanrossum flaky tests - want to get the review started in the meantime, but will wait on the tests to pass before I merge.
Assigning reviewers. If you would like to opt out of this review, comment assign to next reviewer: R: @kennknowles for label java. Available commands:
The PR bot will only process comments in the main thread (not review comments).
Run Java PreCommit
2 similar comments
Run Java PreCommit
Run Java PreCommit
Would you mind adding some more context on what this change is doing?
2 questions:
Run Java PreCommit
Run GoPortable PreCommit
Run Go PreCommit
Run PythonDocker PreCommit 3.12
Run Java PreCommit
Reopening this. It consistently fails due to a known flaky test: #28957
Force-pushed 1861d3b to 83718b3
Run Java PreCommit
Run Prism_Python PreCommit 3.12
Run Java_GCP_IO_Direct PreCommit
// Based on the namespace, add a per-worker metrics label to enable separate
// runner-based sink processing.
if (metricName.getNamespace().equals("BigQuerySink")
It seems like this would be cleaner if it was done at the call-site when creating the metric.
That would make the behavior more obvious there and less dependent on a magic constant, and it would allow us to support some Kafka metrics per-worker but others aggregated, or to add per-worker metrics to a namespace that already has metrics.
It seems like we could either add it to MetricKey, or plumb it into the cell and then update the function where we generate metadata to base it on that instead.
Done. Removed BQ sink support; that can be done later. I've added it to MetricName.
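For context, a minimal sketch of the call-site approach under discussion, assuming a hypothetical label key; the "KafkaSink" namespace, the name format, and the PER_WORKER_METRIC constant are illustrative, and the actual change to MetricName in this PR may look different:

import org.apache.beam.sdk.metrics.MetricName;

import java.util.Collections;
import java.util.Map;

final class PerWorkerMetricNames {
  // Hypothetical label key; the PR defines the real constant.
  static final String PER_WORKER_LABEL = "PER_WORKER_METRIC";

  // Tag the metric as per-worker where it is created, instead of matching
  // on a magic namespace constant downstream.
  static MetricName backlogBytes(String topic, int partition) {
    return MetricName.named(
        "KafkaSink", String.format("backlogBytes-%s-%d", topic, partition));
  }

  // Metadata the runner would consult when deciding whether to aggregate
  // the metric across workers or report it per worker.
  static Map<String, String> perWorkerLabels() {
    return Collections.singletonMap(PER_WORKER_LABEL, "true");
  }
}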
@@ -332,7 +332,6 @@ public long getSplitBacklogBytes() {
      if (pBacklog == UnboundedReader.BACKLOG_UNKNOWN) {
        return UnboundedReader.BACKLOG_UNKNOWN;
      }
-     backlogBytes += pBacklog;
revert?
Thanks for catching!
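For clarity, a minimal sketch of the aggregation being restored; PartitionState and approxBacklogInBytes are illustrative stand-ins, not the actual reader internals:

import java.util.List;

class BacklogSketch {
  static final long BACKLOG_UNKNOWN = -1L; // mirrors UnboundedReader.BACKLOG_UNKNOWN

  // Illustrative stand-in for the reader's per-partition state.
  interface PartitionState {
    long approxBacklogInBytes();
  }

  static long getSplitBacklogBytes(List<PartitionState> partitions) {
    long backlogBytes = 0;
    for (PartitionState p : partitions) {
      long pBacklog = p.approxBacklogInBytes();
      if (pBacklog == BACKLOG_UNKNOWN) {
        // Any unknown partition makes the whole split's backlog unknown.
        return BACKLOG_UNKNOWN;
      }
      backlogBytes += pBacklog; // the line the review asked to restore
    }
    return backlogBytes;
  }
}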
@Override
public void recordBacklogBytes(String topicName, int partitionId, long backlogBytes) {
  Gauge perPartition =
      Metrics.gauge(
I.e., can we create the gauge here with the getPerWorkerGauge method, and then ensure that those metrics have the per-worker metadata populated appropriately?
We create it as a regular gauge and then add per-worker metadata to differentiate between the two (with the metric labels). If we added a new API here, it would also have to be added to every other derived metric (and then we'd likely want per-worker versions of other sorts of metrics too). Keeping the original set (gauge, histogram, counter, etc.) and adding metadata for runner aggregation might be a cleaner approach?
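A minimal sketch of the approach described above, using the existing Gauge API; the "KafkaSink" namespace and the metric name format are assumptions for illustration:

import org.apache.beam.sdk.metrics.Gauge;
import org.apache.beam.sdk.metrics.Metrics;

class KafkaSinkMetricsSketch {
  // Record per-partition backlog through the ordinary Gauge API; the runner
  // distinguishes per-worker metrics by metadata attached to the metric name,
  // not by a separate per-worker gauge type.
  public void recordBacklogBytes(String topicName, int partitionId, long backlogBytes) {
    Gauge perPartition =
        Metrics.gauge("KafkaSink", String.format("backlogBytes-%s-%d", topicName, partitionId));
    perPartition.set(backlogBytes);
  }
}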
Force-pushed 1d17345 to 168d1aa
Run Java PreCommit
Run Java_IOs_Direct PreCommit
1 similar comment
Run Java_IOs_Direct PreCommit
Force-pushed 168d1aa to c107437
Run PythonDocker PreCommit 3.10
Run Prism_Python PreCommit 3.12
Run Java PreCommit
Run Java_GCP_IO_Direct PreCommit
Run Java PreCommit
Add per-worker metadata on metrics propagated over the FnApi. This is on top of #33408.
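As a rough sketch of what per-worker metadata over the FnApi could look like at the proto level, a label is attached to a MonitoringInfo before it is reported; the "PER_WORKER_METRIC" key is a hypothetical name used for illustration, not necessarily the one the PR defines:

import org.apache.beam.model.pipeline.v1.MetricsApi.MonitoringInfo;

class MonitoringInfoSketch {
  // Attach a per-worker label to an existing MonitoringInfo; labels is a
  // plain string map on the proto, so no new metric type is needed.
  static MonitoringInfo withPerWorkerLabel(MonitoringInfo info) {
    return info.toBuilder().putLabels("PER_WORKER_METRIC", "true").build();
  }
}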