
[INLONG-5169][Sort] Fix race condition issue of HBaseSinkFunction metric collection #5170

Merged · 2 commits into apache:master · Aug 4, 2022

Conversation

@ifndef-SleePy (Contributor) commented Jul 21, 2022


Motivation

  • Fix a race condition in the HBaseSinkFunction metric collection

Modifications

Replace the plain metric counters in HBaseSinkFunction with a thread-safe implementation, so that the task thread and the periodic flush thread can update them concurrently, as sketched below.
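
For context, a minimal sketch of the kind of race being fixed, assuming the usual Flink sink pattern of a processing thread plus a scheduled flush thread. The class, field, and method names below are illustrative, not the actual InLong code:

```java
// Illustrative sketch only: a plain counter shared between the Flink task
// thread and the scheduled flush thread, with no synchronization.
public class RacySinkSketch {
    private long pendingRecords = 0; // records buffered since the last flush
    private long numRecordsOut = 0;  // the reported metric

    // Task thread: called once per incoming record.
    public void invoke(Object record) {
        // buffering via BufferedMutator omitted
        pendingRecords++; // non-atomic read-modify-write
    }

    // Timer thread: periodic flush, running concurrently with invoke().
    public void flush() {
        // mutator.flush() would happen here
        numRecordsOut += pendingRecords; // racy read ...
        pendingRecords = 0;              // ... and racy reset: records arriving in
                                         // between silently vanish from the metric
    }
}
```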

Verifying this change


  • This change is a trivial rework/code cleanup without any test coverage.

Documentation

  • Does this pull request introduce a new feature? (no)
  • If yes, how is the feature documented? (not applicable)

@dockerzhang (Contributor)

@yunqingmoswu @thexiay PTAL, thanks.

@gong (Contributor) commented Jul 22, 2022

Maybe we can define a ConcurrentCounter.
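
For reference, a minimal sketch of what such a ConcurrentCounter could look like, assuming it implements Flink's org.apache.flink.metrics.Counter interface on top of an AtomicLong. This is an illustration, not the actual InLong class:

```java
import java.util.concurrent.atomic.AtomicLong;

import org.apache.flink.metrics.Counter;

/** Sketch of a thread-safe counter, safe to update from both the task and flush threads. */
public class ConcurrentCounter implements Counter {
    private final AtomicLong count = new AtomicLong();

    @Override
    public void inc() {
        count.incrementAndGet();
    }

    @Override
    public void inc(long n) {
        count.addAndGet(n);
    }

    @Override
    public void dec() {
        count.decrementAndGet();
    }

    @Override
    public void dec(long n) {
        count.addAndGet(-n);
    }

    @Override
    public long getCount() {
        return count.get();
    }
}
```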

@yunqingmoswu (Contributor)

> @yunqingmoswu @thexiay PTAL, thanks.

Done.

@ifndef-SleePy (Contributor, Author)

Hi guys, thanks for reviewing. First, I'm sorry that I didn't explain the reasoning behind this PR.

As you can see, all the comments focus on the flush-failure scenario. I think we don't need to care about the metrics after a flush failure, because:

  1. If flushing fails, the job falls into a failover scenario.
  2. Metric accuracy cannot be guaranteed when failover happens. The consistency mode of HBaseSinkFunction is at-least-once, so some of the produced records will probably be produced again. However, the metric counts are never reverted, so all of these records get counted again.
  3. Moreover, nobody knows how many records have been written to the HBase region servers when flushing fails. Flushing a BufferedMutator is not a transactional operation. If I remember correctly, the HBase client issues RPC calls to several region servers concurrently, and any failed RPC call results in a global failure. So when a failure happens, we don't know how many RPCs succeeded: none, some, or all of the records may have been flushed successfully.

My points here are:

  1. We have to guarantee metric accuracy when no failure happens. That's what this PR does.
  2. Metrics are not accurate if flushing fails, no matter what we do. So we don't need any synchronization on these counters; just make them as cheap as possible.
  3. A counter recording the number of flush failures would be meaningful. But I'm a bit confused by the counter name "dirty", so I removed these ambiguous counters. Maybe we could introduce another counter to record the failure count.

What do you guys think of this? @EMsnap @gong @yunqingmoswu

@yunqingmoswu (Contributor)

> (quoting @ifndef-SleePy's comment above)

As far as I know, the underlying logic of a flush should be to generate an HFile first and then load it, so a flush leaves this batch of data either all visible or all lost. If I understand correctly, the statistics collected after flushing would then be more accurate, not only for dirty data but also for normally synced data.

@gong (Contributor) commented Jul 26, 2022

@ifndef-SleePy Hello, here dirty data is defined as data that fails to be written, and we need both the number of records and the data size.
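
Hypothetically, the two dirty-data metrics described here could look like the following, using Flink's MetricGroup API. The metric names and the class are illustrative, not InLong's actual identifiers:

```java
import org.apache.flink.metrics.Counter;
import org.apache.flink.metrics.MetricGroup;

/** Illustrative dirty-data metrics: a record count and a byte count on write failure. */
public class DirtyDataMetrics {
    private final Counter dirtyRecords;
    private final Counter dirtyBytes;

    public DirtyDataMetrics(MetricGroup metricGroup) {
        this.dirtyRecords = metricGroup.counter("numDirtyRecords");
        this.dirtyBytes = metricGroup.counter("numDirtyBytes");
    }

    /** Called when a write fails; size is the serialized size of the failed record. */
    public void reportWriteFailure(long size) {
        dirtyRecords.inc();
        dirtyBytes.inc(size);
    }
}
```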

@ifndef-SleePy (Contributor, Author)

> (quoting @yunqingmoswu's point above that a flush leaves the batch either all visible or all lost)

  1. The HFile approach you described is called "bulk loading" [1]. That's not how writes work here: BufferedMutator does not trigger bulk loading (there is a separate API for that). Its writes go through the WAL and the MemStore to be ingested into the region servers.
  2. Even bulk loading could not guarantee "this batch of data either all visible or all lost", because several HFiles might be involved in one batch operation.
  3. HBase guarantees consistency within a row, but not across rows; see the ACID docs [2]: "APIs that mutate several rows will not be atomic across the multiple rows." A sketch of the BufferedMutator write path follows the references below.

[1] https://hbase.apache.org/book.html#arch.bulk.load
[2] https://hbase.apache.org/acid-semantics.html
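
To make the non-transactional behavior concrete, here is a hedged sketch of the normal BufferedMutator write path using the standard HBase client API (the table and column names are placeholders). On a failed flush, RetriesExhaustedWithDetailsException only reports the mutations that failed; other mutations in the same batch may already be durable on their region servers:

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.BufferedMutator;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.RetriesExhaustedWithDetailsException;
import org.apache.hadoop.hbase.util.Bytes;

public class BufferedMutatorFlushDemo {
    public static void main(String[] args) throws IOException {
        Configuration conf = HBaseConfiguration.create();
        try (Connection connection = ConnectionFactory.createConnection(conf);
                BufferedMutator mutator =
                        connection.getBufferedMutator(TableName.valueOf("demo_table"))) {
            Put put = new Put(Bytes.toBytes("row-1"));
            put.addColumn(Bytes.toBytes("cf"), Bytes.toBytes("q"), Bytes.toBytes("v"));
            mutator.mutate(put); // buffered locally, not yet visible in HBase
            try {
                // flush() fans out RPCs to the region servers holding the buffered
                // rows; it is not transactional across regions.
                mutator.flush();
            } catch (RetriesExhaustedWithDetailsException e) {
                // Only the failed mutations are reported here; the rest of the batch
                // may already be durable, so the overall outcome is partial.
                for (int i = 0; i < e.getNumExceptions(); i++) {
                    System.err.println("Failed row: "
                            + Bytes.toString(e.getRow(i).getRow()));
                }
            }
        }
    }
}
```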

@yunqingmoswu (Contributor) left a review comment

It is possible to report the metrics either before or after flushing, but reporting them after flushing is recommended.

@yunqingmoswu (Contributor)

> (quoting @ifndef-SleePy's explanation of bulk loading and HBase's ACID semantics above)

Thanks for clarifying.

@ifndef-SleePy (Contributor, Author)

After an offline discussion with @yunqingmoswu, we agreed to simply keep the original implementation, with a thread-safe counter. The decision is based on production requirements rather than technical ones.

  1. We have to keep the meaningless "dirty"-prefixed counters until someday the production manager realizes they are meaningless.
  2. Updating the counters after flushing is better than before flushing, although neither can guarantee accuracy: we don't want users asking why the metric system reports successfully written records while there is no data in HBase (see the sketch below).
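
A rough sketch of that agreed approach, assuming thread-safe counters that are only bumped after flush() has returned. The class and field names are illustrative, not the actual InLong code:

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicLong;

import org.apache.hadoop.hbase.client.BufferedMutator;

/** Hypothetical sketch: bump the metric only after flush() has returned. */
public class PostFlushMetricReporter {
    private final BufferedMutator mutator;
    private final AtomicLong pendingRecords = new AtomicLong(); // buffered since last flush
    private final AtomicLong numRecordsOut = new AtomicLong();  // the reported metric

    public PostFlushMetricReporter(BufferedMutator mutator) {
        this.mutator = mutator;
    }

    // Task thread: count a record as pending when it is buffered.
    public void recordBuffered() {
        pendingRecords.incrementAndGet();
    }

    // Flush path: count the batch only once the flush call has succeeded, so the
    // metric never runs ahead of what is actually visible in HBase.
    public void flushAndReport() throws IOException {
        long batch = pendingRecords.getAndSet(0);
        mutator.flush(); // if this throws, the job fails over and the batch is not
                         // counted, which is acceptable per the discussion above
        numRecordsOut.addAndGet(batch);
    }
}
```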

@gong (Contributor) commented Jul 29, 2022

Please resolve the conflicts, @ifndef-SleePy.

@ifndef-SleePy (Contributor, Author)

I've re-created the PR after resolving conflicts.

Commit: "…lacing counter with thread-safe implementation"
@yunqingmoswu (Contributor) left a review comment

LGTM

@EMsnap EMsnap merged commit 089ea1e into apache:master Aug 4, 2022
zcy1010 pushed a commit to jun0315/inlong that referenced this pull request Aug 7, 2022
bruceneenhl pushed a commit to bruceneenhl/inlong that referenced this pull request Aug 12, 2022
Successfully merging this pull request may close this issue:

[Bug][Sort] Metrics of HBaseSinkFunction are not collected accurately