-
Notifications
You must be signed in to change notification settings - Fork 289
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changefeed with sink-uri=kafka status become failed after all PD restart #2389
Comments
$ tiup cdc:v4.0.14 cli changefeed list --pd=http://172.16.6.24:237 Flags: Global Flags: fail to open PD etcd client, pd="http://172.16.6.24:237": context deadline exceeded
|
|
checkpoint still updating after failed. |
the error may happen when try to |
#942 may still unsolved, because |
|
The problem should have already fix by #2370. replay the scenario on release-4.0 before try to fix it. |
Bug Report
Please answer these questions before submitting your issue. Thanks!
2x capture, 2x changefeed:
Starting component
cdc
: /root/.tiup/components/cdc/v4.0.14/cdc cli changefeed list --pd=http://172.16.6.28:2379[
{
"id": "kafka-task-11",
"summary": {
"state": "normal",
"tso": 426608082704400445,
"checkpoint": "2021-07-27 18:11:26.586",
"error": null
}
},
{
"id": "replication-task-11",
"summary": {
"state": "normal",
"tso": 426608707681910785,
"checkpoint": "2021-07-27 18:51:10.686",
"error": null
}
}
Restart PD: $tiup cluster restart 360UP -R pd
Changefeed with sink-uri=kafka status become failed after all PD restart
Starting component
cdc
: /root/.tiup/components/cdc/v4.0.14/cdc cli changefeed list --pd=http://172.16.6.28:2379[
{
"id": "replication-task-11",
"summary": {
"state": "normal",
"tso": 426608859290271791,
"checkpoint": "2021-07-27 19:00:49.026",
"error": null
}
},
{
"id": "kafka-task-11",
"summary": {
"state": "failed",
"tso": 426608874219372599,
"checkpoint": "2021-07-27 19:01:45.976",
"error": {
"addr": "172.16.6.32:8300",
"code": "CDC-owner-1001",
"message": "rpc error: code = Unknown desc = rpc error: code = Unavailable desc = not leader"
}
}
}
]
Check cf status again, the failed kafka-task-11 checkpoint is still updating...
Starting component
cdc
: /root/.tiup/components/cdc/v4.0.14/cdc cli changefeed list --pd=http://172.16.6.28:2379[
{
"id": "kafka-task-11",
"summary": {
"state": "failed",
"tso": 426609068572934145,
"checkpoint": "2021-07-27 19:14:07.376",
"error": {
"addr": "172.16.6.32:8300",
"code": "CDC-owner-1001",
"message": "rpc error: code = Unknown desc = rpc error: code = Unavailable desc = not leader"
}
}
},
{
"id": "replication-task-11",
"summary": {
"state": "normal",
"tso": 426609068677791745,
"checkpoint": "2021-07-27 19:14:07.776",
"error": null
}
}
Versions of the cluster
Upstream TiDB cluster version (execute
SELECT tidb_version();
in a MySQL client):TiCDC version (execute
cdc version
):The text was updated successfully, but these errors were encountered: