
Has anyone run into this Flink CDC problem: Heartbeat of TaskManager with

Has anyone run into this Flink CDC problem: Heartbeat of TaskManager with id container_e152_1681119346002_0031_01_000002(hadoop1:36167) timed out? The JobManager log around the failure:

2023-04-21 08:34:08,527 INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator [] - Triggering checkpoint 12732 (type=CheckpointType{name='Checkpoint', sharingFilesStrategy=FORWARD_BACKWARD}) @ 1682037248525 for job 000000006d88de340000000000000000.
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator [] - Completed checkpoint 12732 for job 000000006d88de340000000000000000 (44754 bytes, checkpointDuration=34347 ms, finalizationTime=21 ms).
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_sale_order_kafka_source[7].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_voucher_kafka_source[19].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_voucher_detail_kafka_source[22].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_pay_order_kafka_source[13].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_user_kafka_source[10].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_supplier_kafka_source[4].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_marketing_activity_kafka_source[28].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_tenant_kafka_source[1].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_marketing_log_kafka_source[31].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_marketing_voucher_kafka_source[25].
2023-04-21 08:34:42,893 INFO org.apache.flink.runtime.source.coordinator.SourceCoordinator [] - Marking checkpoint 12732 as completed for source Source: ods_nmd_t_supplier_order_kafka_source[16].
2023-04-21 08:36:08,527 INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator [] - Triggering checkpoint 12733 (type=CheckpointType{name='Checkpoint', sharingFilesStrategy=FORWARD_BACKWARD}) @ 1682037368525 for job 000000006d88de340000000000000000.
2023-04-21 08:36:30,324 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:30,372 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:30,530 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:30,770 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:30,795 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:31,554 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:31,677 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:31,755 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 1 disconnected.
2023-04-21 08:36:32,267 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 0 disconnected.
2023-04-21 08:36:32,317 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 2 disconnected.
2023-04-21 08:36:32,554 INFO org.apache.flink.kafka.shaded.org.apache.kafka.clients.NetworkClient [] - [AdminClient clientId=cdc_mysql_nmd_iceberg-enumerator-admin-client] Node 2 disconnected.
2023-04-21 08:36:50,978 INFO org.apache.flink.runtime.jobmaster.JobMaster [] - Heartbeat of TaskManager with id container_e152_1681119346002_0031_01_000002(hadoop1:36167) timed out.
2023-04-21 08:36:50,979 INFO org.apache.flink.runtime.jobmaster.JobMaster [] - Disconnect TaskExecutor container_e152_1681119346002_0031_01_000002(hadoop1:36167) because: Heartbeat of TaskManager with id container_e152_1681119346002_0031_01_000002(hadoop1:36167) timed out.
2023-04-21 08:36:50,987 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Source: ods_nmd_t_marketing_voucher_kafka_source[25] -> Calc[26] -> ConstraintEnforcer[27] (1/1) (a0d51d12b069005b6ea975b630adbf5d_5e41dc390d043aff80bcdeef4d074c59_0_0) switched from RUNNING to FAILED on container_e152_1681119346002_0031_01_000002 @ hadoop1 (dataPort=46030).
java.util.concurrent.TimeoutException: Heartbeat of TaskManager with id container_e152_1681119346002_0031_01_000002(hadoop1:36167) timed out.
    at org.apache.flink.runtime.jobmaster.JobMaster$TaskManagerHeartbeatListener.notifyHeartbeatTimeout(JobMaster.java:1435) ~[flink-dist-1.16.0.jar:1.16.0]
    at org.apache.flink.runtime.heartbeat.HeartbeatMonitorImpl.run(HeartbeatMonitorImpl.java:155) ~[flink-dist-1.16.0.jar:1.16.0]

cuicuicuic 2023-04-26 14:54:44
1 answer
  • The TaskManager went down. This answer was compiled from the DingTalk group "Flink CDC 社区" (Flink CDC Community).

    2023-04-27 15:59:07
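
    If the TaskManager did go down, a natural next step is to pin down which container was lost and when, so its own logs can be checked. The following is only a rough sketch, not anything from the original thread: it scans JobManager log text shaped like the lines in the question and pulls out the timestamp, container ID, and host of each heartbeat timeout. The function names `find_heartbeat_timeouts` and `yarn_application_id` are made up for illustration, and the application ID derivation assumes the job runs on YARN with the usual `container_e<epoch>_<clusterTimestamp>_<appSeq>_...` naming.

```python
import re

# Matches only the JobMaster "Heartbeat of TaskManager ... timed out" line,
# not the follow-up "Disconnect TaskExecutor ... because: ..." line.
HEARTBEAT_TIMEOUT = re.compile(
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) INFO \S*JobMaster \[\] - "
    r"Heartbeat of TaskManager with id (?P<container>container_\S+?)"
    r"\((?P<host>[^:()]+):(?P<port>\d+)\) timed out"
)


def yarn_application_id(container_id):
    """Derive the YARN application ID from a container ID (assumes YARN naming)."""
    parts = container_id.split("_")
    # Drop the optional epoch segment such as "e152".
    if parts[1].startswith("e"):
        parts = parts[:1] + parts[2:]
    return "application_{}_{}".format(parts[1], parts[2])


def find_heartbeat_timeouts(log_text):
    """Return one dict per heartbeat-timeout event found in the log text."""
    events = []
    for match in HEARTBEAT_TIMEOUT.finditer(log_text):
        container = match.group("container")
        events.append({
            "timestamp": match.group("ts"),
            "container": container,
            "host": match.group("host"),
            "application": yarn_application_id(container),
        })
    return events
```

    With those IDs, and still assuming a YARN deployment, the lost TaskManager's own logs could be fetched with `yarn logs -applicationId <application id> -containerId <container id>`. The JobManager log posted in the question only shows that heartbeats stopped arriving; it does not show why the TaskManager stopped responding.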
