Pre-Summer Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: force70

Databricks Certified Data Engineer Professional Exam Databricks-Certified-Professional-Data-Engineer Question # 31 Topic 4 Discussion

Databricks Certified Data Engineer Professional Exam Databricks-Certified-Professional-Data-Engineer Question # 31 Topic 4 Discussion

Databricks-Certified-Professional-Data-Engineer Exam Topic 4 Question 31 Discussion:
Question #: 31
Topic #: 4

A data engineer is designing a pipeline in Databricks that processes records from a Kafka stream where late-arriving data is common.

Which approach should the data engineer use?


A.

Implement a custom solution using Databricks Jobs to periodically reprocess all historical data.


B.

Use batch processing and overwrite the entire output table each time to ensure late data is incorporated correctly.


C.

Use an Auto CDC pipeline with batch tables to simplify late data handling.


D.

Use a watermark to specify the allowed lateness to accommodate records that arrive after their expected window, ensuring correct aggregation and state management.


Get Premium Databricks-Certified-Professional-Data-Engineer Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.