Google Professional Data Engineer Exam Professional-Data-Engineer Question # 4 Topic 1 Discussion

Google Professional Data Engineer Exam Professional-Data-Engineer Question # 4 Topic 1 Discussion

Professional-Data-Engineer Exam Topic 1 Question 4 Discussion:
Question #: 4
Topic #: 1

You maintain ETL pipelines. You notice that a streaming pipeline running on Dataflow is taking a long time to process incoming data, which causes output delays. You also noticed that the pipeline graph was automatically optimized by Dataflow and merged into one step. You want to identify where the potential bottleneck is occurring. What should you do?


A.

Insert a Reshuffle operation after each processing step, and monitor the execution details in the Dataflow console.


B.

Log debug information in each ParDo function, and analyze the logs at execution time.


C.

Insert output sinks after each key processing step, and observe the writing throughput of each block.


D.

Verify that the Dataflow service accounts have appropriate permissions to write the processed data to the output sinks


Get Premium Professional-Data-Engineer Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.