Summer Certification Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: force70

Databricks Certified Data Engineer Associate Exam Databricks-Certified-Data-Engineer-Associate Question # 13 Topic 2 Discussion

Databricks Certified Data Engineer Associate Exam Databricks-Certified-Data-Engineer-Associate Question # 13 Topic 2 Discussion

Databricks-Certified-Data-Engineer-Associate Exam Topic 2 Question 13 Discussion:
Question #: 13
Topic #: 2

A data engineer is inspecting an ETL pipeline based on a Pyspark job that consistently encounters performance bottlenecks. Based on developer feedback, the data engineer assumes the job is low on compute resources. To pinpoint the issue, the data engineer observes the Spark Ul and finds out the job has a high CPU time vs Task time.

Which course of action should the data engineer take?


A.

High CPU time vs Task time means an under-utilized cluster. The data engineer may need to repartition data to spread the jobs more evenly throughout the cluster.


B.

High CPU time vs Task time means efficient use of cluster and no change needed


C.

High CPU time vs Task time means over-utilized memory and the need to increase parallelism


D.

High CPU time vs Task time means a CPU over-utilized job. The data engineer may need to consider executor and core tuning or resizing the cluster


Get Premium Databricks-Certified-Data-Engineer-Associate Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.