Spring Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: simple70

Databricks Certified Data Engineer Associate Exam Databricks-Certified-Data-Engineer-Associate Question # 30 Topic 4 Discussion

Databricks Certified Data Engineer Associate Exam Databricks-Certified-Data-Engineer-Associate Question # 30 Topic 4 Discussion

Databricks-Certified-Data-Engineer-Associate Exam Topic 4 Question 30 Discussion:
Question #: 30
Topic #: 4

A data engineer needs to optimize the data layout and query performance for an e-commerce transactions Delta table. The table is partitioned by "purchase_date" a date column which helps with time-based queries but does not optimize searches on user statistics "customer_id", a high-cardinality column.

The table is usually queried with filters on "customer_i

d" within specific date ranges, but since this data is spread across multiple files in each partition, it results in full partition scans and increased runtime and costs.

How should the data engineer optimize the Data Layout for efficient reads?


A.

Alter table implementing liquid clustering on "customerid" while keeping the existing partitioning.


B.

Alter the table to partition by "customer_id".


C.

Enable delta caching on the cluster so that frequent reads are cached for performance.


D.

Alter the table implementing liquid clustering by "customer_id" and "purchase_date".


Get Premium Databricks-Certified-Data-Engineer-Associate Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.