Summer Certification Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: force70

Pass the Microsoft Microsoft Certified: Azure Databricks Data Engineer DP-750 Questions and answers with CertsForce

Viewing page 1 out of 1 pages
Viewing questions 1-10 out of questions
Questions # 1:

You have an Azure Databricks workspace that is enabled for Unity Catalog and contains a managed Delta table named Tabid.

Table! is written by batch jobs every hour and is queried frequently by filtering two columns named Customerld and EventDate.

You expect Table1 to grow significantly over time.

The rows in Table1 are frequently updated and deleted to support compliance requests.

You need to keep query performance consistent as Table1 grows. The solution must minimize update and deletion effort.

What should you include in the solution? To answer, select the appropriate options in the answer area

NOTE: Each correct selection is worth one point.

Question # 1


Expert Solution
Questions # 2:

You have an Azure Databricks workspace that is enabled for Unity Catalog and contains a Delta table named Orders.

You load the Orders table into an Apache Spark DataFrame named df.

You need to create a DataFrame that excludes rows where the order amount is null.

Solution: You run the following expression.

df.dropna(subset=["order_amount"])

Does this meet the goal?

Options:

A.

Yes


B.

No


Expert Solution
Questions # 3:

You have an Azure Databricks job named Job1 that contains an ingestion task named Task1 and transformation task named Task2. You need to ensure that if Task1 fails, the task retries automatically, and Task2 is prevented from running How should you configure Job1? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Question # 3


Expert Solution
Questions # 4:

You have an Azure Databticks workspace that contains an all-purpose compute cluster named Cluster1. Cluser1 is used for

interactive development.

You need to configure Cluster1 to meet the following requirements:

• Automatically add and remove worker nodes based on workload demand

• Automatically shut down when the cluster has been idle for a specific period.

What should you configure for each requirement? To answer, drag the appropriate options to the correct requirements. Each option may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content

NOTE: Each correct selection is worth one point.

Question # 4


Expert Solution
Questions # 5:

You need to complete the PySpark code for the Spark Structured Streaming pipelines. The solution must meet the data ingestion and processing requirements.

How should you complete the code segment? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Question # 5


Expert Solution
Questions # 6:

You need to configure compute for the ingestion of telemetry data. The solution must meet the data ingestion and processing requirements.

What should you do?

Options:

A.

Enable Photon acceleration for a job compute cluster.


B.

Move the ingestion pipelines to shared compute.


C.

Increase an all-purpose cluster to a larger fixed node type.


D.

Disable autoscaling for a job compute cluster.


Expert Solution
Questions # 7:

You need to develop the task logic for a new job in Lakeflow Jobs that processes telemetry data.

Each task must contain only the appropriate logic for its step in the pipeline. The solution must support the planned changes and meet the data ingestion and processing requirements.

What should you do?

Options:

A.

Use a single Databricks notebook task that performs ingestion, cleansing, and curation in one script.


B.

Create three tasks that each contains the identical logic and use task retries.


C.

Use a single SQL task that performs ingestion, cleansing, and curation by running merge commands.


D.

Create separate tasks for ingestion, cleansing, and curation.


Expert Solution
Questions # 8:

Which SCD type should you use to support the planned data modeling changes? To answer, drag the appropriate types to the correct issues. Each type may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

Question # 8


Expert Solution
Questions # 9:

Which ingestion option should you recommend for each data source? To answer, drag the appropriate options to the correct data sources. Each option may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

Question # 9


Expert Solution
Viewing page 1 out of 1 pages
Viewing questions 1-10 out of questions