Spring Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: simple70

Pass the Databricks Databricks Certification Databricks-Certified-Data-Engineer-Associate Questions and answers with CertsForce

Viewing page 1 out of 5 pages
Viewing questions 1-10 out of questions
Questions # 1:

The Delta transaction log for the ‘students’ tables is shown using the ‘DESCRIBE HISTORY students’ command. A Data Engineer needs to query the table as it existed before the UPDATE operation listed in the log.

Question # 1

Which command should the Data Engineer use to achieve this? (Choose two.)

Options:

A.

SELECT * FROM students@v4


B.

SELECT * FROM students TIMESTAMP AS OF ‘2024-04-22T 14:32:47.000+00:00’


C.

SELECT * FROM students FROM HISTORY VERSION AS OF 3


D.

SELECT * FROM students VERSION AS OF 5


E.

SELECT * FROM students TIMESTAMP AS OF ‘2024-04-22T 14:32:58.000+00:00’


Expert Solution
Questions # 2:

A data engineer wants to create a data entity from a couple of tables. The data entity must be used by other data engineers in other sessions. It also must be saved to a physical location.

Which of the following data entities should the data engineer create?

Options:

A.

Database


B.

Function


C.

View


D.

Temporary view


E.

Table


Expert Solution
Questions # 3:

Identify how the count_if function and the count where x is null can be used

Consider a table random_values with below data.

What would be the output of below query?

select count_if(col > 1) as count_a. count(*) as count_b.count(col1) as count_c from random_values col1

0

1

2

NULL -

2

3

Options:

A.

3 6 5


B.

4 6 5


C.

3 6 6


D.

4 6 6


Expert Solution
Questions # 4:

A data engineer that is new to using Python needs to create a Python function to add two integers together and return the sum?

Which of the following code blocks can the data engineer use to complete this task?

A)

Question # 4

B)

Question # 4

C)

Question # 4

D)

Question # 4

E)

Question # 4

Options:

A.

Option A


B.

Option B


C.

Option C


D.

Option D


E.

Option E


Expert Solution
Questions # 5:

A data engineer is working with two tables. Each of these tables is displayed below in its entirety.

Question # 5

The data engineer runs the following query to join these tables together:

Question # 5

Which of the following will be returned by the above query?

Question # 5

Options:

A.

Option A


B.

Option B


C.

Option C


D.

Option D


E.

Option E


Expert Solution
Questions # 6:

A data engineer needs to ingest from both streaming and batch sources for a firm that relies on highly accurate data. Occasionally, some of the data picked up by the sensors that provide a streaming input are outside the expected parameters. If this occurs, the data must be dropped, but the stream should not fail.

Which feature of Delta Live Tables meets this requirement?

Options:

A.

Monitoring


B.

Change Data Capture


C.

Expectations


D.

Error Handling


Expert Solution
Questions # 7:

A data architect has determined that a table of the following format is necessary:

Question # 7

Which of the following code blocks uses SQL DDL commands to create an empty Delta table in the above format regardless of whether a table already exists with this name?

Question # 7

Options:

A.

Option A


B.

Option B


C.

Option C


D.

Option D


E.

Option E


Expert Solution
Questions # 8:

A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance.

Which of the following keywords can be used to compact the small files?

Options:

A.

REDUCE


B.

OPTIMIZE


C.

COMPACTION


D.

REPARTITION


E.

VACUUM


Expert Solution
Questions # 9:

What is the functionality of AutoLoader in Databricks?

Options:

A.

Auto Loader automatically ingests and processes new files from cloud storage, handling batch data with support for schema evolution.


B.

Auto Loader automatically ingests and processes new files from cloud storage, handling only streaming data with no support for schema evolution.


C.

Auto Loader automatically ingests and processes new files from cloud storage, handling batch and streaming data with no support for schema evolution.


D.

Auto Loader automatically ingests and processes new files from cloud storage, handling both batch and streaming data with support for schema evolution.


Expert Solution
Questions # 10:

In which of the following scenarios should a data engineer use the MERGE INTO command instead of the INSERT INTO command?

Options:

A.

When the location of the data needs to be changed


B.

When the target table is an external table


C.

When the source table can be deleted


D.

When the target table cannot contain duplicate records


E.

When the source is not a Delta table


Expert Solution
Viewing page 1 out of 5 pages
Viewing questions 1-10 out of questions