Pass the Google Google Cloud Certified Professional-Data-Engineer Questions and answers with CertsForce

Viewing page 4 out of 7 pages
Viewing questions 31-40 out of questions
Questions # 31:

What is the HBase Shell for Cloud Bigtable?

Options:

A.

The HBase shell is a GUI based interface that performs administrative tasks, such as creating and deleting tables.


B.

The HBase shell is a command-line tool that performs administrative tasks, such as creating and deleting tables.


C.

The HBase shell is a hypervisor based shell that performs administrative tasks, such as creating and deleting new virtualized instances.


D.

The HBase shell is a command-line tool that performs only user account management functions to grant access to Cloud Bigtable instances.


Expert Solution
Questions # 32:

Which of these sources can you not load data into BigQuery from?

Options:

A.

File upload


B.

Google Drive


C.

Google Cloud Storage


D.

Google Cloud SQL


Expert Solution
Questions # 33:

What Dataflow concept determines when a Window's contents should be output based on certain criteria being met?

Options:

A.

Sessions


B.

OutputCriteria


C.

Windows


D.

Triggers


Expert Solution
Questions # 34:

Scaling a Cloud Dataproc cluster typically involves ____.

Options:

A.

increasing or decreasing the number of worker nodes


B.

increasing or decreasing the number of master nodes


C.

moving memory to run more applications on a single node


D.

deleting applications from unused nodes periodically


Expert Solution
Questions # 35:

Why do you need to split a machine learning dataset into training data and test data?

Options:

A.

So you can try two different sets of features


B.

To make sure your model is generalized for more than just the training data


C.

To allow you to create unit tests in your code


D.

So you can use one dataset for a wide model and one for a deep model


Expert Solution
Questions # 36:

Which of these is not a supported method of putting data into a partitioned table?

Options:

A.

If you have existing data in a separate file for each day, then create a partitioned table and upload each file into the appropriate partition.


B.

Run a query to get the records for a specific day from an existing table and for the destination table, specify a partitioned table ending with the day in the format "$YYYYMMDD".


C.

Create a partitioned table and stream new records to it every day.


D.

Use ORDER BY to put a table's rows into chronological order and then change the table's type to "Partitioned".


Expert Solution
Questions # 37:

What are two methods that can be used to denormalize tables in BigQuery?

Options:

A.

1) Split table into multiple tables; 2) Use a partitioned table


B.

1) Join tables into one table; 2) Use nested repeated fields


C.

1) Use a partitioned table; 2) Join tables into one table


D.

1) Use nested repeated fields; 2) Use a partitioned table


Expert Solution
Questions # 38:

Which Cloud Dataflow / Beam feature should you use to aggregate data in an unbounded data source every hour based on the time when the data entered the pipeline?

Options:

A.

An hourly watermark


B.

An event time trigger


C.

The with Allowed Lateness method


D.

A processing time trigger


Expert Solution
Questions # 39:

When a Cloud Bigtable node fails, ____ is lost.

Options:

A.

all data


B.

no data


C.

the last transaction


D.

the time dimension


Expert Solution
Questions # 40:

Which software libraries are supported by Cloud Machine Learning Engine?

Options:

A.

Theano and TensorFlow


B.

Theano and Torch


C.

TensorFlow


D.

TensorFlow and Torch


Expert Solution
Viewing page 4 out of 7 pages
Viewing questions 31-40 out of questions