Databricks Databricks-Machine-Learning-Associate Exam Questions Free Practice Test

Viewing page 3 out of 3 pages

Viewing questions 21-30 out of questions

Questions # 21:

A machine learning engineer is trying to scale a machine learning pipeline by distributing its single-node model tuning process. After broadcasting the entire training data onto each core, each core in the cluster can train one model at a time. Because the tuning process is still running slowly, the engineer wants to increase the level of parallelism from 4 cores to 8 cores to speed up the tuning process. Unfortunately, the total memory in the cluster cannot be increased.

In which of the following scenarios will increasing the level of parallelism from 4 to 8 speed up the tuning process?

Options:

When the tuning process in randomized

When the entire data can fit on each core

When the model is unable to be parallelized

When the data is particularly long in shape

When the data is particularly wide in shape

Expert Solution

Questions # 22:

A data scientist has replaced missing values in their feature set with each respective feature variable’s median value. A colleague suggests that the data scientist is throwing away valuable information by doing this.

Which of the following approaches can they take to include as much information as possible in the feature set?

Options:

Impute the missing values using each respective feature variable's mean value instead of the median value

Refrain from imputing the missing values in favor of letting the machine learning algorithm determine how to handle them

Remove all feature variables that originally contained missing values from the feature set

Create a binary feature variable for each feature that contained missing values indicating whether each row's value has been imputed

Create a constant feature variable for each feature that contained missing values indicating the percentage of rows from the feature that was originally missing

Expert Solution

Viewing page 3 out of 3 pages

Viewing questions 21-30 out of questions

Pass the Databricks ML Data Scientist Databricks-Machine-Learning-Associate Questions and answers with CertsForce