Pass the Databricks ML Data Scientist Databricks-Machine-Learning-Professional Questions and answers with CertsForce

Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions
Questions # 1:

A machine learning engineer is manually refreshing a model in an existing machine learning pipeline. The pipeline uses the MLflow Model Registry model "project". The machine learning engineer would like to add a new version of the model to "project".

Which of the following MLflow operations can the machine learning engineer use to accomplish this task?

Options:

A.

mlflow.register_model


B.

MlflowClient.update_registered_model


C.

mlflow.add_model_version


D.

MlflowClient.get_model_version


E.

The machine learning engineer needs to create an entirely new MLflow Model Registry model


Expert Solution
Questions # 2:

A data scientist has developed a modelmodeland computed the RMSE of the model on the test set. They have assigned this value to the variablermse. They now want to manually store the RMSE value with the MLflow run.

They write the following incomplete code block:

Question # 2

Which of the following lines of code can be used to fill in the blank so the code block can successfully complete the task?

Options:

A.

log_artifact


B.

log_model


C.

log_metric


D.

log_param


E.

There is no way to store values like this.


Expert Solution
Questions # 3:

A machine learning engineering team wants to build a continuous pipeline for data preparation of a machine learning application. The team would like the data to be fully processed and made ready for inference in a series of equal-sized batches.

Which of the following tools can be used to provide this type of continuous processing?

Options:

A.

Spark UDFs


B.

[Structured Streaming


C.

MLflow

D Delta Lake


D.

AutoML


Expert Solution
Questions # 4:

A machine learning engineer has developed a random forest model using scikit-learn, logged the model using MLflow as random_forest_model, and stored its run ID in the run_id Python variable. They now want to deploy that model by performing batch inference on a Spark DataFrame spark_df.

Which of the following code blocks can they use to create a function called predict that they can use to complete the task?

A)

Question # 4

B)

It is not possible to deploy a scikit-learn model on a Spark DataFrame.

C)

Question # 4

D)

Question # 4

E)

Question # 4

Options:

A.

Option A


B.

Option B


C.

Option C


D.

Option D


E.

Option E


Expert Solution
Questions # 5:

Which of the following is a reason for using Jensen-Shannon (JS) distance over a Kolmogorov-Smirnov (KS) test for numeric feature drift detection?

Options:

A.

All of these reasons


B.

JS is not normalized or smoothed


C.

None of these reasons


D.

JS is more robust when working with large datasets


E.

JS does not require any manual threshold or cutoff determinations


Expert Solution
Questions # 6:

Which of the following MLflow operations can be used to delete a model from the MLflow Model Registry?

Options:

A.

client.transition_model_version_stage


B.

client.delete_model_version


C.

client.update_registered_model


D.

client.delete_model


E.

client.delete_registered_model


Expert Solution
Questions # 7:

A machine learning engineer has deployed a model recommender using MLflow Model Serving. They now want to query the version of that model that is in the Production stage of the MLflow Model Registry.

Which of the following model URIs can be used to query the described model version?

Options:

A.

https:// /model-serving/recommender/Production/invocations


B.

The version number of the model version in Production is necessary to complete this task.


C.

https:// /model/recommender/stage-production/invocations


D.

https:// /model-serving/recommender/stage-production/invocations


E.

https:// /model/recommender/Production/invocations


Expert Solution
Questions # 8:

A machine learning engineer is attempting to create a webhook that will trigger a Databricks Jobjob_idwhen a model version for modelmodeltransitions into any MLflow Model Registry stage.

They have the following incomplete code block:

Question # 8

Which of the following lines of code can be used to fill in the blank so that the code block accomplishes the task?

Options:

A.

"MODEL_VERSION_CREATED"


B.

"MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"


C.

"MODEL_VERSION_TRANSITIONED_TO_STAGING"


D.

"MODEL_VERSION_TRANSITIONED_STAGE"


E.

"MODEL_VERSION_TRANSITIONED_TO_STAGING", "MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"


Expert Solution
Questions # 9:

A data scientist has created a Python functioncompute_featuresthat returns a Spark DataFrame with the following schema:

Question # 9

The resulting DataFrame is assigned to thefeatures_dfvariable. The data scientist wants to create a Feature Store table usingfeatures_df.

Which of the following code blocks can they use to create and populate the Feature Store table using the Feature Store Clientfs?

Options:

A.

Databricks-Machine-Learning-Professional Question 9 Option 1


B.

9


C.

features_df.write.mode("fs").path("new_table")


D.

9


E.

features_df.write.mode("feature").path("new_table")


Expert Solution
Questions # 10:

Which of the following MLflow Model Registry use cases requires the use of an HTTP Webhook?

Options:

A.

Starting a testing job when a new model is registered


B.

Updatingdata in a source table for a Databricks SQL dashboard when a model version transitions to the Production stage


C.

Sending an email alert when an automated testing Job fails


D.

None of these use cases require the use of an HTTP Webhook


E.

Sending a message to a Slack channel when a model version transitions stages


Expert Solution
Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions