Databricks Certified Machine Learning Associate Exam Databricks-Machine-Learning-Associate Question # 22 Topic 3 Discussion

Databricks Certified Machine Learning Associate Exam Databricks-Machine-Learning-Associate Question # 22 Topic 3 Discussion

Databricks-Machine-Learning-Associate Exam Topic 3 Question 22 Discussion:
Question #: 22
Topic #: 3

A data scientist has replaced missing values in their feature set with each respective feature variable’s median value. A colleague suggests that the data scientist is throwing away valuable information by doing this.

Which of the following approaches can they take to include as much information as possible in the feature set?


A.

Impute the missing values using each respective feature variable's mean value instead of the median value


B.

Refrain from imputing the missing values in favor of letting the machine learning algorithm determine how to handle them


C.

Remove all feature variables that originally contained missing values from the feature set


D.

Create a binary feature variable for each feature that contained missing values indicating whether each row's value has been imputed


E.

Create a constant feature variable for each feature that contained missing values indicating the percentage of rows from the feature that was originally missing


Get Premium Databricks-Machine-Learning-Associate Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.