Pass the Databricks Certification Databricks-Certified-Professional-Data-Scientist Questions and Answers with CertsForce

Questions # 31:

Which of the following is a correct example of the target variable in regression (supervised learning)?

Options:

A. Nominal values such as true, false
B. Reptile, fish, mammal, amphibian, plant, fungi
C. An infinite number of numeric values, such as 0.100, 42.001, 1000.743
D. All of the above


Questions # 32:

Select the statement that correctly applies to Naive Bayes.

Options:

A. Works with a small amount of data
B. Sensitive to how the input data is prepared
C. Works with nominal values


Questions # 33:

Which activity is performed in the Operationalize phase of the Data Analytics Lifecycle?

Options:

A. Define the process to maintain the model
B. Try different analytical techniques
C. Try different variables
D. Transform existing variables


Questions # 34:

You have data on 1,000 patients, with age in years and height in meters. You want to build clusters from these two attributes so that age and height have a nearly equal effect on the clustering. What can you do?

Options:

A. Add the numeric value 100 to each height
B. Convert each height value to centimeters
C. Divide both age and height by their respective standard deviations
D. Take the square root of height
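To make the scaling idea in option C concrete, here is a minimal sketch of dividing each attribute by its own standard deviation. The five sample ages and heights are made-up values for illustration only (they are not from the question), and `scale_by_std` is a hypothetical helper name.

```python
from statistics import pstdev

# Hypothetical sample values, NOT from the question: ages in years, heights in meters.
ages_years = [25, 40, 61, 33, 52]
heights_m = [1.60, 1.75, 1.82, 1.68, 1.71]

def scale_by_std(values):
    """Divide every value by the population standard deviation of its column."""
    sd = pstdev(values)
    return [v / sd for v in values]

scaled_age = scale_by_std(ages_years)
scaled_height = scale_by_std(heights_m)

# Before scaling, the age spread is far larger than the height spread, so age
# would dominate a Euclidean distance. After scaling, both columns have unit spread.
print(pstdev(ages_years), pstdev(heights_m))
print(pstdev(scaled_age), pstdev(scaled_height))
```

Under this sketch both scaled columns end up with a standard deviation of 1, so neither attribute dominates a distance-based clustering purely because of its units.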


Questions # 35:

A problem statement is given below:

Hospital records show that of patients suffering from a certain disease, 75% die of it. What is the probability that of 6 randomly selected patients, 4 will recover?

Which of the following models would you use to solve it?

Options:

A. Binomial
B. Poisson
C. Normal
D. Any of the above
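The problem describes a fixed number of trials, each with two outcomes (recover or die). As a rough sketch, assuming the six patients are independent and each recovers with probability 0.25 (the complement of the stated 75% mortality), the probability of exactly 4 recoveries can be computed with the binomial formula. The helper name `binomial_pmf` is illustrative.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(exactly k successes in n independent trials with success probability p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Assumption: "recover" is the complement of "die", so p = 1 - 0.75.
p_recover = 1 - 0.75
prob_four_recover = binomial_pmf(4, 6, p_recover)
print(prob_four_recover)
```

Written out, this is C(6, 4) × 0.25^4 × 0.75^2 = 15 × 0.00390625 × 0.5625, or about 0.033.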


Questions # 36:

In which of the following scenarios can you use regression to predict values?

Options:

A. Samsung can use it for mobile sales forecasts
B. Mobile companies can use it to forecast manufacturing defects
C. Probability of a celebrity divorce
D. Only A and B
E. All of A, B, and C


Questions # 37:

Your customer provided you with 2,000 unlabeled records and asked you to separate them into three groups. What is the correct analytical method to use?

Options:

A. Semi Linear Regression
B. Logistic regression
C. Naive Bayesian classification
D. Linear regression
E. K-means clustering
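For readers unfamiliar with how unlabeled records can be split into a fixed number of groups, the following is a minimal pure-Python sketch of k-means (Lloyd's algorithm) with k = 3. Everything here is an assumption for illustration: the nine 2-D points stand in for the customer's records, and the deterministic farthest-first initialization is one simple choice, not something the question prescribes.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two 2-D points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def farthest_first_init(points, k):
    """Deterministic start: first point, then repeatedly the point farthest from chosen centroids."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centroids)))
    return centroids

def kmeans(points, k, iterations=10):
    """Alternate assignment and centroid-update steps for a fixed number of iterations."""
    centroids = farthest_first_init(points, k)
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = (
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                )
    return centroids, labels

# Hypothetical unlabeled records (stand-ins for the 2,000 in the question).
records = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (20, 0), (20, 1), (21, 0)]
centroids, labels = kmeans(records, k=3)
print(labels)
```

No labels are supplied at any point; the group assignments come only from distances between records, which is what distinguishes clustering from the classification and regression options.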


Questions # 38:

Suppose a man told you he had a nice conversation with someone on the train. Not knowing anything about this conversation, the probability that he was speaking to a woman is 50% (assuming the train had an equal number of men and women and the speaker was as likely to strike up a conversation with a man as with a woman). Now suppose he also told you that his conversational partner had long hair. It is now more likely he was speaking to a woman, since women are more likely to have long hair than men. ____________ can be used to calculate the probability that the person was a woman.

Options:

A. SVM
B. MLE
C. Bayes' theorem
D. Logistic Regression
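The scenario updates a prior belief (50% woman) with new evidence (long hair). A minimal sketch of that update is shown below. The question gives no hair-length frequencies, so the two likelihoods used here (0.75 for women, 0.15 for men) are purely hypothetical placeholders.

```python
# Prior from the question: equally likely to be speaking with a woman or a man.
p_woman = 0.5
p_man = 1 - p_woman

# ASSUMED likelihoods, not from the question: chance of long hair given each gender.
p_long_given_woman = 0.75
p_long_given_man = 0.15

# Total probability of observing long hair.
p_long = p_long_given_woman * p_woman + p_long_given_man * p_man

# Posterior: P(woman | long hair) = P(long hair | woman) * P(woman) / P(long hair)
p_woman_given_long = p_long_given_woman * p_woman / p_long
print(p_woman_given_long)
```

With these assumed inputs the posterior rises from 0.5 to 5/6. Different likelihoods would give a different number, but the posterior exceeds the prior whenever long hair is more common among women than men.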


Questions # 39:

Let A denote the event 'student is female' and let B denote the event 'student is French'. In a class of 100 students, suppose 60 are French, and suppose that 10 of the French students are female. Find the probability that a randomly picked French student is female, that is, find P(A|B).

Options:

A. 1/3
B. 2/3
C. 1/6
D. 2/6
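The requested quantity follows from the definition P(A|B) = P(A and B) / P(B). A short sketch using the counts given in the question and exact fractions (variable names are illustrative):

```python
from fractions import Fraction

class_size = 100
french = 60              # students for whom B holds
french_and_female = 10   # students for whom both A and B hold

p_b = Fraction(french, class_size)
p_a_and_b = Fraction(french_and_female, class_size)

# Conditional probability: restrict attention to French students only.
p_a_given_b = p_a_and_b / p_b
print(p_a_given_b)
```

The class size cancels out, so the result is simply 10 out of the 60 French students.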


Questions # 40:

In which lifecycle stage are appropriate analytical techniques determined?

Options:

A. Model planning
B. Model building
C. Data preparation
D. Discovery

