Pass the EMC Data Science D-DS-FN-23 Questions and answers with CertsForce

Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions
Questions # 1:

You have been given a task to improve sales force compensation of your organization. As a result of a study, your team decides to classify personnel as follows:

● Did not meet quota

● Met quota

● Exceeded 150% of quota

In which data analytics lifecycle phase should you define these categories for analysis purposes?

Options:

A.

Model building


B.

Communicate results


C.

Operationalize


D.

Model planning


Expert Solution
Questions # 2:

In which programming language is Hadoop written?

Options:

A.

C++


B.

Scala


C.

Java


D.

Python


Expert Solution
Questions # 3:

Which analytic technique would be appropriate to estimate home sale price in U.S. dollars as a function of square footage, number of bedrooms, and lot size?

Options:

A.

Time series analysis


B.

Linear regression


C.

Naive Bayesian classification


D.

K-means clustering


Expert Solution
Questions # 4:

After running a density plot you realize that the data has a long tail to the right. What can you do to make the dataset more normally distributed?

Options:

A.

Use a scatter plot to obtain a better picture


B.

Use a histogram to obtain a better picture


C.

Apply a square transformation


D.

Apply a logarithmic transformation


Expert Solution
Questions # 5:

Consider the following text:

“Stop!” he shouted. “Don’t go there!”

What set of words result from using a tokenizer for punctuation on the text?

Options:

A.

Stop, he, shouted, don. t. go. there


B.

Stop, he shouted, don. t go there


C.

Stop, he shouted, dpnt go there


D.

Stop, he, shouted, dpnt. go. there


Expert Solution
Questions # 6:

In the data preparation phase of the data analytics lifecycle, what does the term “data conditioning” refer to?

Options:

A.

Building training and testing datasets


B.

Identifying relationships and correlations among variables


C.

Deploying the model and monitoring its performance


D.

Cleaning the data, normalizing datasets. and performing transformations


Expert Solution
Questions # 7:

What data asset is an example of quasi-structured data?

Options:

A.

Excel file


B.

Clickstream data


C.

Relational database table


D.

Comma-separated value file


Expert Solution
Questions # 8:

What is a business driver for Big Data analytics adoption?

Options:

A.

Implement the latest technology and tools


B.

Maintain existing data silos


C.

Identify new business opportunities


D.

Ensure the analysts work in isolation


Expert Solution
Questions # 9:

In ANOVA, what is the null hypothesis for k population means?

Options:

A.

All population means are equal to each other


B.

At least two population means are equal


C.

At least two population means are not equal


D.

At most k-1 population means are equal


Expert Solution
Questions # 10:

Question # 10

Refer to the exhibit, which shows pairwise counts for items purchased together.

Consider the following association rule: Milk -> Eggs

What is value of the lift?

Options:

A.

1.18


B.

0.264


C.

120


D.

70.81


Expert Solution
Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions