You have been given a task to improve sales force compensation of your organization. As a result of a study, your team decides to classify personnel as follows:
● Did not meet quota
● Met quota
● Exceeded 150% of quota
In which data analytics lifecycle phase should you define these categories for analysis purposes?
In which programming language is Hadoop written?
Which analytic technique would be appropriate to estimate home sale price in U.S. dollars as a function of square footage, number of bedrooms, and lot size?
After running a density plot you realize that the data has a long tail to the right. What can you do to make the dataset more normally distributed?
Consider the following text:
“Stop!” he shouted. “Don’t go there!”
What set of words result from using a tokenizer for punctuation on the text?
In the data preparation phase of the data analytics lifecycle, what does the term “data conditioning” refer to?
What data asset is an example of quasi-structured data?
What is a business driver for Big Data analytics adoption?
In ANOVA, what is the null hypothesis for k population means?
Refer to the exhibit, which shows pairwise counts for items purchased together.
Consider the following association rule: Milk -> Eggs
What is value of the lift?