The graph is an elbow plot showing the inertia or within-cluster sum of squares on the y-axis and number of clusters (also called K) on the x-axis, denoting the change in inertia as the clusters change using k-means algorithm.
What would be an optimal value of K to ensure a good number of clusters?
Word Embedding describes a task in natural language processing (NLP) where:
Normalization is the transformation of features:
Which two techniques are used to build personas in the ML development lifecycle? (Select two.)
Which of the following can take a question in natural language and return a precise answer to the question?
When working with textual data and trying to classify text into different languages, which approach to representing features makes the most sense?
For each of the last 10 years, your team has been collecting data from a group of subjects, including their age and numerous biomarkers collected from blood samples. You are tasked with creating a prediction model of age using the biomarkers as input. You start by performing a linear regression using all of the data over the 10-year period, with age as the dependent variable and the biomarkers as predictors.
Which assumption of linear regression is being violated?
Which of the following tools would you use to create a natural language processing application?
Which two of the following decrease technical debt in ML systems? (Select two.)
Which of the following tests should be performed at the production level before deploying a newly retrained model?