Pass the IBM IBM Data and AI: Data and AI C1000-059 Questions and answers with CertsForce

Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions
Questions # 1:

What are two methods used to detect outliers in structured data? (Choose two.)

Options:

A.

multi-label classification


B.

isolation forest


C.

gradient descent


D.

one class Support Vector Machine (SVM)


E.

Word2Vec


Questions # 2:

In which example would recall be preferred over precision?

Options:

A.

recall is always preferred


B.

identify suitable candidates for a job


C.

detection of malignant tumors


D.

book recommendation


Questions # 3:

When should median value be used instead of mean value for imputing missing data?

Options:

A.

for skewed data


B.

for real numbers


C.

for normally distributed data


D.

for large data sets


Questions # 4:

In machine vision, the algorithm for detecting objects or features in an image based on a target pattern is known as?

Options:

A.

OCR


B.

Hough transformation


C.

Fourier transform


D.

normalized correlation


Questions # 5:

What are three operators used by genetic programming? (Choose three.)

Options:

A.

reciprocation


B.

mutation


C.

duel


D.

selection


E.

sheltering


F.

crossover


Questions # 6:

Which statement is true for naive Bayes?

Options:

A.

Naive Bayes can be used for regression.


B.

Let p(C1 | x) and p(C2 | x) be the conditional probabilities that x belongs to class C1 and C2 respectively, in a binary model, log p (C1 | x) – log p(C2 | x) > 0 results in predicting that x belongs to C2.


C.

Naive Bayes is a conditional probability model.


D.

Naive Bayes doesn't require any assumptions about the distribution of values associated with each class.


Questions # 7:

Determine the number of bigrams and trigrams in the sentence. "Data is the new oil".

Options:

A.

3 bigrams, 3 trigrams


B.

4 bigrams, 4 trigrams


C.

3 bigrams, 4 trigrams


D.

4 bigrams, 3 trigrams


Questions # 8:

The least squares optimization technique (The Method of Least Squares) is used in which algorithm?

Options:

A.

Support Vector Machines


B.

Naive Bayes classification


C.

Logistic regression


D.

Linear regression


Questions # 9:

What is the technique called for vectorizing text data which matches the words in different sentences to determine if the sentences are similar?

Options:

A.

Cup of Vectors


B.

Box of Lexicon


C.

Sack of Sentences


D.

Bag of Words


Questions # 10:

What is meant by the curse of dimensionality?

Options:

A.

The number of available algorithms for a given task is high.


B.

The number of available data sources for a given task is high.


C.

The data sparsity becomes more severe as the number of features is increased.


D.

The data sparsity becomes more severe as the number of samples is increased.


Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions