EMC Advanced Analytics Specialist Exam for Data Scientists E20-065 Question # 15 Topic 2 Discussion

E20-065 Exam Topic 2 Question 15 Discussion:
Question #: 15
Topic #: 2

You conduct a TFIDF analysis on 3 documents containing raw text and derive TFIDF ("data", document y) = 1.908. You know that the term "data" only appears in document 2.

What is the TF of "data" in document 2?


A.

2 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal LOG (3^2)=0.954

Therefore, TFIDF=TF*0.954 = 1.908

TF will then round to 2


B.

4 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal LOG (3/1)=0.477

Therefore, TFIDF=TF*0.477 = 1.908

TF will then round to 4


C.

6 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal 3/1=3

Therefore, TFIDF=TF/3 = 1.908

TF will then round to 6


D.

11 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal LOG(3/2)=0.176

Therefore, TFIDF=TF*0.176 = 1.908

TF will then round to 11
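
The arithmetic in the options can be checked with a short sketch. It assumes the common definition IDF = log10(N / df), where N is the number of documents and df is the number of documents containing the term; the base-10 logarithm is inferred from the figures in the options (log10(3) is about 0.477). The function name `tf_from_tfidf` is hypothetical and used only for illustration.

```python
import math


def tf_from_tfidf(tfidf: float, n_docs: int, docs_with_term: int) -> float:
    """Recover TF from a TFIDF score, assuming TFIDF = TF * log10(N / df)."""
    idf = math.log10(n_docs / docs_with_term)
    return tfidf / idf


# Values from the question: 3 documents, "data" appears in only 1 of them.
tf = tf_from_tfidf(1.908, n_docs=3, docs_with_term=1)
print(f"IDF = {math.log10(3 / 1):.3f}")
print(f"TF  = {tf:.3f} (rounds to {round(tf)})")
```

Under these assumptions, N = 3 and df = 1 give IDF = log10(3/1), and dividing 1.908 by that value gives a TF that rounds to 4, which matches the reasoning in option B.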

