EMC Advanced Analytics Specialist Exam for Data Scientists E20-065 Question # 15 Topic 2 Discussion
You conduct a TFIDF analysis on 3 documents containing raw text and derive TFIDF("data", document y) = 1.908. You know that the term "data" only appears in document 2.
What is the TF of "data" in document 2?
A.
2 based on the following reasoning:
TFIDF = TF*IDF = 1.908
You then know that IDF will equal LOG (3^2)=0.954
Therefore, TFIDF=TF*0.954 = 1.908
TF will then round to 2
B.
4 based on the following reasoning:
TFIDF = TF*IDF = 1.908
You then know that IDF will equal LOG (3/1)=0.477
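The arithmetic behind the options can be checked with a short sketch. It assumes the unsmoothed definition IDF = log10(N / number of documents containing the term), a raw-count TF, and that the stated TFIDF value refers to document 2. The function name `tf_from_tfidf` is hypothetical and only for illustration.

```python
import math

def tf_from_tfidf(tfidf, n_docs, docs_with_term):
    """Recover TF from TFIDF = TF * IDF.

    Assumption: IDF = log10(n_docs / docs_with_term), with no smoothing.
    """
    idf = math.log10(n_docs / docs_with_term)
    return tfidf / idf

# 3 documents in total; "data" appears in only 1 of them (document 2).
idf = math.log10(3 / 1)
tf = tf_from_tfidf(1.908, n_docs=3, docs_with_term=1)

print(round(idf, 3))  # 0.477
print(round(tf))      # 4
```

Under these assumptions the IDF is about 0.477, so a TFIDF of 1.908 corresponds to a TF that rounds to 4, which matches the reasoning started in option B.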