NVIDIA Generative AI LLMs NCA-GENL Question # 18 Topic 2 Discussion

NVIDIA Generative AI LLMs NCA-GENL Question # 18 Topic 2 Discussion

NCA-GENL Exam Topic 2 Question 18 Discussion:
Question #: 18
Topic #: 2

Why might stemming or lemmatizing text be considered a beneficial preprocessing step in the context of computing TF-IDF vectors for a corpus?


A.

It reduces the number of unique tokens by collapsing variant forms of a word into their root form, potentially decreasing noise in the data.


B.

It enhances the aesthetic appeal of the text, making it easier for readers to understand the document’s content.


C.

It increases the complexity of the dataset by introducing more unique tokens, enhancing the distinctiveness of each document.


D.

It guarantees an increase in the accuracy of TF-IDF vectors by ensuring more precise word usage distinction.


Get Premium NCA-GENL Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.