menu search
brightness_auto
more_vert
1 1

Which words in a corpus have the highest values and which ones have the least?

Topic   Natural Language Processing (AI Domain)
Type  Short answer type
Class 10
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike

1 Answer

more_vert
 
verified
Verified Answer
1

Stop words like - and, this, is, the, etc. have the highest values in a corpus. But these words do not talk about the corpus at all. Hence, these are termed stopwords and are mostly removed at the pre-processing stage only.

Rare or valuable words occur the least but add the most important to the corpus. Hence, when we look at the text, we take frequent and rare words into consideration.

The graph of value of words in a corpus


Study more about Natural Language Processing at Natural Language Processing Class 10 

thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike

Related questions

thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 2 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 3 like thumb_down_off_alt 0 dislike
1 answer
Welcome to Aiforkids, where you can ask questions and receive answers from other members of the community.

AI 2024 Class 10 Board Exams mein 100% laane ka plan OPEN NOW

Class 10 Complete One Shot AI Lectures at - Youtube

1.5k questions

1.4k answers

4 comments

11.5k users

...