menu search
brightness_auto
more_vert
2 1

Does the vocabulary of a corpus remain the same before and after text normalization? Why?

Topic   Natural Language Processing (AI Domain)
Type  Short answer type
Class 10
thumb_up_off_alt 2 like thumb_down_off_alt 0 dislike

1 Answer

more_vert
 
verified
Verified Answer
1

No, the vocabulary of a corpus does not remain the same before and after text normalization. Reasons are –

1. In normalization, the text is normalized through various steps and is lowered to minimum vocabulary since the machine does not require grammatically correct statements but the essence of it.

2. In normalization Stop words, Special Characters, and Numbers are removed

3. In stemming the affixes of words are removed and the words are converted to their base form.

4. So, after normalization, we get a reduced vocabulary.


Study more about Natural Language Processing at Natural Language Processing Class 10   

thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike

Related questions

thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 0 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
thumb_up_off_alt 1 like thumb_down_off_alt 0 dislike
1 answer
Welcome to Aiforkids, where you can ask questions and receive answers from other members of the community.

AI 2024 Class 10 Board Exams mein 100% laane ka plan OPEN NOW

Class 10 Complete One Shot AI Lectures at - Youtube

1.5k questions

1.4k answers

4 comments

5.0k users

...