In text normalization, the text is reduced to a minimal vocabulary by:
(i) Eliminating stop words, special characters, and numbers from the vocabulary of the corpus.
(ii) Removing affixes and converting words to their base form using stemming or lemmatization.
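
The two steps above can be sketched in a few lines of Python. This is only a minimal illustration, not a production pipeline: the stop-word list, the suffix list, and the helper names `naive_stem` and `normalize` are all assumptions made up for this example. Real projects would normally use a proper stemmer or lemmatizer from an NLP library instead of the crude suffix stripping shown here.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "were",
              "and", "or", "in", "on", "of", "to"}

# Suffixes tried by the toy stemmer, in this order (hypothetical, not a real algorithm).
SUFFIXES = ("ing", "ed", "es", "s")


def naive_stem(word):
    """Strip at most one common suffix; a crude stand-in for a real stemmer."""
    for suffix in SUFFIXES:
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def normalize(text):
    """Step (i): drop numbers, special characters, stop words. Step (ii): stem."""
    # Keeping only runs of letters discards digits and special characters.
    tokens = re.findall(r"[a-z]+", text.lower())
    return [naive_stem(t) for t in tokens if t not in STOP_WORDS]


print(normalize("The 3 cats were jumping on the boxes!"))
# prints ['cat', 'jump', 'box']
```

Here the number "3", the "!" and the stop words "the", "were", "on" are removed in step (i), and the remaining words lose their suffixes in step (ii), so seven words shrink to a three-word vocabulary.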