In text normalization, the text is reduced to a minimum vocabulary by—
(i) eliminating stop words, special characters, and numbers from the vocabulary of a corpus.
(ii) Deleting the affixes and converting the words to their base form using stemming and lemmatization.
Study more about NLP at NLP Class 10