What is a corpus?

by Rohit (11.7k points) in Other asked Feb 2, 2022 2.7k views

What is a corpus?

Topic	Natural Language Processing
Type	Sample Question Term 2 CBSE
Class	10

1 Answer

by aiforkids (19.4k points) answered Feb 2, 2022

In Text Normalization, we undergo several steps to normalize the text to a lower level. That is, we will be working on text from multiple documents and the term used for the whole textual data from all the documents altogether is known as corpus.

OR

A corpus is a large and structured set of machine-readable texts that have been produced in a natural communicative setting.

OR

A corpus can be defined as a collection of text documents. It can be thought of as just a bunch of text files in a directory, often alongside many other directories of text files.

← Prev Question Next Question →

What is a corpus?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Related questions

Categories