Basics
Special Types
Comparisons
Mode of Data
Applications
100

What is a corpus?


A collection of texts used for linguistic analysis.

100

What is a learner corpus?


This corpus includes texts produced by second language learners.

100

What is a parallel corpus?


A corpus that pairs texts with their translations.

100

What is a written corpus?


A corpus made of newspaper articles, books, and magazines.

100

What is corpus linguistics?


Corpora are mainly used in this branch of study.

200

What is a monolingual corpus?

This corpus contains texts in only one language.

200

What is a monitor corpus?


A corpus continuously updated with new data.

200

What is a comparable corpus?


A corpus with similar texts in different languages that are not translations of each other.

200

What is a spoken corpus?

A corpus built from spoken conversations or transcripts.

200

What is a concordancer?


This tool is often used to search through corpora.

300

What is a multilingual corpus?


This corpus contains texts in more than one language.

300

What is a specialized corpus?


A corpus focused on one domain, such as law or medicine.

300

What is “parallel corpora contain translated texts; comparable corpora contain texts of a similar genre that are not translations”?


The difference between parallel and comparable corpora.

300

What is a mixed-mode corpus?

A corpus that mixes both speech and writing.

300

What is language teaching / error analysis?

A corpus of learners’ texts can help improve this.

400

What is a general corpus?


A corpus that is balanced across genres, registers, and text types.

400

What is a diachronic corpus?


A corpus containing historical texts across time periods.

400

What is a multilingual parallel corpus?

A corpus with translated texts across multiple languages.

400

What is an electronic corpus / digital corpus?

A corpus built from computer-mediated communication (e.g., social media).

400

What is a parallel corpus?


This corpus type is crucial for training machine translation systems.

500

What is a reference corpus?


This type of corpus is designed to serve as a standard benchmark for study.

500

What is a synchronic corpus?


A corpus that focuses on everyday present-day language.

500

What is a comparable specialized corpus?

If you compare English and Spanish medical research articles that are not translations of each other, you are using this type of corpus.

500

What is an academic corpus?


A corpus designed specifically for classroom speech and academic writing.

500

What is a monitor corpus?


This type of corpus helps track slang, new words, and language change in real time.
