Corpus Basics
By Purpose
By Time
By Access Type
By content
100

What is a corpus?

Correct: A collection of written or spoken texts used for linguistic research.


100

Corpus designed to study general features of a language.

General (reference) corpus.

100

Corpus that includes texts from one specific time period.

Synchronic corpus.

100

Corpus freely available to everyone online.

Open corpus.

100

Corpus made of written texts such as books or newspapers.

Written corpus.

200

The main purpose of a corpus.

 To analyze language use and patterns in real contexts.

200

 Corpus built for a specific field, like law or medicine.

Specialized corpus.

200

Corpus that includes texts from different historical periods.

Diachronic (historical) corpus.

200

Corpus available only to registered researchers or organizations.

Restricted corpus.

200

Corpus consisting of spoken language transcriptions.

Spoken corpus.

300

What is corpus linguistics?

The study of language based on examples taken from real texts (corpora).

300

Corpus used to compare two or more languages.

Parallel (comparable) corpus.

300

The main goal of a diachronic corpus.

To study language change over time.

300

Corpus you must buy or subscribe to use.

Commercial corpus.

300

Corpus that combines both written and spoken texts.

Multimodal corpus.

400

Name one famous English corpus.

The British National Corpus (BNC).


400

Corpus created for educational purposes.

Learner corpus.

400

A corpus with continuously updated data.

Monitor corpus.

400

Corpus that can be accessed with a password or login.

Closed corpus.

400

Corpus containing texts translated from other languages.

Translation corpus.

500

What is a concordancer?

A software tool that finds and displays word occurrences in a corpus.

500

Corpus used to build dictionaries and grammar books.

Reference corpus.


500

Example of a historical corpus.

Helsinki Corpus of English Texts.

500

Which type of corpus gives public access to its data?

Open corpus.

500

Corpus created from social media posts or online communication.

Web corpus.