Vector Space
Boolean Retrieval
Web IR
Cloud Computing
Virtualization
100

This measure is used to calculate similarity between two vectors in a vector space

What is Cosine?

100

This logical operator retrieves documents that contain both terms.

What is AND?

100

The process search engines use to automatically collect web pages is called

What is Crawling?

100

This type of cloud is operated solely for a single organization.

What is Private?

100

This software layer allows multiple operating systems to run on a single physical machine.

What is Hypervisor?

200

Weighting terms by their importance across documents is called ____ frequency

What is TF-IDF?

200

This operator retrieves documents that contain either of the terms

What is OR

200

Search engines store copies of web pages in a ____ for fast retrieval

What is Index?

200

Cloud services that provide virtualized computing resources over the internet are called ____.

What is IaaS?

200

A fully independent virtual machine that runs its own OS is called a ____.

What is VM?

300

This type of matrix shows the frequency of terms across documents

What is Term-Document?

300

This operator retrieves documents that contain the first term but not the second

What is NOT?

300

The algorithm used by Google to rank web pages based on links is called ____.

What is PageRank?

300

This model delivers software applications over the internet without local installation.

What is SaaS?

300

This type of virtualization abstracts hardware at the CPU level to run multiple OS instances.

What is Hardware?

400

A document in vector space modeling is represented as a ____ of numbers

What is Vector?

400

A search system that matches documents strictly based on presence or absence of terms uses ____ retrieval

What is Boolean?

400

A list of URLs discovered but not yet visited by a crawler is called a ____.

What is Frontier?

400

Cloud resources that can automatically scale up or down based on demand are said to be ____.

What is Elastic?

400

The process of dividing a single physical server into multiple virtual servers is called ____.  

What is Partitioning?

500

In NLP, the smallest unit of text usually considered in a vector space is a ____

What is Word?

500

This symbol is often used to group terms in a Boolean query.

What is Parentheses?

500

This metric measures the relevance of a search result based on how many users click on it.

What is Click-through?

500

This type of cloud combines private and public cloud environments.

What is Hybrid?

500

This is a virtual version of a physical server, storage device, or network resource.

What is Virtual Machine?