Correlation vs Causation
Types of Data
Statistics
Graph Types
Data Structures
100

This term describes when two variables appear to be related but one does not necessarily cause the other

What is Correlation

100

This type of data is descriptive and often involves language or interpretation

What is qualitative data

100

True or False, mean, median, and mode can sometimes all be the same value. 

True!

100

This type of graph shows the distribution of a data set by splitting the data into bins and displaying the frequency of each bin

What is a histogram

100

This basic data structure is an ordered collection of elements, typically accessed by their position or index

What is an array

200

Researchers find that smoking is linked to an increased risk of lung cancer

what is causation

200

This type of data can only take on integer values, such as the number of students in a class.

What is discrete data

200

This statistical measure tells us how spread out the numbers in a data set are and is the square of the standard deviation

What is the variance

200

This type of plot displays data points and can show the relationship between two quantitative variables

What is a scatter plot

200

In an array, this is the index of the first element

What is 0

300

When two variables both decrease together, such as time spent exercising and body fat percentage, this type of correlation is observed

What is a negative correlation

300

This type of data can be any value within a range and includes measurements like time and weight.

What is continuous data

300

This type of distribution is symmetrical and bell-shaped, often referred to as the "normal distribution. What's another name for this distribution? 

What is the Gaussian distribution

300

This graph uses boxes to represent the interquartile range and whiskers to show variability outside the upper and lower quartiles, often used to identify outliers

What is a box plot

300

This structure consists of nodes where each node contains data and a reference to the next node in the sequence

What is a linked list

400

Finding that students who attend more tutoring sessions have better grades, yet not being able to claim tutoring causes better grades, might be due to this kind of variable.

What is a confounding variable

400

This type of data can be both qualitative and quantitative, and includes categories such as "freshman," "sophomore," "junior," and "senior.

What is categorical data

400

The probability of an event happening given that another event has already happened is known as this

What is conditional probability

400

This type of graph displays quantitative data in a two-dimensional grid, with colors representing different values.

What is a heat map

400

This data structure can efficiently store and retrieve student grades based on their student ID.

What is a dictionary

500

This principle states that the simplest explanation is usually the correct one, often used to argue against unnecessary assumptions about causality.

What is Occam's Razor

500

This type of data can be categorized and ordered, but the intervals between categories are not meaningful or equal, such as ranks in a competition

What is ordinal data

500

This non-parametric test is used to determine if there are significant differences between the expected frequencies and the observed frequencies in one or more categories

What is the chi-square test

500

This visualization technique shows the structure of relationships within a dataset, where nodes represent data points and edges represent connections between them

What is a network graph

500

This term describes the first node in a linked list

What is the head

M
e
n
u