Excel
PostgreSQL
Visualizations
Statistics
Python
100

What is the difference between a VLOOKUP and Index Match?

VLOOKUP value must be in the first column of the table While INDEX returns a value from a range and MATCH returns the position of a value in a range.

100

What is the difference between DATE_PART and DATE_TRUNC data in SQL?

DATE_TRUNC is continuous and DATE_PART is Discrete. 

or

DATEPART returns an INTEGER value; DATETRUNC returns a DATE

100

What is Chloropleth Chart used for?

Used to represent spatial data

100

What is the difference between Mean, Median and Mode ?

Mean : is the average of all the values in the data set. It is calculated by adding all the values and dividing by the number of values.

Median : is the middle value when all the values are ranked from least to greatest. If there is an even number of values, the median is the average of the two middle values.

Mode : is the value that appears most often in the data set.

100

What is the difference between .append() and .extend()?

.append() : Inserts as a value at the end of the list  

.extend() : Combines a list with another list

200

This command/tool allows you to remove information from sight (NOT DELETE IT) and only see the information you want

FILTER

200

What is the difference between WHERE and HAVING?

WHERE filters individual rows

HAVING filters groups (aggregates)

200

What is the use of Calculated fields in Tableau?

Calculated fields allow you to create new data from data that already exists in your data source. (Create a new column) 

200

How to calculate IQR (Interquartile Range)?

Takes the third quartile value and subtracts the first quartile value.

IQR=Q3-Q1

200

What are the main TWO differences between List, Tuple, and Set?

List:[] , ordered, mutable, non-unique
Tuple: () , ordered, immutable, non-unique
Set: {} , unordered, mutable, unique

300
Mention 2 types of data that can not be cleaned using Excel.

Audio, Video, or Images.

300

Where can you place a Subquery ?

SELECT, FROM, WHERE, HAVING
300

In terms of the mean and median, how can we determine if a distribution is skewed and in what direction?

A distribution is left-skewed if the mean is less than the median. 

A distribution is right-skewed if the mean is greater than the median

300

What does the coefficient of correlation indicate? What is the range of values?

Correlation coefficients are used to measure the strength of the linear relationship between two variables. A correlation coefficient greater than zero indicates a positive relationship while a value less than zero signifies a negative relationship. Range +1 to -1

300

What is the type of error in the following code:

num= 123.467.1

SyntaxError

400

What do we use when we want to highlight cells that meet a certain criteria?

Conditional formatting.

400

Name 3 basic conditions both tables need to meet in order for a INNER JOIN, LEFT JOIN, RIGHT JOIN to occur?

Both tables must have a common column. 

The common column must be unique in each table.

The common column must contain the same data type in both tables. 

Both common column data should match

400

What chart best represent Quartiles?

A box and whisker chart (boxplot) shows distribution of data into quartiles, highlighting the mean and outliers.

400

In standard deviation, what is the 68-95-99.7 Rule?  

The standard deviation is a measure of how spread out a set of data and 68% of data are within one standard deviation, 95% within two standard deviations, and 99.7% within three standard deviations.

400

What does PANDAS stand for?

PANEL DATA

500

A column with 50 cells, of which 42 are numbers, 5 are text, and the rest are empty cells. What is the result of this formula?

=COUNT(A1:A50)+COUNTA(A1:A50)+COUNTBLANK(A1:A50)

92

500

What is the difference between a UNION and a JOIN?

JOIN : combines data by matching rows from the two tables based on a common column.

UNION : combines data by adding rows from the two tables together, even if the rows do not have any matching columns.

500

What is Data-Ink Ratio?

A principle states that extra info should not distract from key relations

500

What is the main difference between the Population and the Sample Standard Deviation Formula?

The population standard deviation formula uses the population size (N), while the sample standard deviation formula uses the sample size minus one (n-1).

500

What is the type of error in the following code:

d={ 1:'Python', '2':'Excel',3:'PowerBi',4:'Tableau'}
d[2]

KeyError

M
e
n
u