What is the difference between a VLOOKUP and Index Match?
VLOOKUP value must be in the first column of the table While INDEX returns a value from a range and MATCH returns the position of a value in a range.
What is the difference between a UNION and a JOIN?
JOIN : combines data by matching rows from the two tables based on a common column.
UNION : combines data by adding rows from the two tables together, even if the rows do not have any matching columns.
What is the median of the following dataset [1, 2, 3, 7, 7, 10] ?
5
what is the difference between = and ==?
= is used for assignment
== is for comparison
A DataFrame function that allows you to order the DataFrame by column(s)
sort_values()
This command/tool allows you to remove information from sight (NOT DELETE IT) and only see the information you want
FILTER
What are two operator symbols that mean not equal?
!=
<>
In Power BI, Which visual filters a dashboard more interactively?
Slicer
A function that removes row(s) with missing values in a pandas DataFrame?
What are the main TWO differences between List, Tuple, and Set?
List: [] , ordered, mutable, non-unique
Tuple: () , ordered, immutable, non-unique
Set: {} , unordered, mutable, unique
Mention 2 types of data that can not be cleaned using Excel.
Audio, Video, or Images.
What result would this function give?
RIGHT(LEFT('yeeerrrr', 4 ),3)
eee
How many axis are in a dual axis?
3
Name five basic DataFrame functions used for EDA
.head()/.tail() , .info() , .describe() , .value_counts() , .isna()/.isnull() , .duplicated() , .corr() , .nlargest( )/.nsmallest() , .nunique( ) , .sample( )
What is the difference between iloc and loc ?
loc : Selects rows using row labels.
iloc : Selects rows using the integer positions
To concatenate values in an Excel formula, you need to use ....
Ampersand (&)
Name 3 basic conditions both tables need to meet in order for a INNER JOIN, LEFT JOIN, RIGHT JOIN to occur?
Both tables must have a common column.
The common column must be unique in each table.
The common column must contain the same data type in both tables.
Both common column data should match
In standard deviation, what is the 68-95-99.7 Rule?
The standard deviation is a measure of how spread out a set of data and 68% of data are within one standard deviation, 95% within two standard deviations, and 99.7% within three standard deviations.
What is the type of error in the following code:
d={ 1:'Python', '2':'Excel',3:'PowerBi',4:'Tableau'}
d[2]
KeyError
Which of the following is is similar? and Why?
append()
extend()
+
append() : Inserts as a value at the end of the list
extend() : Combines a list with another list
+ : Combines a list with another list
_____________________ rules place restrictions on the types of data that can be entered into cells.
Data Validation
What is the difference between DATE_PART and DATE_TRUNC data in SQL?
DATE_TRUNC is continuous and DATE_PART is Discrete.
or
DATEPART returns an INTEGER value; DATETRUNC returns a DATE
In terms of the mean and median, how can we determine if a distribution is skewed?
BONUS (+100): With reference to the mean / median, when can we expect to get a right / left skew?
If the difference between them is small, then its closer to a normal distribution. When its large it indicates that the chart is skewed.
BONUS: A distribution is left-skewed if the mean is less than the median.
A distribution is right-skewed if the mean is greater than the median
What is the type of error in the following code: num= 123.467.1
SyntaxError
What is the output of the following:
print([' '.join(['yadot', 'ylrae']+['evael', 'nac', 'uoy'])][0][::-1])
you can leave early today