This command allows me to see all my remotes
git remote -v
What do these three measures (mean, median, mode) measure?
They measure central tendency.
What is the output of random.choice('Esraa')?
A single character chosen at random: 'E', 's', 'r', or 'a' ('a' is twice as likely because it appears twice).
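A quick sketch to check this, assuming only the standard library. The draw count of 10,000 is an arbitrary choice for illustration.

```python
import random
from collections import Counter

name = "Esraa"

# One draw: a single character picked uniformly by position in the string.
pick = random.choice(name)
print(pick)

# Many draws: 'a' occupies two of the five positions, so it should
# show up roughly twice as often as each other letter.
draws = Counter(random.choice(name) for _ in range(10_000))
print(draws)
```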
This function allows us to loop over a list and retrieve the index and element
enumerate
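For example, with a made-up list of fruits (the list contents are only illustrative):

```python
fruits = ["apple", "banana", "cherry"]  # hypothetical example list

pairs = []
for index, fruit in enumerate(fruits):
    # enumerate yields (index, element) tuples, starting at index 0
    pairs.append((index, fruit))
    print(index, fruit)
```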
These two methods report descriptive statistics and information (such as missing values and column datatypes) for a DataFrame, respectively.
.describe() & .info()
These two operations we discussed are not git commands, but exist in GitHub
Fork & Pull Request
Describe a distribution that is positively skewed in terms of the central tendency
mode < median < mean
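A small check of that ordering, using made-up values with a long right tail and the standard library's statistics module:

```python
import statistics

# Hypothetical data: most values are small, a few large ones pull the mean up.
data = [1, 2, 2, 2, 3, 3, 4, 5, 9, 15]

mode = statistics.mode(data)      # most frequent value
median = statistics.median(data)  # middle of the sorted values
mean = statistics.mean(data)      # arithmetic average

print(mode, median, mean)  # 2 3.0 4.6
```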
What is the difference between iloc and loc?
loc gets rows (or columns) with particular labels from the index.
iloc gets rows (or columns) at particular positions in the index (so it only takes integers).
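A minimal sketch of the difference, assuming a tiny made-up DataFrame with string labels in the index:

```python
import pandas as pd

# Hypothetical data: names as index labels, one column of scores.
df = pd.DataFrame({"score": [90, 75, 60]}, index=["amy", "bob", "cat"])

by_label = df.loc["bob", "score"]  # look up by index label and column label
by_position = df.iloc[1, 0]        # look up by integer row and column position

# Slicing differs too: label slices include the end label,
# position slices exclude the end position.
label_slice = df.loc["amy":"bob"]  # 2 rows
position_slice = df.iloc[0:1]      # 1 row
```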
“IndexError: list index out of range”: when does this error happen?
When indexing past the last index of a list.
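For instance, a three-element list only has indices 0, 1 and 2:

```python
numbers = [10, 20, 30]  # valid indices are 0, 1, 2

try:
    numbers[3]  # one past the last valid index
except IndexError as error:
    message = str(error)
    print(message)  # list index out of range
```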
This method is the inverse of .pivot_table?
.melt
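A rough round trip on made-up data: pivot a long table to wide, then melt it back. The column names here are hypothetical, and pivot_table aggregates with the mean by default, so the scores come back as floats.

```python
import pandas as pd

# Hypothetical long-format data: one row per (student, subject) pair.
long_df = pd.DataFrame({
    "student": ["amy", "amy", "bob", "bob"],
    "subject": ["math", "art", "math", "art"],
    "score": [90, 80, 70, 60],
})

# Long to wide: one row per student, one column per subject.
wide = long_df.pivot_table(index="student", columns="subject", values="score")

# Wide to long again: melt turns the subject columns back into rows.
back = wide.reset_index().melt(
    id_vars="student", var_name="subject", value_name="score"
)
print(back)
```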
What is a merge conflict?
A merge conflict is caused by competing line changes, such as when people make different changes to the same line of the same file
Difference between a probability mass function and a probability density function
pmf - For discrete random variables. pmf(x) is the probability the random variable equals x.
pdf - For continuous random variables. The probability of a specific value occurring is 0; the probability that the random variable falls between a and b is the integral from a to b of the pdf.
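A small illustration using only the standard library. The fair die and the standard normal distribution are example choices, not anything specific to these notes.

```python
from fractions import Fraction
from statistics import NormalDist

# Discrete case: pmf of a fair six-sided die, P(X = x) = 1/6 for each face.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}
p_three = die_pmf[3]  # probability the roll equals exactly 3

# Continuous case: standard normal distribution.
# P(a < X < b) is the integral of the pdf from a to b,
# which is the same as cdf(b) - cdf(a).
z = NormalDist(mu=0, sigma=1)
p_between = z.cdf(1) - z.cdf(-1)  # about 0.683
density_at_zero = z.pdf(0)        # a density (about 0.399), not a probability

print(p_three, p_between, density_at_zero)
```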
What happens to the DataFrame when I call reset_index?
The original index becomes a column and a new default integer index is created.
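A minimal sketch with a made-up DataFrame whose index is named "student":

```python
import pandas as pd

# Hypothetical data: the index holds student names.
df = pd.DataFrame(
    {"score": [90, 75]},
    index=pd.Index(["amy", "bob"], name="student"),
)

flat = df.reset_index()
print(flat)
# "student" is now a regular column and the index is 0, 1
```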
What method can we use when looping over a dictionary to retrieve key and value pairs?
.items()
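For example, with a made-up dictionary of ages:

```python
ages = {"amy": 31, "bob": 27}  # hypothetical example data

lines = []
for name, age in ages.items():
    # .items() yields (key, value) tuples, unpacked here into name and age
    lines.append(f"{name} is {age}")

print(lines)
```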
Describe this relative path: pd.read_csv("../../datasets/data.csv")
We go two levels up from the current working directory, then into the datasets folder, where the data.csv file is located.
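To see how the path resolves, here is a sketch with a hypothetical working directory (the /home/user/... location is invented for illustration):

```python
import posixpath

working_dir = "/home/user/project/notebooks/week1"  # hypothetical location
relative = "../../datasets/data.csv"

# Each ".." steps up one directory from the working directory.
resolved = posixpath.normpath(posixpath.join(working_dir, relative))
print(resolved)  # /home/user/project/datasets/data.csv
```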
What git commands do I use to add an upstream remote and pull files from its master branch?
git remote add upstream {link}
git pull upstream master
What is the formula for standard deviation?
std = sqrt(mean(abs(x - x.mean())**2))
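The formula written out in plain Python on made-up numbers, checked against statistics.pstdev, which also divides by N:

```python
import math
import statistics

x = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical sample values

mean = sum(x) / len(x)
# Square root of the mean squared deviation from the mean.
std = math.sqrt(sum(abs(v - mean) ** 2 for v in x) / len(x))

print(std)                   # 2.0
print(statistics.pstdev(x))  # 2.0
```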
What datatype is titanic.groupby('Pclass')?
A pandas.core.groupby.DataFrameGroupBy object (shown as pandas.core.groupby.generic.DataFrameGroupBy in recent pandas versions)
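A quick check on a tiny stand-in for the titanic data (the three rows below are invented, not taken from the real dataset):

```python
import pandas as pd

# Hypothetical stand-in for the titanic DataFrame.
passengers = pd.DataFrame({"Pclass": [1, 3, 3], "Fare": [71.3, 7.9, 8.1]})

grouped = passengers.groupby("Pclass")
type_name = type(grouped).__name__
print(type_name)  # DataFrameGroupBy

# Nothing is computed until an aggregation is requested.
mean_fare = grouped["Fare"].mean()
print(mean_fare)
```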
What is [9, 3, 9] * 2 ?
[9, 3, 9, 9, 3, 9]
This argument will concatenate dataframes column wise - pd.concat([df1, df2], _____ )?
axis=1
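A minimal sketch with two made-up single-column DataFrames, comparing axis=1 with the default axis=0:

```python
import pandas as pd

df1 = pd.DataFrame({"a": [1, 2]})  # hypothetical data
df2 = pd.DataFrame({"b": [3, 4]})

side_by_side = pd.concat([df1, df2], axis=1)  # columns a and b, 2 rows
stacked = pd.concat([df1, df2])               # default axis=0: 4 rows, gaps filled with NaN

print(side_by_side)
print(stacked)
```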
This command creates a new branch and checks it out.
git checkout -b {branch_name}
What is the difference between numpy.std and pandas.DataFrame.std?
The default denominator differs: numpy divides by N (ddof=0, the population standard deviation), while pandas divides by N - 1 (ddof=1, the sample standard deviation).
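A short comparison on made-up values. Passing ddof=1 to numpy should reproduce the pandas default:

```python
import numpy as np
import pandas as pd

values = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical sample values

numpy_std = np.std(values)            # divides by N
pandas_std = pd.Series(values).std()  # divides by N - 1
numpy_sample = np.std(values, ddof=1) # numpy with the pandas default

print(numpy_std, pandas_std, numpy_sample)
```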
What is Tukey's method for outlier detection?
Values outside the interval [Q1 - 1.5(Q3 - Q1), Q3 + 1.5(Q3 - Q1)] are flagged as outliers, where Q3 - Q1 is the interquartile range (IQR).
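A sketch of the fences on made-up data. Quartile conventions vary between tools; this assumes the "inclusive" method of statistics.quantiles, so other libraries may give slightly different fences.

```python
import statistics

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 50]  # hypothetical values, one obvious outlier

# Quartiles under the "inclusive" convention (an assumption, see above).
q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

# Anything outside [lower, upper] is flagged.
outliers = [v for v in data if v < lower or v > upper]
print(lower, upper, outliers)
```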
What is the output of the following?
[str(round(2.2)) for i in range(1, 6)]
['2', '2', '2', '2', '2']
What is the difference between .map, .apply, and .applymap methods?
apply works on a row / column basis of a DataFrame, applymap works element-wise on a DataFrame, and map works element-wise on a Series.
In short, .map runs down a single column (a Series), while .apply passes each column of the DataFrame to the function by default (axis=0), or each row with axis=1.
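A rough side-by-side on a made-up DataFrame. One assumption to flag: newer pandas versions offer DataFrame.map as the element-wise method in place of applymap, so the sketch picks whichever exists.

```python
import pandas as pd

df = pd.DataFrame({"a": [1, 2], "b": [10, 20]})  # hypothetical data

# Series.map: element-wise over one column.
doubled_column = df["a"].map(lambda v: v * 2)

# DataFrame.apply: the function receives a whole column (axis=0, the default)
# or a whole row (axis=1).
column_range = df.apply(lambda col: col.max() - col.min())
row_range = df.apply(lambda row: row.max() - row.min(), axis=1)

# Element-wise over the whole DataFrame: use DataFrame.map where it exists,
# otherwise fall back to applymap.
method_name = "map" if hasattr(pd.DataFrame, "map") else "applymap"
as_text = getattr(df, method_name)(str)

print(doubled_column.tolist(), column_range.tolist(), row_range.tolist())
print(as_text)
```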