What does NDL stand for?
Nittany Data Labs
Give one way of dealing with null data
Can vary
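Two common approaches are dropping rows with nulls and imputing a replacement value. A minimal sketch, assuming pandas and a made-up toy table (the column names and values are hypothetical):

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with one missing value in each column.
df = pd.DataFrame({"age": [25.0, np.nan, 40.0], "city": ["Erie", "Altoona", None]})

# Option 1: drop every row that contains at least one null.
dropped = df.dropna()

# Option 2: impute, here with the column mean for numbers
# and a placeholder string for text.
filled = df.assign(
    age=df["age"].fillna(df["age"].mean()),
    city=df["city"].fillna("unknown"),
)
```

Dropping is simplest but discards data; imputing keeps every row at the cost of introducing estimated values.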
What's the most common type of regression?
Linear Regression
What's the most common Python data manipulation library?
Pandas
Name one step in the data science process
Explain data integration
Combining data from different sources
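One way this can look in practice is joining two tables on a shared key. A minimal sketch, assuming pandas and two hypothetical sources (`crm` and `orders` are invented for illustration):

```python
import pandas as pd

# Two hypothetical sources that share a "customer_id" key.
crm = pd.DataFrame({"customer_id": [1, 2, 3], "name": ["Ana", "Ben", "Cy"]})
orders = pd.DataFrame({"customer_id": [1, 1, 3], "total": [20.0, 35.0, 12.5]})

# A left join keeps every customer, including those with no orders,
# whose "total" is then filled with NaN.
combined = crm.merge(orders, on="customer_id", how="left")
```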
Provide a method to get the coefficients of a linear regression equation
Ordinary least squares, gradient descent
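Both methods can be sketched on a tiny example. This assumes NumPy and hypothetical noise-free data drawn from y = 2x + 1; the learning rate and iteration count are illustrative choices, not recommended defaults:

```python
import numpy as np

# Hypothetical noise-free data generated from y = 2x + 1.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = 2.0 * x + 1.0

# Design matrix with a column of ones for the intercept.
X = np.column_stack([np.ones_like(x), x])

# Ordinary least squares: solve for [intercept, slope] directly.
ols_coef, *_ = np.linalg.lstsq(X, y, rcond=None)

# Gradient descent on mean squared error, starting from zeros.
gd_coef = np.zeros(2)
learning_rate = 0.05
for _ in range(5000):
    gradient = 2.0 / len(y) * X.T @ (X @ gd_coef - y)
    gd_coef -= learning_rate * gradient
```

Both `ols_coef` and `gd_coef` should end up close to an intercept of 1 and a slope of 2.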
What's the main advantage of numpy arrays over lists?
Improved performance
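The speedup comes largely from vectorized operations that run in compiled code over a typed, contiguous buffer. A minimal sketch comparing the two styles (the array size is arbitrary, and no timing is claimed here):

```python
import numpy as np

n = 100_000
values = list(range(n))
array = np.arange(n, dtype=np.int64)

# List: a Python-level loop that handles one integer object at a time.
squared_list = [v * v for v in values]

# Array: a single vectorized multiplication over the whole buffer.
squared_array = array * array
```

Timing each statement with `timeit` is one way to measure the difference on a given machine.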
What form of supervised learning is used to predict quantitative variables?
Regression
What does ETL stand for?
Extract, Transform, Load
What advantage does the error function MAE provide over MSE?
More interpretable (the error is in the same units as the target)
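MAE stays in the units of the target, while MSE is in squared units and weights large errors more heavily. A small worked example, assuming NumPy and made-up values:

```python
import numpy as np

# Hypothetical targets and predictions, e.g. prices in dollars.
y_true = np.array([100.0, 150.0, 200.0, 250.0])
y_pred = np.array([110.0, 140.0, 200.0, 210.0])
errors = y_true - y_pred  # -10, 10, 0, 40

# Mean absolute error: (10 + 10 + 0 + 40) / 4 = 15 dollars.
mae = np.mean(np.abs(errors))

# Mean squared error: (100 + 100 + 0 + 1600) / 4 = 450 dollars squared.
mse = np.mean(errors ** 2)
```

The single error of 40 contributes 1600 of the 1800 total squared error, which shows how MSE emphasizes outliers.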
What's the most popular traditional ML Python library?
Scikit-learn
What three things make up data science?
Computer Science, Statistics, Subject Matter Expertise
What is this an example of?
One-Hot Encoding
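As a generic illustration of one-hot encoding (not the example the question refers to), a minimal sketch assuming pandas and a hypothetical `color` column:

```python
import pandas as pd

# Hypothetical categorical column.
df = pd.DataFrame({"color": ["red", "green", "blue", "green"]})

# Each category becomes its own 0/1 column; exactly one is 1 per row.
encoded = pd.get_dummies(df, columns=["color"], dtype=int)
```

The result has the columns `color_blue`, `color_green`, and `color_red`.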
What is the purpose of the validation split in the train, test, and validation split of data?
Tune the model (for example, its hyperparameters) before the final test evaluation
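One common way to get three splits is to call scikit-learn's `train_test_split` twice. A minimal sketch, assuming a 60/20/20 split and placeholder data (both are illustrative choices):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder data: 100 samples with one feature each.
X = np.arange(100).reshape(-1, 1)
y = np.arange(100)

# First hold out 20% as the test set, used only for the final evaluation.
X_rest, X_test, y_rest, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Then take 25% of the remaining 80 samples as the validation set,
# used to compare models or hyperparameters during development.
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.25, random_state=0
)
```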
What popular data visualization library can be used to create interactive plots?
Plotly
Give another application of reinforcement learning
Can vary
What is the Standard Scaler equation?
z = (x - mean) / standard deviation
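Standardization is usually written as z = (x - mean) / standard deviation, computed per feature. A minimal sketch comparing a manual NumPy version with scikit-learn's `StandardScaler` on made-up values:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Hypothetical single-feature data, one row per sample.
x = np.array([[1.0], [2.0], [3.0], [4.0], [5.0]])

# Manual version: subtract the column mean, divide by the column
# standard deviation (population form, NumPy's default ddof=0).
z_manual = (x - x.mean(axis=0)) / x.std(axis=0)

# Library version for comparison.
z_sklearn = StandardScaler().fit_transform(x)
```

After scaling, the feature has a mean of 0 and a standard deviation of 1.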
Explain overfitting
The model fits the training data too closely, including its noise, which hurts its results on the testing data
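A small polynomial-fitting sketch can make this concrete. It assumes NumPy, a hypothetical true relationship y = 2x, and a fixed alternating "noise" pattern chosen for reproducibility rather than realism:

```python
import numpy as np

# Hypothetical training data: y = 2x plus a fixed alternating noise term.
x_train = np.linspace(0.0, 1.0, 8)
noise = 0.2 * np.array([1, -1, 1, -1, 1, -1, 1, -1])
y_train = 2.0 * x_train + noise

# Test points halfway between training points, using the noise-free relationship.
x_test = (x_train[:-1] + x_train[1:]) / 2.0
y_test = 2.0 * x_test

def mse(coeffs, x, y):
    """Mean squared error of a polynomial with the given coefficients."""
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

# A straight line versus a degree-7 polynomial that passes through every point.
simple = np.polyfit(x_train, y_train, deg=1)
flexible = np.polyfit(x_train, y_train, deg=7)

train_simple, train_flexible = mse(simple, x_train, y_train), mse(flexible, x_train, y_train)
test_simple, test_flexible = mse(simple, x_test, y_test), mse(flexible, x_test, y_test)
```

The flexible model has the lower training error because it reproduces the noise, and the higher test error because that noise does not generalize.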
What package does this line of code come from: plt.show()
Matplotlib