This type of data is organized in a predefined format, making it easy to store, retrieve, and analyze.
Structured Data
This job title is responsible for developing project plans, resource management, briefings, and budgets, for an individual project.
Project Manager
This job title is most often responsible for working in the back-end of business intelligence tools.
Business Intelligence (BI) Developer
Data Scientists can also be titled (semi-commonly) as one of these other two job titles. (Hint: Not AI or ML Engineer).
Mathematician | Statistician
The three phase process that is utilized to move data from one location to another and allows it to be prepared for analysis.
This is the largest format in which data is stored, and can store any type and size of data.
Data Lake
A Business Analyst is responsible for gathering/defining these when talking to end-users and stakeholders. (Hint: "These" refer to what the system or application should do, or its overall purpose).
Business Requirements (Also acceptable: Functional Requirements)
This describes the process of organizing and reorganizing data in a database to ensure it can be utilized for further queries and analysis ("cleaning" data).
Data Normalization
Which of the following is NOT a type of AI?
Machine Learning (ML) | Deep Learning | Natural Language Processing (NLP) | Data Predictions
Data Predictions
A machine's ability to perform the cognitive functions we associate with human minds, such as perceiving, learning, reasoning, etc.
Artificial Intelligence (AI)
This job title is responsible for designing, constructing, and maintaining the systems and infrastructure needed to acquire, store, process, and transform data into a usable format for analysis and decision-making.
Data Engineer
Which of the following is a type of SDLC that BAs/PMs working on software projects have likely worked with?
Specifications | Testing | Agile | Functional
Agile
True/False: BI professionals are often responsible for delivering presentations to executive-level leadership within an organization.
False (More of a BA responsibility)
This type of problem occurs in data science when a ML model learns training data too well.
Overfitting
Name two tools commonly utilized in data visualization.
Qlik | Tableau | Power BI | Dundas | Crystal Reports
Which of the following is NOT a big data tool?
Hadoop | Hive | Kafka | Nifi
Nifi
This job title is a step above Program Manager, and they might be responsible for numerous programs across a business unit. (Hint: Not a "director" in title.)
Portfolio Manager
Name two things that are typically contained within dashboards.
Visualizations, filters, KPIs, titles/labels, notes
What is the primary difference between supervised and unsupervised learning?
Whether the data is labeled (supervised) or not (unsupervised)
This refers to the process of computers learning from communicating with humans.
Natural Language Processing (NLP)
Which of the following IS an example of a data warehousing tool?
Redshift | Alation | Talend | ERwin
Redshift (Amazon)
This is the "best" certification for individuals in the project management field.
PMP (Project Management Professional Certification)
Name one tool that is commonly used for data mining and/or predictive analysis.
RapidMiner | IBM SPSS Modeler | SAS Enterprise Miner
Which of the following libraries is NOT commonly utilized in ML?
scikit-learn | PyTorch | TensorFlow | CNNs
CNNS (type of neural network)
TensorFlow Serving, Flask, Django, and AWS SageMaker are examples of tools/technologies that perform this function.
Model Deployment