BIG DATA BASICS
PREDICTIVE MAINTENANCE
INDUSTRY APPLICATIONS
TECHNOLOGIES AND TOOLS
ASSESSMENT 2 PREP

100

These are the 5 V's of Big Data.

What are Volume, Velocity, Variety, Veracity, and Value?

100

This maintenance approach uses sensor data and ML to predict failures before they occur.

What is predictive maintenance?

100

Samsung's Nexplant Analytics platform targets this industry.

What is manufacturing?

100

Apache Kafka is used for this type of real-time data processing.

What is stream processing?

100

This document establishes team roles and responsibilities for Assessment 2.

What is the group contract?

200

Unlike predictive maintenance, this traditional approach waits for equipment to fail.

What is reactive (corrective) maintenance?

200

These devices collect real-time equipment data to enable predictive maintenance.

What are IoT sensors?

200

The Sahal et al. (2020) article focuses on these two specific industries for predictive maintenance.

What are railway transportation and wind energy?

200

This framework processes petabyte-scale data across distributed computing clusters.

What is Apache Hadoop (or Apache Spark)?

200

This Assessment 2 criterion is worth the most marks: a whopping 25 points.

What is the project proposal (or solution design)?

300

This type of analytics tells you what will happen, not just what has happened.

What is predictive analytics?

300

This Industry 4.0 concept creates a virtual replica of a physical asset to simulate behaviour and predict failures before they happen in the real world.

What is a digital twin?

300

Siemens monitors over 30,000 of these renewable energy assets globally using big data stream processing to predict blade fatigue and gearbox wear up to 6 weeks in advance.

What are wind turbines?

300

This distributed computing architecture places processing power close to the data source for low latency.

What is edge computing?

300

Assessment 2 requires evaluating this from your Assessment 1 work: the current state of the smart industry app.

What is the current implementation?

400

Descriptive, diagnostic, and predictive analytics all lead to this most advanced form.

What is prescriptive analytics?

400

This practice involves automatically updating a predictive maintenance ML model as new failure data arrives, so accuracy improves over time without manual retraining.

What is continuous training (or continuous model retraining)?
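
Study note: the continuous-training idea can be made concrete with a small sketch. The Python below is only an illustration under stated assumptions: the class name `OnlineFailureModel`, the two features (vibration and temperature, scaled to 0..1), and the sample stream are all hypothetical, and a plain logistic regression updated by stochastic gradient descent stands in for whatever model a real predictive maintenance system would use.

```python
import math


class OnlineFailureModel:
    """Hypothetical logistic-regression model updated one reading at a time."""

    def __init__(self, n_features, learning_rate=0.1):
        self.weights = [0.0] * n_features
        self.bias = 0.0
        self.learning_rate = learning_rate

    def predict_proba(self, features):
        # Estimated probability that the asset fails soon.
        z = self.bias + sum(w * x for w, x in zip(self.weights, features))
        return 1.0 / (1.0 + math.exp(-z))

    def update(self, features, failed):
        # One gradient step on a newly labelled reading (failed is 0 or 1),
        # so the model adapts without a full manual retraining run.
        error = self.predict_proba(features) - failed
        self.weights = [
            w - self.learning_rate * error * x
            for w, x in zip(self.weights, features)
        ]
        self.bias -= self.learning_rate * error


# Made-up stream of (vibration, temperature) readings; label 1 means the
# machine failed shortly afterwards.
stream = [
    ([0.1, 0.2], 0),
    ([0.9, 0.8], 1),
    ([0.2, 0.1], 0),
    ([0.8, 0.9], 1),
] * 50

model = OnlineFailureModel(n_features=2)
for features, failed in stream:
    model.update(features, failed)  # the model learns as each label arrives

healthy_risk = model.predict_proba([0.1, 0.1])
worn_risk = model.predict_proba([0.9, 0.9])
print(f"healthy: {healthy_risk:.2f}, worn: {worn_risk:.2f}")
```

Each call to `update` plays the role of new failure data arriving. In practice the same pattern is usually wrapped in a pipeline that also validates the updated model before it replaces the one in service.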

400

This Asian city-state's Mass Rapid Transit system uses big data analytics across 130 km of track and 120+ trains to predict rail degradation and signal failures before service is disrupted.

What is Singapore (MRT)?

400

This distributed file system, originally inspired by Google's internal storage system, stores petabytes of data across clusters of commodity hardware in Hadoop environments.

What is HDFS (Hadoop Distributed File System)?

400

This section of Assessment 2 is worth 25 points and requires professional diagrams and schematics.

What is solution design?