These major AI features were available for the first time in KNIME Analytics Platform Version 5.1.
What is new K-AI and KNIME AI Extension?
This KNIME feature is used to automate workflow execution.
What is scheduling?
These are the three stages of analyzing data in KNIME Analytics Platform
What is
1. Access and blend
2. Transform and analyze
3. Visualize and explore
A framework who's major stages include: Blend & Transform, Model & Visualize, Optimize & Capture, Validate & Deploy, Consume & Interact, Monitor & Update
What is the Data Science Lifecycle?
The smallest possible unit at KNIME and are created to perform discrete data tasks.
The short-hand for what we call our tone of voice (that's down-to-earth, not exaggerating, not salsey)
What is "the sober voice"?
These are the top reasons to switch from KNIME Server to KNIME Business Hub
What is:
1. Ease of Collaboration: Unlock cross-discipline work.
Share & reuse expertise
between skill sets and teams.
2. Ease of Deployment: Easily & automatically deploy data apps and services to any number of end users
3. Built for Scale Cloud-native infrastructure with no limit to users. Easy to administrate & maintain.
The shortcomings of Excel macros (name two).
1. What is a security risk?
2. What is complexity/lack of transparency?
3. What is the increased file size of a workbook?
4. What is lack of rollback -- can't "undo"?
These two fundamental elements are needed to create a workflow.
What is
1. Node
2. Component
This section in KNIME Analytics Platform (editor) lets you connect to KNIME Business Hub.
What is KNIME Space Explorer?
The signal that a node has been executed but execution has failed.
What is a red "x" on the traffic light beneath a node?
Three reasons that justify why KNIME should be used.
What is Analytic depth/breadth?
What is End-to-end?
What is Enterprise scale?
(Underpinned by "open")
KNIME Analytics Platform 5.1 new nodes for spreadsheet users (name 3).
What is Value Lookup?
What is Row Aggregator?
What is Table Splitter?
What is Table Cropper?
What is Cell Extractor?
What is Table Updator?
What is Cell Updator?
This is what CI/CD is called in the data science world.
What is CDDS?
These types of techniques deal with trend analysis. They're used to see trends over time, based on defined time intervals. Many marketing and sales teams, for instance, already use this technique when they look at month over month revenue or quarterly churn numbers.
What is Time Series Analysis?
This is the term for a group of users on KNIME Hub that work together on shared projects.
What are "teams"?
What kind of node is needed to connect to a database?
Connector node
Three KNIME customers in manufacturing.
What are
1. Seagate
2.Continental
3. Siemens
Kaercher, P&G, Wurth, Tata Steel, ZF, Forest Grove, Gemmacon, etc. also acceptable
This new feature (in labs), released with AP 5.1, allows you to export your visualizations as PDF or HTML.
What is KNIME Reporting (Labs)?
This deployment type in KNIME lets you deploy your workflow as an API endpoint.
What is service / "create service"?
DESIGN BONUS QUESTION
What does the triangle in the middle of the KNIME logo stand for?
What is the “core”/“heart” of data?
This is the popular software delivery practice CDDS is based on.
What is CI/CD
The file type of a saved KNIME workflow.
What is a ".knwf" file?
This is the new font that will be used in all decks/presentations going forward.
What is "Avenir / LT Avenir Pro"?
These are the three ways a workflow can be deployed in KNIME Business Hub.
What is
- Data apps
- Schedules
- API services
This Component automatically trains supervised machine learning models for both binary and multiclass classification. The component is able to automate the whole ML cycle by performing some data preparation, parameter optimization with cross validation, scoring, evaluation and selection.
What is AutoML?
These types of techniques identify naturally occurring groups in your dataset. This is an “unsupervised” analysis–so it identifies patterns without predefined labels. A health insurance provider might use this technique to identify high-risk groups based on doctor visits per year, age, household size and chronic conditions.
What is clustering?
DESIGN BONUS QUESTION
Which one of the graphics below has been recently used in a KNIME campaign?
What is the top picture?
Nodes that are magenta in color and have no output ports.
What are Writer nodes?
DESIGN BONUS QUESTION
How is the KNIME Business Hub graphically represented?
What is a network?
The starting cost of KNIME Community Hub: Teams.
What is 250/month?
DESIGN BONUS QUESTION
What animated character appears in much of the practioner educational resources?
What is a robot?
2 of the machine learning libraries that KNIME integrates with.
What is Keras? H2O, XGBoost, TensorFlow...
When a model in production is no longer accurate.
What is model drift?
DESIGN BONUS QUESTION!
The hexcode for KNIME yellow.
What is FDD800?
The biggest industry (by annual revenue) represented in our customer database.
What is manufacturing?
These are the four types of data users listed on the new homepage.
Who are
- Business & domain experts
- Data experts
-End users
- MLOps & IT
The maximum number of rows in an Excel spreadsheet (rounded to the nearest hundred is fine).
What is is 1,048,576?
A combination of multiple models from supervised learning algorithms to obtain a more stable and accurate overall model.
What is ensemble learning? (Bagging and boosting also acceptable.)
A popular framework for data science that doesn't cover what happens after deployment
What is CRISP-DM?
A Pythonista's interest in Data Apps.
What is the lack of need to write Javascript and HTML?
The 3 benefits of open.
1. Easily integrate with current and future technology
2. Community-driven developments
3. Frictionless adoption (free cost)
Besides the number of vCores, Users and Teams, this feature is the major difference between the Standard and Enterprise version of KNIME Business Hub.
What is read-access for unlicensed users?
These 3 steps need to be automated to achieve continuous deployment of data science.
What is-
1. Automated integration and deployment
2. Automated testing and validation
3. Automated monitoring and retraining
Doing this helps a data scientist make a dataset more manageable by reducing the number of attributes, without sacrificing the variation.
What is Dimensionality Reduction?
The reason that a data app can be more valuable than a dashboard built in Tableau.
What is the complexity of the underlying workflow?
If this setting is toggled "on," you can use the KNIME Python Script node without installing, configuring or even knowing anything about Python environments.
What is the "Bundled" setting for Python environment configuration.
Roughly the number of people in our marketing database (rounded to nearest 100k).
What is 737k? (350k also excepted, for mailable contacts)