DevOps Jeopardy

Code / Build / Test

Release / Deploy

Operate

Monitor

Plan

100

Source control, the practice of tracking and managing changes to software code, is also known as this name

Version Control

100

Also a famous British butler, the open source automation server we use to build & deploy software

Jenkins

100

A dinosaur-themed exercice that simulates network outages in our AWS public cloud; typically conducted 4X/year

TREx

100

A target range of values for a service level; often measured by its close cousin, the SLI

SLO

100

Agile ceremony that happens once every 6 weeks where we plan the next product increment of work

PI Planning

200

The name of the version control system we use here at Capital One

Github

200

Also a term of art in warfare used to describe enemy aircraft, this is our main in-house CI/CD tool

Bogie

200

This type of engineering specialization describes the practice of defining SLIs, SLOs, and SLAs

Reliability

200

A portmanteau that is also an oxymoron, it’s also the name of main observability tool we use

NewRelic

200

80% is the agreed upon number for measuring an engineering team’s commitment during a 6-week PI

Capacity

300

Also meaning 'one who is experienced or knowledgeable', this is a popular software project management tool

Maven

300

The component of Bogie used for infrastructure management

Gears

300

A metal more precious than gold, 4 ‘9’s describes this level of availability required to achieve this Resiliency Tier

Platinum

300

I am a business commitment that a service provider makes to a customer; often in the form of a contract with financial penalties if I’m missed

SLA

300

This agile ceremony which occurs daily, answers the following questions–’What did I work on yesterday, what am I working on today, and do I have any impediments?’

Daily Stand Up, S1, Scrum all acceptable

400

Commonly referred to as SCA, this type of tool searches for and finds security bugs in our code, hopefully before we run it in production

Static or Source Code Analysis both acceptable

400

The term used to describe a recovery from a bad deployment, or return to steady state operations

Rollback

400

15 minutes is all we get to meet this type of reliability metric

Recovery Time

400

This strange acronym ‘MTBF’ describes the expected (arithmetic mean) time between two failures of a system

Mean Time Between Failures

400

A Capital One Agile process with a funny name that addresses changes to the PI Plan in the event of the need to move shift work priority

WiWo

500

80% is the bare minimum (cough) for code coverage for this type of test which measures functionality in isolation without deployment

Unit Test

500

LaunchDarkly is an example of a tool that allows for this type of flag release

Feature

500

The 'top of the pyramid', this level of observability maturity introduces proactive low-sev incidents; yikes!

Level 4

500

The amount of permissible impact of failed events; similar to its financial analog, it's bad for operations if we exhaust it

Event Budget

500

The agile ceremony where risks are discussed and dispositioned; aka ‘Scrum of Scrum of Scrums’ ;-)