Chapter Jeopardy Template

Training Your Robo Dog

Reinforcement Learning

Roise

Rosie’s First Lesson

100

Type of learning, inspired by the operant conditioning technique of rewarding desired behavior and ignoring unwanted behavior

reinforcement learning

100

In reinforcement learning, an agent learns through this process of trying actions and learning from rewards or mistakes

trial and error

100

What are Rosies Three Built in Actions

Forward, take a step Backward, and she can Kick

100

When Rosie finally gets a reward for kicking the ball, what exactly does she learn from that experience?

that being next to the ball and choosing Kick is a good action

200

This Sony robot dog, often used in robot soccer, can walk, kick, and even wag its plastic tail using built-in sensors, motors, and a camera

Aibo

200

In reinforcement learning, this term refers to the predicted amount of future reward an agent expects to receive

value of an action (or action value)

200

In reinforcement learning, why is it important that Rosie doesn’t learn too much from a single reward?

to avoid forming “superstitions,” or false connections between actions and rewards

300

In 2016, reinforcement learning gained worldwide attention when it powered this AI program that defeated the world’s top players in this game

400

Programmers might teach a Sony Aibo to walk toward and kick the ball by giving it a set of explicit instructions like “take a step toward the ball” and “kick the ball.” This is know as what kind of training

Rule-Based AI

400

Rosie hasn’t learned anything yet and is described as a “tabula rasa.” What does this term mean in the context of reinforcement learning?

a blank slate with no prior knowledge or experience?