What is Bayes' rule?
P(A|B) = P(B|A) P(A) / P(B): the posterior equals the likelihood times the prior, divided by the evidence.
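As a minimal numeric sketch of applying the rule (the disease-test numbers below are made up for illustration):

```python
# Bayes' rule: P(A|B) = P(B|A) * P(A) / P(B)
# Hypothetical screening example: 1% prevalence,
# 95% sensitivity, 10% false-positive rate.
p_disease = 0.01
p_pos_given_disease = 0.95
p_pos_given_healthy = 0.10

# Total probability of a positive test: the evidence P(B).
p_pos = (p_pos_given_disease * p_disease
         + p_pos_given_healthy * (1 - p_disease))

# Posterior probability of disease given a positive test.
p_disease_given_pos = p_pos_given_disease * p_disease / p_pos
print(p_disease_given_pos)  # well under 10%, despite the "95% accurate" test
```

The low posterior shows why the prior (prevalence) matters as much as the likelihood.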
What is the goal of a reinforcement learning agent?
To maximise cumulative reward.
What are the three components of a feed-forward network?
Pre-synaptic neurons, post-synaptic neurons, and the synaptic weights.
What is the definition of causality?
Given two events a and b, a causes b if changing a to a' would have changed the probability of b.
Why do we apply zero-padding?
To keep the input and output shapes the same, e.g. so a convolution does not shrink the feature map.
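A sketch of "same" zero-padding for a 1-D convolution (toy input and kernel, assuming an odd kernel size):

```python
def conv1d_same(x, w):
    # Zero-pad so a "valid" convolution returns the input length.
    k = len(w)
    pad = (k - 1) // 2  # assumes an odd kernel size
    xp = [0.0] * pad + list(x) + [0.0] * pad
    return [sum(w[j] * xp[i + j] for j in range(k)) for i in range(len(x))]

x = [1.0, 2.0, 3.0, 4.0]
w = [0.25, 0.5, 0.25]  # small smoothing kernel
y = conv1d_same(x, w)
print(len(x), len(y))  # same length in and out
```

Without the padding, a valid convolution would return only len(x) - k + 1 outputs.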
What do you get when you multiply multiple different Gaussian distributions?
The product is proportional to another Gaussian (the precisions add). A mixture of Gaussians comes from summing densities, not multiplying them.
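The product of Gaussian densities is proportional to a single Gaussian whose precision (inverse variance) is the sum of the precisions; a quick numeric check with arbitrarily chosen means and variances:

```python
import math

def gauss(x, mu, var):
    return math.exp(-(x - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

# Two Gaussians with different means and variances.
mu1, v1 = 0.0, 1.0
mu2, v2 = 2.0, 0.5

# Closed form for the product: precisions add,
# and the mean is the precision-weighted average.
v = 1.0 / (1.0 / v1 + 1.0 / v2)
mu = v * (mu1 / v1 + mu2 / v2)

# Check: the product divided by the combined Gaussian is the
# same constant at every x, i.e. they differ only by scale.
ratios = [gauss(x, mu1, v1) * gauss(x, mu2, v2) / gauss(x, mu, v)
          for x in (-1.0, 0.0, 1.5, 3.0)]
print(all(abs(r - ratios[0]) < 1e-9 for r in ratios))
```

This is why combining Gaussian prior and Gaussian likelihood in Bayes' rule yields a Gaussian posterior.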
What is the discount factor?
A number gamma in [0, 1] that weights future rewards: with gamma close to one the agent values long-term rewards, with gamma equal to zero it looks only at the immediate reward.
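A toy illustration with a made-up reward sequence where the big reward arrives late:

```python
def discounted_return(rewards, gamma):
    # G = sum over t of gamma**t * r_t
    return sum(gamma ** t * r for t, r in enumerate(rewards))

rewards = [1.0, 0.0, 0.0, 10.0]  # large reward only at the last step
print(discounted_return(rewards, 0.0))  # only the immediate reward counts
print(discounted_return(rewards, 0.9))  # the late reward dominates
```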
How can we achieve a dynamical system with fixed points?
By setting the dynamics to zero: the fixed points are the solutions of dx/dt = f(x) = 0.
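A sketch with the toy dynamics dx/dt = x(1 - x), which has fixed points at 0 and 1; the sign of the slope of f at each fixed point also tells us its stability:

```python
def f(x):
    # Toy 1-D dynamics dx/dt = x * (1 - x); fixed points where f(x) == 0.
    return x * (1.0 - x)

def slope(x, h=1e-6):
    # Numerical derivative f'(x): negative slope -> stable fixed point,
    # positive slope -> unstable fixed point.
    return (f(x + h) - f(x - h)) / (2 * h)

for x_star in (0.0, 1.0):
    print(x_star, f(x_star), slope(x_star))
```

Here x = 0 is unstable (positive slope) and x = 1 is stable (negative slope).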
Why do we say correlation is not causation?
Because in large systems correlated variables often have no direct causal link; the correlation can come from confounders or coincidence.
How can we apply regularisation in Deep Learning?
By adding L1 or L2 penalty terms to the loss.
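A minimal sketch of adding penalty terms to a scalar loss (a toy one-parameter linear model, made up for illustration):

```python
def mse(w, xs, ys):
    # Mean squared error of the one-parameter model y = w * x.
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def regularised_loss(w, xs, ys, l1=0.0, l2=0.0):
    # L1 adds |w|, L2 adds w**2; both discourage large weights.
    return mse(w, xs, ys) + l1 * abs(w) + l2 * w ** 2

xs, ys = [1.0, 2.0], [2.0, 4.0]
print(regularised_loss(2.0, xs, ys))          # unregularised fit is perfect
print(regularised_loss(2.0, xs, ys, l2=0.1))  # penalty now charges for |w|
```

Minimising the penalised loss trades data fit against weight size, which is what combats overfitting.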
What do we assume when we compute the log likelihood?
That the samples are iid, so the joint density factorises and the log likelihood becomes a sum.
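A sketch for iid Gaussian samples (the data points are made up); because the joint factorises, the log likelihood is a plain sum over samples:

```python
import math

def log_lik_normal(samples, mu, var):
    # Under the iid assumption the joint density factorises,
    # so the log likelihood is a sum of per-sample log densities.
    return sum(-0.5 * math.log(2 * math.pi * var) - (x - mu) ** 2 / (2 * var)
               for x in samples)

samples = [0.1, -0.3, 0.5]
# For fixed variance, the sample mean (0.1 here) maximises the likelihood.
print(log_lik_normal(samples, 0.1, 1.0))
print(log_lik_normal(samples, 1.0, 1.0))  # worse: mu far from the data
```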
What is the exploration and exploitation trade-off?
The agent must balance exploiting actions it already knows to be good against exploring new actions; if it never tries new actions, it may miss better ones and not perform well.
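One common way to manage the trade-off is epsilon-greedy action selection; a sketch with made-up action values:

```python
import random

def epsilon_greedy(q_values, epsilon, rng):
    # Exploit the best-known action most of the time,
    # but explore a uniformly random action with probability epsilon.
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)

rng = random.Random(0)
q = [0.1, 0.9, 0.4]  # action 1 currently looks best
picks = [epsilon_greedy(q, 0.1, rng) for _ in range(1000)]
print(picks.count(1) / len(picks))  # mostly, but not always, the greedy action
</n```

With epsilon = 0.1 the agent still samples the other actions occasionally, so it can discover if its value estimates are wrong.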
What is the difference between unstable and stable fixed points?
At a stable fixed point the dynamics f(x) has negative slope, so perturbations decay; at an unstable fixed point the slope is positive, so perturbations grow.
What does causal inference mean?
Estimating the causal effect of X on Y.
What is a fully-connected layer?
A layer in which every unit in the current layer connects to every unit in the next layer.
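A minimal sketch of such a layer as a matrix-vector product, with one weight per (input unit, output unit) pair; the weights and biases below are arbitrary:

```python
def dense(x, weights, biases):
    # Every input unit connects to every output unit: one weight per pair.
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]

x = [1.0, 2.0]                              # 2 input units
W = [[0.5, -1.0], [1.0, 1.0], [0.0, 2.0]]   # 3 output units -> 3x2 weights
b = [0.0, 0.1, -0.5]
y = dense(x, W, b)
print(y)
```

A 2-to-3 layer therefore has 2 * 3 weights plus 3 biases.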
What is meant by marginalisation?
Summing (or integrating) a joint distribution over all the other variables, to obtain the distribution of the variables we keep.
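A sketch with a made-up two-variable joint table, marginalising one variable out:

```python
# Joint distribution P(weather, sprinkler) as a flat table (made-up numbers).
joint = {
    ("rain", "on"): 0.05, ("rain", "off"): 0.25,
    ("dry", "on"): 0.40, ("dry", "off"): 0.30,
}

# Marginalise out the sprinkler: sum over its values.
p_weather = {}
for (weather, sprinkler), p in joint.items():
    p_weather[weather] = p_weather.get(weather, 0.0) + p

print(p_weather)  # rain ~ 0.3, dry ~ 0.7
```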
What is the difference between Monte Carlo methods and Temporal Difference learning?
Monte Carlo methods sample complete returns from full episodes; TD learning bootstraps, updating estimates from other estimates after a single step.
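A sketch of one update of each kind for a single three-step episode (gamma = 1 for simplicity; rewards and the learning rate are made up):

```python
# One episode from state 0: rewards received at each step.
rewards = [1.0, 0.0, 2.0]
V = {0: 0.0, 1: 0.0, 2: 0.0, 3: 0.0}  # value estimates; state 3 is terminal
alpha = 0.5                            # learning rate

# Monte Carlo: wait for the complete sampled return from state 0.
G = sum(rewards)
V_mc = V[0] + alpha * (G - V[0])

# TD(0): bootstrap, using one real reward plus the current
# estimate of the next state's value.
td_target = rewards[0] + V[1]
V_td = V[0] + alpha * (td_target - V[0])

print(V_mc, V_td)  # MC uses the full return; TD leans on an estimate
```

TD can update before the episode ends, at the cost of relying on possibly wrong estimates.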
What is the lifetime of a protein?
On the order of days.
What is an instrumental variable?
A variable that influences the treatment but affects the outcome only through the treatment.
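A sketch using the standard ratio (Wald) estimator cov(Z, Y) / cov(Z, X) on simulated data with a hidden confounder; all numbers and the data-generating process are synthetic:

```python
import random

rng = random.Random(1)
n = 20000
z = [rng.gauss(0, 1) for _ in range(n)]            # instrument
u = [rng.gauss(0, 1) for _ in range(n)]            # unobserved confounder
x = [zi + ui for zi, ui in zip(z, u)]              # treatment: driven by Z and U
y = [2.0 * xi + 3.0 * ui for xi, ui in zip(x, u)]  # true causal effect of X is 2

def cov(a, b):
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / len(a)

naive = cov(x, y) / cov(x, x)  # regression slope: biased by the confounder
iv = cov(z, y) / cov(z, x)     # instrument recovers roughly the true effect
print(naive, iv)
```

The naive slope is pulled away from 2 by U, while the instrument, which touches Y only through X, is not.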
How can I smooth the landscape of the loss function?
By using skip connections.
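A minimal sketch of a residual (skip) connection, y = x + f(x):

```python
def residual_block(x, f):
    # Skip connection: output = input + learned transformation.
    return [xi + fi for xi, fi in zip(x, f(x))]

# Even if the learned branch outputs zeros, information passes through
# unchanged, and the identity path contributes a gradient of exactly 1.
zero_branch = lambda x: [0.0] * len(x)
print(residual_block([1.0, -2.0, 3.0], zero_branch))  # [1.0, -2.0, 3.0]
```

The identity path gives the gradient a direct route through the network, which is one reason deep residual networks have smoother loss landscapes.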
What is utility equivalent to?
Negative cost (utility = -cost).
What does the reward prediction error correspond to?
The activity of dopamine neurons.
What is the lifetime of a neuron?
100 years
Does X cause Y?
Why do we use ReLU instead of sigmoid activation functions?
To avoid vanishing gradients: the sigmoid saturates and its gradient is at most 0.25, while ReLU passes gradients through unchanged for positive inputs.
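A quick numeric comparison of the two gradients chained through many layers (the depth and inputs are arbitrary):

```python
import math

def sigmoid_grad(x):
    s = 1.0 / (1.0 + math.exp(-x))
    return s * (1.0 - s)  # at most 0.25 (at x = 0), tiny for large |x|

def relu_grad(x):
    return 1.0 if x > 0 else 0.0

depth = 20
# Chained through many layers, sigmoid gradients shrink multiplicatively,
# even in the best case of 0.25 per layer...
sig_chain = 0.25 ** depth
# ...while ReLU passes the gradient through unchanged on the active path.
relu_chain = relu_grad(1.0) ** depth
print(sig_chain, relu_chain)
```

After 20 layers the best-case sigmoid gradient is below 1e-12, which is the vanishing-gradient problem in miniature.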