Alignment Agendas
MATS Events
Philosophy and EA
AI History
np.random.rand()
100
Stuart Russell's approach to alignment (described in Human Compatible)

What is inverse reinforcement learning?

100
This researcher would say something like "It's coup time! Go! Go! Go!"

Who is Buck Shlegeris?

100

A hypothesized period of time during which humanity works out how best to realize its long-term potential

What is the long reflection?

100

A 1997 chess match between an AI and a world chess champion

What is Deep Blue vs Garry Kasparov?

100

a fictional world and civilization invented by Eliezer Yudkowsky

Dath Ilan

200

The full expression for G.O.W.A.W.

What is Go Out With A Whimper?

200

This person famously declared, "These atoms fight back!"

Who is George Hotz?

200

Toby Ord's probability of existential risk in the next century (reported in The Precipice)

1/6

200

A 2016 Go match between an AI and a top Go player

What is AlphaGo vs Lee Sedol?

200

"...most people may not realize how much of this entire field is myself wearing various ______." -- Eliezer, List of Lethalities comment

What are trenchcoats?

300

The full name for H.C.H.

What is Humans Consulting H.C.H.?

300

These two sub-agents are collectively known as Janus

Laria and Kyle

300

This technique is applied to shrimp eyestalks (and neural nets)

What is ablation?

300

The CNN that won the 2012 ImageNet Challenge

What is AlexNet?

300
The number of neurons in a human brain (OOM)

What is one hundred billion (10^11)

400
The full name for Andrew Critch's R.A.A.P

What are Robust Agent-Agnostic Processes?

400

The situational awareness benchmark prototype that Rudolf and Alex built last weekend

SADDER

400

A thought experiment demonstrating that an updateless decision theory using logical counterfactuals is better than an evidential updateless decision theory

What is the Troll Bridge?

400

The three labs that signed the White House voluntary commitments but are not members of the Frontier Model Forum

What are Amazon, Inflection, and Meta?

400

The Lovecraftian race that created the Shoggoth

What are The Elder Things?

500

The two documents named as inspiration for Claude's constitution

What are the UN Declaration of Human Rights and Apple's Terms of Service?

500

This was Ethan Edwards' DJ name at the mid-program party

Who is DJ Kernel Clustered

500

A "rule" that a modal decision theory agent with a fixed proof search ordering can follow to avoid the 5-and-10 problem

What is the Chicken Rule?

500

The first major theorem to be proved using a computer

What is the four color theorem?

500

ChatGPT's MBTI type (as tested by a random Reddit user 8 months ago by feeding it the questions and having it answer)

What is ENFJ?

M
e
n
u