The Lightcone team runs Lighthaven and this forum.
What is LessWrong?
This Berkeley physicist won the 1939 Nobel Prize for inventing the cyclotron.
Who is Ernest Lawrence?
This AI company, co-founded by siblings Dario and Daniela Amodei, is known for developing Claude.
What is Anthropic?
This type of layer is essential for modern LLMs to process variable-length sequences.
What is an attention layer?
This framework, closely associated with Peter Singer, argues we should maximize suffering reduction while remaining neutral to species, location, personal affinity, and (sometimes) time.
What is effective altruism?
Halfway up the stairs to the second floor of B, there is currently a door labeled with the name of the protagonist of this Eliezer Yudkowsky rationalist fanfiction.
What is "Harry Potter and the Methods of Rationality"?
This Berkeley professor won a Nobel Prize for discovering dark energy.
Who is Saul Perlmutter?
This independent AI safety research organization focuses on interpretability.
What is Redwood Research?
This loss function measures the difference between predicted and true probability distributions and is commonly used to train language models.
What is cross-entropy loss?
This thought experiment generally involves choosing between letting a larger number of people die by doing nothing and actively killing a smaller number of people.
What is the trolley problem?
Before being acquired by Lightcone, Lighthaven was a hotel that went by this name.
What is the Rose Garden Inn?
This Berkeley alumnus and professor is known for co-inventing the UNIX operating system.
Who is Ken Thompson?
This method, pioneered by OpenAI, uses human preference data to align language models’ outputs with human values.
What is reinforcement learning from human feedback (RLHF)?
This optimization algorithm adaptively adjusts learning rates for each parameter.
What is the Adam optimizer?
This paradox shows how rational agents might fail to cooperate despite mutual benefit.
What is the prisoner's dilemma?
The Lighthaven door code is 1620, the year of publication of Novum Organum by this 17th-century philosopher.
Who is Francis Bacon?
This 1964 student movement at Berkeley fought for free speech and political activity on campus.
What is the Free Speech Movement?
This DeepMind paper introduced the concept of “reward modeling” for AI alignment.
What is “Scalable agent alignment via reward modeling: a research direction”?
This technique reduces overfitting by randomly zeroing out a fraction of activations during training.
What is dropout?
This thought experiment challenges physicalist theories of consciousness by imagining physically identical beings that lack consciousness.
What are philosophical zombies?
These are the six “full” names of buildings A, B, C, D, E, and F.
What are Aumann, Bayes, Cantor, Darwin, Eigen, and Feynman?
This Berkeley professor founded the Center for Human-Compatible AI (CHAI) and wrote "Human Compatible".
Who is Stuart Russell?
This research organization published “Intelligence Explosion Microeconomics” and “Coherent Extrapolated Volition”.
What is MIRI?
This foundational paper showed how language model performance scales with compute, data, and parameters.
What is the Kaplan scaling laws paper from OpenAI (2020)?
This decision theory considers how agents should act when copies of themselves exist.
What is updateless decision theory?