ideas & writings

Selected Work Worth Reading

A list of public essays and papers that reflect ideas commonly associated with Dario Amodei.

Curated Reading List

Machines of Loving Grace

A vision essay on opportunity, responsibility, and advanced AI.

Read on Anthropic

Constitutional AI

Foundational work on principle-guided training for safer assistant behavior.

Read on arXiv

AI and Compute

A frequently cited analysis of compute trends and AI progress.

Read on OpenAI

Concrete Problems in AI Safety

An early framing of AI safety as a set of concrete, practical research problems.

Read on arXiv

Language Models (Mostly) Know What They Know

Research on calibration and uncertainty in language models.

Read on arXiv

Toward Monosemanticity

Interpretability work that uses dictionary learning to decompose model activations into more interpretable features.

Read article

For broader discovery: Google Scholar search for Dario Amodei