Scholar

William Saunders

Google Scholar ID: 8hjFFAoAAAAJ

OpenAI

AI AlignmentAI SafetyDeep Reinforcement LearningNatural Language ProcessingMachine Learning

Google Scholar↗

Citations & Impact

All-time

Citations

11,161

H-index

10

i10-index

10

Publications

14

Co-authors

2

list available

Contact

No contact links provided.

Publications

3 items

Emotion Concepts and their Function in a Large Language Model

2026

Cited

0

Open Problems in Mechanistic Interpretability

2025

Cited

0

RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts

arXiv.org · 2024

Cited

24

Resume (English only)

Co-authors

2 total

Anthropic AI, OpenAI

Affiliate, CHAI, UC Berkeley