William Saunders
Google Scholar ID: 8hjFFAoAAAAJ
OpenAI
AI Alignment
AI Safety
Deep Reinforcement Learning
Natural Language Processing
Machine Learning
Citations & Impact
All-time
Citations: 11,161
H-index: 10
i10-index: 10
Publications: 14
Co-authors: 2 (list available)
Publications (3 items)
Emotion Concepts and their Function in a Large Language Model (2026). Cited: 0
Open Problems in Mechanistic Interpretability (2025). Cited: 0
RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts (arXiv.org, 2024). Cited: 24
Co-authors (2 total)
Jeffrey Wu (Anthropic AI, OpenAI)
Owain Evans (Affiliate, CHAI, UC Berkeley)