Scholar
Euan Ong
Google Scholar ID: vT2qcI0AAAAJ
Anthropic
Machine Learning
Science of Deep Learning
Mechanistic Interpretability
Algorithmic Reasoning
Citations & Impact (all-time)
Citations: 334
H-index: 7
i10-index: 5
Publications: 13
Co-authors: 0
Publications
4 items
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers (2025). Cited: 0
Auditing language models for hidden objectives (2025). Cited: 0
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming (2025). Cited: 0
Compact Proofs of Model Performance via Mechanistic Interpretability (arXiv.org, 2024). Cited: 4