Scholar
Abhay Sheshadri
Google Scholar ID: dujRau4AAAAJ
Undergraduate, Georgia Institute of Technology
AI Safety
Citations & Impact
All-time
Citations
153
H-index
5
i10-index
4
Publications
7
Co-authors
18
list available
Contact
No contact links provided.
Publications
4 items
AuditBench: Evaluating Alignment Auditing Techniques on Models with Hidden Behaviors
2026
Cited
0
Why Do Some Language Models Fake Alignment While Others Don't?
2025
Cited
0
Obfuscated Activations Bypass LLM Latent-Space Defenses
arXiv.org · 2024
Cited
0
Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs
2024
Cited
72
Co-authors
18 total
Aidan Ewart
Maths Undergrad @ University of Bristol
Aengus Lynch
University College London
Henry Sleight
Research Manager, Anthropic Fellows Program, Program Manager, Constellation
Stephen Casper
PhD student, MIT
Ethan Perez
Anthropic
Dylan Hadfield-Menell
Massachusetts Institute of Technology
Asa Cooper Stickland
Research Scientist, UK AI Security Institute