AgoraResearch hub
ExploreLibraryProfile
Account
Abhay Sheshadri
Scholar

Abhay Sheshadri

Google Scholar ID: dujRau4AAAAJ
Undergraduate, Georgia Institute of Technology
AI Safety
Google Scholar↗
Citations & Impact
All-time
Citations
153
 
H-index
5
 
i10-index
4
 
Publications
7
 
Co-authors
18
list available
Contact
No contact links provided.
Publications
4 items
AuditBench: Evaluating Alignment Auditing Techniques on Models with Hidden Behaviors
2026
Cited
0
Why Do Some Language Models Fake Alignment While Others Don't?
2025
Cited
0
Obfuscated Activations Bypass LLM Latent-Space Defenses
arXiv.org · 2024
Cited
0
Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs
2024
Cited
72
Resume (English only)
Co-authors
18 total
Aidan Ewart
Aidan Ewart
Maths Undergrad @ University of Bristol
Co-author 2
Co-author 2
Aengus Lynch
Aengus Lynch
University College London
Henry Sleight
Henry Sleight
Research Manager, Anthropic Fellows Program, Program Manager, Constellation
Stephen Casper
Stephen Casper
PhD student, MIT
Ethan Perez
Ethan Perez
Anthropic
Dylan Hadfield-Menell
Dylan Hadfield-Menell
Massachusetts Institute of Technology
Asa Cooper Stickland
Asa Cooper Stickland
Research Scientist, UK AI Security Institute

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?