Sirnam Swetha
Scholar

Sirnam Swetha

Google Scholar ID: XwocaTcAAAAJ
University of Central Florida
Computer VisionMultimodal LearningVideo UnderstandingDeep Learning
Citations & Impact
All-time
Citations
79
 
H-index
6
 
i10-index
3
 
Publications
20
 
Co-authors
14
list available
Resume (English only)
Academic Achievements
  • Oct'25: A paper accepted at ICCV 2025 for oral presentation (Top 0.6% papers); Jun'25: Co-organized Workshop on VideoLLMs at CVPR 2025 and served as Area Chair; Mar'25: Organized Challenges on VideoLLMs at CVPR 2025; Sep'24: Third place in Perception Challenge at ECCV 2024; Jul'24: First author paper 'X-Former' accepted at ECCV 2024; Jul'23: First author paper 'Multi-Sinkhorn Knopp' accepted at ICCV 2023.
Research Experience
  • May 2024 - present: PhD Applied Scientist Intern at Amazon, Palo Alto, California, USA, enhancing visual preference alignment to improve general MLLM understanding; May 2023 - present: PhD Applied Scientist Intern at Amazon, Palo Alto, California, USA, enhancing fine-grained visual perception capabilities of MLLMs; May 2022 - present: PhD Applied Scientist Intern at Amazon, Seattle, Washington, USA, developing a detailed video description framework for long-form videos; Analyst at Goldman Sachs, Bengaluru, Karnataka, India, working with the Investment Banking Team.
Education
  • PhD: University of Central Florida, Center for Research in Computer Vision (CRCV), advised by Prof. Mubarak Shah; B.Tech with Honors and MS by research: International Institute of Information Technology, Hyderabad (IIIT-H), advised by Prof. C. V. Jawahar (CVIT, IIIT-H) and Prof. Vineeth N Balasubramanian, IIT-H.
Background
  • Research interests include self-supervised learning, multi-modal learning, covering MLLMs, spatial and temporal reasoning in VideoLLMs, visual preference alignment, safety, and bias analysis.
Miscellany
  • Personal interests and other information not provided.