Scholar

Sirnam Swetha

Google Scholar ID: XwocaTcAAAAJ

University of Central Florida

Computer VisionMultimodal LearningVideo UnderstandingDeep Learning

Citations & Impact

All-time

Citations

H-index

i10-index

Publications

Co-authors

list available

Contact

Publications

6 items

2026

Cited

2025

Cited

2025

Cited

2025

Cited

2025

Cited

2025

Cited

Resume (English only)

Academic Achievements

Oct'25: A paper accepted at ICCV 2025 for oral presentation (Top 0.6% papers); Jun'25: Co-organized Workshop on VideoLLMs at CVPR 2025 and served as Area Chair; Mar'25: Organized Challenges on VideoLLMs at CVPR 2025; Sep'24: Third place in Perception Challenge at ECCV 2024; Jul'24: First author paper 'X-Former' accepted at ECCV 2024; Jul'23: First author paper 'Multi-Sinkhorn Knopp' accepted at ICCV 2023.

Research Experience

May 2024 - present: PhD Applied Scientist Intern at Amazon, Palo Alto, California, USA, enhancing visual preference alignment to improve general MLLM understanding; May 2023 - present: PhD Applied Scientist Intern at Amazon, Palo Alto, California, USA, enhancing fine-grained visual perception capabilities of MLLMs; May 2022 - present: PhD Applied Scientist Intern at Amazon, Seattle, Washington, USA, developing a detailed video description framework for long-form videos; Analyst at Goldman Sachs, Bengaluru, Karnataka, India, working with the Investment Banking Team.

Education

PhD: University of Central Florida, Center for Research in Computer Vision (CRCV), advised by Prof. Mubarak Shah; B.Tech with Honors and MS by research: International Institute of Information Technology, Hyderabad (IIIT-H), advised by Prof. C. V. Jawahar (CVIT, IIIT-H) and Prof. Vineeth N Balasubramanian, IIT-H.

Background

Research interests include self-supervised learning, multi-modal learning, covering MLLMs, spatial and temporal reasoning in VideoLLMs, visual preference alignment, safety, and bias analysis.

Miscellany