- Masked Vision and Language Modeling for Multi-modal Representation Learning, ICLR 2023
- Semi-supervised Vision Transformers at Scale, NeurIPS 2022
- X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks, ECCV 2022
- Area Chair for CVPR 2023 and ICCV 2023
Research Experience
- Senior Applied Scientist on the Amazon AGI multimodal team
- Research intern at Facebook AI Research (FAIR), Microsoft Research Redmond (MSR), IBM T. J. Watson Research, and the Institute of Automation, Chinese Academy of Sciences (CASIA)
Education
- Ph.D. and M.S. degrees from UC San Diego, advised by Nuno Vasconcelos
Background
- Research Interests: Computer vision and machine learning, especially vision and language understanding, object detection, semi- and self-supervised learning, and low-precision neural networks.