Published multiple papers at venues including the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) and the International Conference on Machine Learning (ICML). Contributed to the study of Gemini Models in Medicine and to the following works: MOFI: Learning Image Representations from Noisy Entity Annotated Images; Perceptual Grouping in Contrastive Vision-Language Models; STAIR: Learning Sparse Text and Image Representation in Grounded Tokens; On Robustness in Multimodal Learning; Subtle adversarial image manipulations influence both human and machine perception.
Research Experience
Inventor of TensorFlow; deployed many production systems; involved in several large collaborations with Waymo.
Background
Principal scientist and research director at Google DeepMind, interested in vision, language, and learning. Leads organizations focused on machine learning, computer vision, and basic science research.