Staff Research Scientist at the MIT-IBM AI Lab, IBM Research.
Research Affiliate in Electrical Engineering & Computer Science at MIT and Principal Investigator (PI).
Research interests broadly include Representation Learning, Generative AI, Foundation Models, Trustworthy Machine Learning, Large Language Models, Reinforcement Learning, Dynamical Systems, and ML Optimization methods.
Current research focuses on long-context understanding, uncertainty quantification, human-centric system design for LLMs, and time-series foundation models.
Working on post-training techniques and alignment methods for LLMs, including fine-tuning, instruction tuning, synthetic data generation, and prompt optimization.
Generated large-scale long-context instruction-following data with diverse reasoning tasks for training Granite models; contributed to the post-training of IBM’s LLM Granite 3.1 and multimodal model Granite Vision.