Contributed to the de novo genome assemblies of dozens of species; probed the sequence variations related to autism, cancer, and other human diseases; mapped the transcriptional and epigenetic profiles of tomatoes, corn, and other important plant species; explored the role of microbes in different environments. Among other recognitions, received an NSF CAREER award, a Sloan Foundation Fellowship, and was named a TIME100 recipient in 2022.
Research Experience
At the forefront of distributed and parallel computing in genomics, pioneering the use of cloud computing as an enabling platform to address big data challenges. Expertise spans from low level computer architecture, through sequencing, de novo assembly, variant identification, transcriptome & other -omics data, up to machine learning approaches to build predictive models of diseases and treatment response.
Background
Main research interest is in understanding the structure and function of genomes, especially those of medical or agricultural importance. Core strength lies in developing novel algorithms and computational systems for large-scale biological sequence analysis, including leading algorithms for de novo genome assembly, variant detection, and related –omics assays.