Sen Yang
Scholar

Google Scholar ID: z5O3DLcAAAAJ
Baidu Inc.
human pose estimation, autonomous driving, computer vision, deep learning
Citations & Impact
All-time
Citations: 1,479
H-index: 8
i10-index: 8
Publications: 14
Co-authors: 0
Resume (English only)
Academic Achievements
  • Publications: SimCC: A Simple Coordinate Classification (ECCV 2022, Oral); TokenPose: Learning Keypoint Tokens (ICCV 2021); TransPose: Keypoint Localization via Transformer (Pattern Recognition).
  • Awards: During an internship at Tencent, proposed an independent token representation method that improved 3DPW metrics by 8%; the resulting paper was published at ICLR 2023 (spotlight, top 25%).
  • Patents & Projects: MLLM architectures (LLaVA, Qwen2.5-VL, LISA); training techniques (SFT, autoregressive models, RL); visual token compression; large-scale distributed training.
Research Experience
  • Baidu VIS, Senior R&D Engineer (2023.7-Present): responsible for research and applied work in multimodal large models, computer vision perception, and decision-making algorithms.
  • Tencent TPG, Intern (2021.12-2022.8): worked on a 3D human reconstruction and motion generation project.
  • Megvii, Intern (2021.1-2021.10): participated in human pose estimation projects and designed a Transformer-based pose estimation model.
Education
  • Ph.D.: Southeast University (2019.5-2023.3)
  • M.S.: Southeast University (2017.9-2019.1)
  • B.S.: Jilin University (2013.9-2017.7)
Background
  • Research Interests: Computer Vision, Deep Learning, Human Pose Estimation, Autonomous Driving Perception, Multimodal Foundation Models.
  • Background: Research engineer at Baidu, focusing on computer vision, multimodal large language models, and autonomous driving.
Miscellany
  • Personal Interests: Passionate about developing innovative solutions that combine cutting-edge research with practical applications.