About the job
We are looking for a dynamic, energetic Lead AI Cluster Models Architect to join our growing team. As a key contributor to the success of AMD's products, you will be part of a leading team that drives and improves AMD's ability to deliver the highest-quality, industry-leading technologies to market. AMD's Systems Design Engineering team fosters and encourages continuous technical innovation to showcase successes and to facilitate continuous career development.
Responsibilities
Design state-of-the-art model architectures, data, and parameter sets for large AI/ML training and inferencing systems that can be optimized for hyperscale capabilities
Engage with AMD's customer base while aligning system and model architectures
Pioneer system and container networking strategies to facilitate seamless operation and scaling of AI clusters
Develop scalable AI/ML training and inferencing communication network reference architectures for each generation of AMD AI/ML products
Participate in the design phase of each AMD AI/ML GPU generation by developing cluster computational architectures and requirements
Collaborate across AMD internal and external partner teams to improve performance for AMD AI/ML clusters
Qualifications
Minimum
No minimum qualifications listed.
Preferred
In-depth knowledge and experience with AI clusters and topologies
Extensive real-world experience designing hyperscale computing clusters
Strong analytical and problem-solving skills and pronounced attention to detail
Must be a self-starter and able to independently drive tasks to completion