What are the responsibilities and job description for the AI/ML Software Application Optimization Engineer – DCGPU position at Advanced Micro Devices, Inc?
WHAT YOU DO AT AMD CHANGES EVERYTHING
We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
AMD together we advance_
THE TEAM
AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise Data Centers, (AI) Artificial Intelligence, HPC and Embedded systems. If this resonates with you, come and joining our Data Center GPU organization where we are building amazing AI powered products with amazing people
THE ROLE
AI/ML Software Application Optimization Engineer – DCGPU
The Datacenter GPU Software Product Applications Engineer is responsible for the technical execution of AMD's Datacenter graphics hardware/software subsystem projects for AMD OEM partners and enterprise commercial end-customers. This position offers a unique opportunity to apply your strong graphics, compute, datacenter, AI / Machine Learning as well as program management skills, to collaboratively work with customers that use AMD Instinct™ Accelerators.
THE PERSON:
An engineer computational scientist, or physicist with experience in multiple scientific computing domains and experience with using Machine Learning techniques in an AI setting. Must be self-motivated and possess the ability to work well within a team environment.
KEY RESPONSIBILITIES:
- Port and optimize a variety of AI and machine learning based models and applications for AMD GPU and CPU systems
- Provide domain specific knowledge to other groups at AMD
- Engage with AMD product groups to drive resolution of application and customer issues
- Develop and present training materials to internal audiences, at customer venues, and at industry conferences
PREFERRED EXPERIENCE:
- Broad experience building, running and tuning AI and machine learning models.
- In depth knowledge of current machine learning frameworks and commonly used models for training and inference
- Strong performance analysis skills for both GPU and CPU
- Extensive experience with C and Python
- Familiarity with distributed model training via NCCL/RCCL, MPI, or similar network technologies
- Experience in implementing and optimizing parallel methods on GPU accelerators in distributed memory systems with MPI, CUDA, HIP, OpenMP, etc.
- Familiarity in scientific computing disciplines such as computational chemistry, fluid dynamics, weather modeling, and oil and gas applications
- In-depth understanding of IO, parallel file systems, and network limitations and capabilities as used in AI models
- Familiarity with installation and setup of various AI applications and machine learning frameworks
- Experience provisioning clusters and validating their performance for use in machine learning applications
- Experience with build system tools including Make, CMake, autoconf, and autotools
- In-depth knowledge of software development practices including debug, test, revision control, documentation, and bug tracking
- Strong team development skills including demonstrated expertise with git and JIRA
- Ability to work well in geographically dispersed teams
ACADEMIC CREDENTIALS AND EXPERIENCE:
- 10-15 years of relevant industry experience
- Masters or PhD in Computer Science, Computational Physics, Engineering or related subjects, or equivalent experience
- Experience working with customers in a support function.
- Strong written and verbal communication skills and knowledge of program management practices.
- Self-started, detail oriented, organized, and capable of multi-tasking in a fast-moving environment.
- Motivated to provide highly responsive support as needed and to work independently
#LI-RL1
#LI-Hybrid
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.