Sr. GPU/Accelerator Hardware Development Engineer, Annapurna Labs
Amazon
- United States
- United States
About
DESCRIPTION: Would you like to develop the Next Generation of AI accelerator compute systems? Lead bleeding-edge HW development projects? Have you heard of Amazon Web Services (AWS) Project Rainer? This is the opportunity to be a part of a fast-moving innovation team that is changing the world of AI at massive scale. At AWS Trainium we develop a complete vertical stack system, from our own Silicon to Hardware to Software and deploy directly to our customers in our own Data Centers
We are seeking experienced Lead System Design Engineers to build the next generation of our cloud server infrastructure, Project Rainier. Project Rainier is a massive $11 billion Amazon Web Services (AWS) AI infrastructure initiative, featuring one of the world's largest compute clusters dedicated to training and running Anthropic's Claude AI models. It utilizes over 500,000 custom Trainium2 chips, designed for high-performance AI training.
As a member of the AWS Trainium Machine Learning Acceleration team you'll be responsible for the System design and optimization of hardware in our data centers. You'll provide leadership in the application of new technologies to large scale server deployments in a continuous effort to deliver a world-class customer experience. This is a fast-paced, intellectually challenging position, and you'll work with thought leaders in multiple technology areas. You'll have high standards for yourself and everyone you work with, and you'll be constantly looking for ways to improve your products performance, quality and cost. We're changing industry, and we want individuals who are ready for this challenge and want to reach beyond what is possible today.
Key job responsibilities We are looking for a Lead Hardware Design Engineer with strong skills in both hardware and software. In this role, you will be responsible for system design, validation, and integration of hardware in the AWS fleet through its entire life cycle. You will work cross functionally with AWS monitoring teams, members of the Hardware Design team, and
additional teams across AWS to improve quality and reliability of products operating in the fleet.
We are looking for candidates who thrive in a fast-paced start-up like environment and work independently to deliver multiple projects in parallel. To be successful, you need to be highly motivated and detailed oriented while meeting the highest standards and time to market, cost and quality goals. BASIC QUALIFICATIONS: - Bachelor's degree in Electrical Engineering or a related field - Experience in developing functional specifications, design verification plans and functional test procedures - 5+ years of relevant work experience with complex accelerator, storage or network server designs. - 3+ years of development experience in hardware/ firmware. Proven ability to debug PCIe, memory or storage subsystems. - Working with interdisciplinary teams to execute product design from concept to production spanning internal hardware, firmware and software teams as well as external design partners. PREFERRED QUALIFICATIONS: - Master's degree or Ph.D. degree in Electrical Engineering or related field - Experience in RTL coding and debug, as well as performance, power, area analysis and trade-offs - Experience with modern ASIC/FPGA design and verification tools - Experience with SOC bring-up and post-silicon validation
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at
https://amazon.jobs/en/benefits
.
USA, CA, Cupertino - 183,000.00 - 247,600.00 USD annually USA, TX, Austin - 159,200.00 - 215,300.00 USD annually USA, WA, Seattle - 159,200.00 - 215,300.00 USD annually]]>
Languages
- English
Notice for Users
This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.