Software Development Engineer III, Annapurna Labs
About the Team
The Neuroboros team was recently created to pursue the ambitious goal of leveraging and expanding Generative AI technologies to help customers benefit from the scale and price/performance equation offered by Amazon Machine Learning hardware. The creation of the team in NYC is key to Annapurna Labs’ location strategy, with the goal of creating an additional hub attracting top talent with varied backgrounds to work on challenging problems, using and building state-of-the-art tooling.
About Amazon Annapurna Labs:
The Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. Because of our team’s breadth of talent, we have been able to improve AWS cloud infrastructure in high-performance machine learning with AWS Neuron, Inferentia and Trainium ML chips, in networking and security with products such as AWS Nitro, Elastic Network Adapter (ENA), and Elastic Fabric Adapter (EFA), and in computing with AWS Graviton and F1 EC2 instances.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
About AWS Neuron:
AWS Neuron is the software for Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the cloud to our AWS customers. Trainium is designed to deliver best-in-class ML training performance at the lowest training cost in the cloud, and it’s all enabled by AWS Neuron. Neuron is software that includes an ML compiler and native integration with popular ML frameworks. Our products are used at scale by external customers like Anthropic and Databricks as well as internal customers like Alexa, Amazon Bedrock, Amazon Robotics, Amazon Ads, Amazon Rekognition and many more.
Job Summary
You will join a dynamic team working at the cutting edge of the GenAI revolution by applying AI to AI. You will work on building agents, tools, and models to simplify and accelerate customer adoption of Neuron, the software stack supporting Amazon's Machine Learning silicon: Trainium. Partnering with external and internal customers, you will identify key obstacles and opportunities to accelerate their migration to AWS's ML silicon. You will be the technical lead for a team building AI agents and tools that simplify AWS Neuron adoption, and drive the team's vision and strategy in this space critical to AWS's Generative AI business.
Key job responsibilities
This role requires collaborating with other Neuron Software teams, Science, AWS AI Services, external partners and customers, with potentially high impact on AWS's top and bottom line. As a senior member of the team applying Generative AI to accelerate Neuron adoption, you will play a key role in shaping this space with the following technical responsibilities:
- Research implementations that deliver the best possible experiences for customers.
- Deliver on goals to improve the time and effort it takes to port and optimize Machine Learning workloads on Neuron.
- Solve challenging technical problems, often ones not solved before, at every layer of the stack.
- Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security.
- Build high-quality, highly available, always-on products.
- Potentially contribute intellectual property through patents.
- Assist in the career development of others, actively mentoring individuals and the community on advanced technical issues and helping managers guide the career growth of their team members.
- Exert technical influence over the team, increasing their productivity and effectiveness by sharing your deep knowledge and experience.
Basic Qualifications
- 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
- 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Experience as a mentor, tech lead or leading an engineering team
- 5+ years of non-internship professional software development experience
- 5+ years of programming experience with at least one software programming language
- Hands-on technical experience working in the Generative AI space
- Excellent written and verbal communication skills with the ability to present complex technical information concisely to executives and non-technical leaders.
- Experience in one or more of the following areas: ML compilers, production coding agents, GenAI model architecture, model training, neural network optimization, or, alternatively, applied math.
Preferred Qualifications
- Master's degree or above in computer science or equivalent
- 2+ years in machine learning or other computational modeling environments with an emphasis on hosting, building or optimizing models for diverse hardware platforms
- Proven track record in building AI agents that automate ML workload optimization, ML compiler tuning, distributed inference and training, or ML kernel authoring and optimization
- Experience working with open-source software communities in the optimization space or related areas
- Domain-level knowledge of AWS services
- Knowledge of the state-of-the-art technology used in the Machine Learning space and its mathematical underpinnings
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.
USA, NY, New York - 184,900.00 - 250,200.00 USD annually