Software Engineer (Leadership), Host Networking
Description
This Software Engineer will be working on NICs and Transport solution addressing growing demands of the distributed fleet of accelerators for our AI workloads. Do you want to work on transport for large scale AI clusters? Do you want to develop innovative solutions to our challenges and ship them into production? This role on our host networking teams is for you!
Responsibilities
Own design and architecture of Drivers and Firmware for NICs supporting AI workloads Collaborate with ASIC and HW teams, and external partners in building infrastructure scale embedded solutions Mentor team members who will also work on building driver and firmware software Work with cross functional teams through releasing software to production and supporting them Help build roadmap for our solutions and the team
Qualifications
Bachelor's degree in Computer Science/Engineering or relevant technical field and 10+ years of experience Proficiency in coding in C/C++ Experience building driver and/or firmware for embedded infrastructure sytems running Linux Experience with RDMA/RoCE and/or TCP stack for Linux Experience with Hardware Bringup Experience developing Drivers and/or Firmware for Networking stack in Linux, preferably for NICs Experience with Congestion control for RDMA/RoCE networks Experience with simulation environments with Qemu and/or emulation environments Working knowledge of Collectives (XCCL) and GPU direct for AI workloads