Network Engineer, Operations & Support
Description
Meta is seeking a forward-thinking Network Engineer to join the Edge & Network Services (ENS) team. This role focuses on improving operational efficiency and reliability across one of the world’s most dynamic and hyperscaler networks. The successful candidate will possess advanced technical skills in networking, systems, automation/tooling, and will thrive in a rapidly evolving environment.This truly global team offers a unique career opportunity to work with all of the latest network technologies and talented engineers solving some of the most complex problems in the industry.
Responsibilities
Be the key Subject Matter Expert on operating & managing Meta's production networks in a hyper-scale, heterogeneous and hybrid infrastructure Formulate the right metrics and definitions of success to drive quality, efficiency, cost, and timeliness, and evolve these over time to match changes to the infrastructure and business requirements Develop the operational process improvement plans, and transform the improvements to scalable and automated workflows by writing and reviewing the code to improve the operational efficiency Perform analysis on complex technical issues across networks, ranging from automated tooling to hardware failures and network issues Anticipate potential operational risks and develop strategies to mitigate/minimize Participate and improve escalation and emergency response with detailed postmortem while addressing issues systematically to prevent future occurrences Build cross-functional relationships with Network Engineering, Systems Engineering, Traffic, Logistics, Program Management, and OEM partners to deliver exceptional operational results and manage the performance of external vendors Work with partner teams and vendors to manage day-to-day operations and reliability of the regional network Participate in an on-call rotation to support the network infrastructure 24x7 15% of travel required (domestic and international)
Qualifications
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience 4+ years of experience with network operations while supporting large-scale network infrastructure Proficient in at least one of the network domains: Backbone IP/Optical network or CDN/Edge network, including topology, protocol, hardware and architectures Experience working within a global team and collaborate with cross-functional teams in a fast-paced and dynamic environment with limited supervision Experience learning new languages, technologies, frameworks and APIs Experience in implementing/maintaining the monitoring, alerting and repairing systems for production network in a DevOps environment Proficient in an operations framework and best practices, such as ITIL v.4, CMMi, Lean Six Sigma Experience managing operations in public cloud environments (OCI, AWS, GCP, etc.), including (but not limited to) monitoring, incident response, and ensuring reliability and security across multiple platforms Working knowledge of network protocols (TCP/UDP, DHCP, DNS) and experience with IPv4 and IPv6 Experience troubleshooting routing and switching protocols (BGP, IS-IS, MPLS, RSVP-TE, VRRP) Demonstrated experience in conducting data-driven analysis Experience in providing technical guidance to external vendors, through escalation (Tier IV) Familiarity with the Linux based systems Experience with scripting and automation (Bash, Python, Perl) Experience with infrastructure automation tools (such as Terraform) is highly desirable