
Data Center Engineer
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a Data Center Technician II, you'll independently manage and prioritize host repair efforts for multiple datacenters, perform initial troubleshooting for ambiguous hardware and network issues, and own well-defined projects with guidance from senior engineers while helping us scale our Core/Edge Data Centers and hardware infrastructure at a time of incredible growth for our business.
You will:
- Manage and prioritize your ticket queue according to defined priorities, performing initial troubleshooting for server and network issues, and escalating clearly when problems fall outside standard procedures.
- Maintain the Core Data Center and hardware infrastructure to meet the large scale and real-time requirements of our Imagination Platform™ to ensure our community has an awesome experience anywhere in the world. This includes all aspects of the server, network infrastructure, power, and environmental life cycles.
- Collaborate across regions to track and mitigate systemic issues preventing hosts from returning to service.
- Identify and solve recurring operational problems through root cause analysis, and propose improvements to runbooks, SOPs, and MOPs to prevent re-occurrence.
- Contribute data, feedback, and requirements to partners building automation, ensuring that automation reflects real-world operational workflows
- Coordinate with peers to establish and uphold best practices related to breakfix, install, decom and all other aspects of datacenter operations.
- Influence, and improve the development platform, infrastructure, standards (Runbooks, SOPs, MOPs), and methods to ensure the goal of scalability and high availability can be achieved.
- Leverage partnerships across teams to ensure prompt expansion and recovery of hardware capacity.
- Actively participate in continuous improvement and ongoing learning within the engineering team
- Assist in coordinating vendors and ensuring quality of outsourced projects
- Participate in the on-call rotation for our critical infrastructure.
- Travel: International and Domestic travel may be required 25%
You have:
- At minimum 3+ years of experience working in large-scale Data Center Infrastructure environments and experience planning, executing, and documenting repairs in the server and networking domains.
- Extensive experience installing, monitoring, and maintaining server and network equipment. This includes brand new server and network provisioning.
- In-depth knowledge of data center environments, servers, and network equipment.
- Proven experience executing on multiple tasks simultaneously.
- Proficiency with server out‑of‑band management tools to perform initial troubleshooting on servers, including when the operating system is not fully available.
- Proficiency with Linux/Unix or Windows command-line tools to collect logs, run diagnostics, and perform initial troubleshooting on servers and network devices
- You have installed various equipment that commonly resides in the data center environment and are able to lift 75 pounds occasionally.
You are:
- Someone who is ready for action wielding a wealth of server and network hardware troubleshooting knowledge to support Roblox’s systems.
- Excited about getting in front of complex problems, and can effectively organize your work to overcome emergent high-impact issues.
- Someone who enjoys building processes and procedures for the day to execute our workload and for developing new capabilities as a team.
- Someone who asks the right questions to solve issues within your expertise and you use data to test your theories. You are able to formulate and identify problems, generate and evaluate a variety of solutions (some of which are novel), and implement the best one(s).
- Someone who is committed to demonstrating professionalism in all interactions with partners both inside and outside Roblox to ensure continued success in cross-functional initiatives. You are able to foster trust and uphold the reputation of the team and company.
For roles that are based at our data center in Ashburn, VA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.
Annual Salary Range
$117,440-$143,870 USD
Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process.
