Senior SysOps Engineer

ROBLOX

(San Francisco Bay Area)
Full Time
Job Posting Details
About ROBLOX
ROBLOX is the best place to Imagine with Friends™. With the largest user-generated online gaming platform, and over 15 million games created by users, ROBLOX is the #1 gaming site for kids and teens (comScore). Every day, virtual explorers come to ROBLOX to create adventures, play games, role play, and learn with their friends in a family-friendly, immersive, 3D environment.
Summary
Our infrastructure management system is highly automated, enabling engineers to interact with thousands of servers across multiple environments in parallel via the command line. Our application space includes database scalability, caching, message queuing, wide-table search, recommendations, user ratings, services for mobile devices, gaming cloud management, and virtualization. We live in both the cloud and traditional data center environments.
Responsibilities
* Maintain a 99.9% SLA for a 24/7/365 production environment * Develop automation to automatically detect and recover production issues in all layers from hardware, configuration, networking, and vendor outages, and continuous software upgrades * Quickly resolve complex problems encountered during the installation and operation of our applications, to ensure negligible impact on players and internal operations * Prioritize and multitask daily responsibilities, while being flexible enough to respond to emergent high-priority issues * Design, deploy and configure extremely complex computing environments * Assist in implementing disaster recovery procedures and system failover efforts * Collaborate with highly skilled and driven colleagues in a dynamic, agile environment * Manage systems and develop contingency plans * Participate in on-call rotations
Ideal Candidate
**Qualifications:** * Bachelor’s degree in Engineering or Computer Science, or equivalent work experience * 7+ years of experience in ops-related systems implementation * Extensive experience in both Windows and Linux (any flavor) * Experience maintaining and debugging L2/L3 networks * Experience in hardware failure diagnostics, security and database capacity * Experience with server performance and failure monitoring/diagnostics * Knowledge of debugging common scripting languages: PowerShell, bash, python. **Great to have:** * Working knowledge of a continuous integration platform such as Jenkins or TeamCity * Experience with automation tools (Chef, Puppet, Orchestra, etc.)
Compensation and Working Conditions
Benefits Benefits included

Additional Notes on Compensation

Robust medical, dental and vision insurance, 401k.

Questions

Answered by on
This question has not been answered
Answered by on

There are no answered questions, sign up or login to ask a question

Want to see jobs that are matched to you?

DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.