Senior Site Reliability Engineer
ROBLOX
(San Mateo, California)ROBLOX is the best place to Imagine with Friends™. With the largest user-generated online gaming platform, and over 15 million games created by users, ROBLOX is the #1 gaming site for kids and teens (comScore). Every day, virtual explorers come to ROBLOX to create adventures, play games, role play, and learn with their friends in a family-friendly, immersive, 3D environment.
As a Sr. Site Reliability Engineer, you’ll play a critical role in helping us scale our software stack and hardware infrastructure at a time of incredible growth for our business. At Roblox, you’ll have boundless opportunities to shape the future of the Imagination Platform™ and demonstrate your passion for delivering awesome solutions in front of a global audience. If you know what it takes to build systems that can sustain over one million concurrent players year-round and you take play as seriously as we do, you’ll fit right into our highly-skilled and ever-expanding engineering team.
- Develop and deliver solutions to meet the requirements of large scale, real time and 5 9s uptime to ensure our community has an awesome experience on the Imagination Platform™ from anywhere in the world.
- Identify and solve critical problems and prevent them from reoccurring via root cause analysis and automation.
- Create, influence and improve the development platform, infrastructure, standards, and methods to ensure our goals of scalability and high availability.
- Develop and share best practices with development teams to improve scalability and reliability of the Imagination Platform™.
- Work with a team that is currently distributed throughout the US and Canada, and soon to be globally.
- While we are growing at our current pace, you may be asked to participate in the on-call rotation for critical infrastructure pieces.
- Experienced: you have a BS degree (or equivalent professional experience) in Computer Science or related engineering field with at least 8 years of hands on experience.
- A coding champion: your prowess in a multitude of programming languages such as Ruby, C#, and Java, as well as scripting languages including Python, Bash, and PowerShell, empower your understanding of the challenges of building large-scale systems.
- A Linux (Ubuntu) and/or Windows expert: you have solid administration skills in either OS, good system-analysis, configuration, and troubleshooting experience.
- Passionate about automation: you have 3+ years of hands-on experience with at least one configuration management solution (Chef, Puppet, etc.).
- Up-to-speed on all things Cloud: you have working experience with public cloud (AWS preferred) and private cloud (OpenStack preferred) solutions.
- Ambitious: you boldly go where no man or woman has gone before; Consul.io, Vault.io, etcd, Docker, Mesos, InfluxDB etc. might not be technologies you’ve used, but you are keen to learn and grow.
- Adaptable: you are capable of adjusting to new challenges, and experimentation is in your blood.
Benefits | Benefits included |
---|
Additional Notes on Compensation
Unlimited paid vacation. Gym reimbursement. Free catered lunches & a fully stocked kitchen with unlimited snacks. 401K. Robust medical, dental and vision insurance. Free onsite parking & other commuter benefits.
Questions
There are no answered questions, sign up or login to ask a question

Want to see jobs that are matched to you?
DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.