Sr./Lead Automation Engineer, DevOps

Turn

(Redwood City, California)
Full Time
Job Posting Details
About Turn
Turn delivers real-time insights that transform the way leading advertising agencies and enterprises make decisions. Our digital advertising hub enables audience planning, media execution, and real-time analytics from a single login, and provides point-and-click access to more than 150 integrated marketing technology partners.
Summary
Turn is seeking a hands-on, seasoned, and driven lead for our Technical Operations team. You will lead our mid- to senior-level operations engineers. The team is responsible for designing, managing, monitoring, and maintaining Turn's server infrastructure, and covers several areas, including Systems, Tools, and Automation. The efficient, stable, and fast operation of our mission-critical, large-scale infrastructure is crucial to our business. The successful candidate will have a strong understanding of distributed computing and high-availability concepts, and solid knowledge of networking and Linux systems management. He or she will possess excellent written and verbal communication skills and be able to interact effectively and professionally with team members, internal customers, and engineers. The ability to rapidly assess, analyze, and resolve complicated issues with little initial information or direction, and with varying degrees of ambiguity, is required.
Responsibilities
- Write tools to automate infrastructure lifecycle, monitoring configuration, and automatic issue remediation
- Own support and operations of core infrastructure components, such as Kafka, Zookeeper, etc.
- Serve as technical escalation point for critical production issues and drive escalation/resolution of problems
- Ensure smooth collaboration with NOC, Datacenter Operations, Security Operations, and Data Infrastructure teams to achieve well-orchestrated infrastructure operations and high reliability of the Turn Platform
- Plan and execute system and network maintenance while minimizing impact on the production environment
- Perform periodic on-call duty as part of the rotation, maintaining the availability and performance of the Turn Platform
Ideal Candidate
- Strong automated configuration management skills and prior experience with Puppet, Chef, or Ansible
- Comfortable with scripting (we use Ruby and Python and a bit of shell)
- Strong Linux systems administration skills (we use CentOS)
- Strong organization and multi-tasking abilities
- Solid verbal and written communication skills
- Proven ability to quickly learn and implement unfamiliar technologies
- Troubleshooting skills that range from diagnosing low-level hardware problems to large-scale failures within datacenter clusters

**Preferred Qualifications**
- Experience with medium- to large-scale distributed Unix/Linux systems administration and performance tuning in latency-sensitive production environments
- OS hardening, security and compliance processes, and security tools
- Experience with cloud orchestration and private/public cloud management
- Experience with Hadoop is a plus
- Prior Java development experience is a plus
