Lead, Systems Engineering & Operations

Turn

(Redwood City, California)
Full Time
Job Posting Details
About Turn
Turn delivers real-time insights that transform the way leading advertising agencies and enterprises make decisions. Our digital advertising hub enables audience planning, media execution, and real-time analytics from a single login, and provides point-and-click access to more than 150 integrated marketing technology partners.
Responsibilities
- Lead by example, directing the work of operations engineers responsible for production monitoring and support of critical infrastructure
- Work with TechOps management and the team to create high-level roadmaps and team strategy
- Serve as the technical escalation point for critical production issues and drive escalation and resolution of problems. A “Full Stack” perspective and expertise are highly valued
- Drive requirements for automation and tooling needs as well as other cross-functional business priorities, including helping define, develop and maintain monitoring tools and automation systems within the team
- Define server sizing and keep up with the newest server, networking and storage hardware technologies
- Lead collaboration with NOC, Datacenter Operations, Security Operations and Data Infrastructure teams to achieve well-orchestrated infrastructure operations and high reliability of the Turn Platform
- Become proficient in understanding how each software component and system/Hadoop/database design and configuration are linked together to form an end-to-end solution
- Plan system and network maintenance while minimizing impact on the production environment
- Perform periodic on-call duty as part of the rotation, maintaining the availability and performance of the Turn Platform
- Share ownership of the following infrastructure components: Puppet, Docker, Mesos/Marathon, OpenStack, Nagios/Icinga/Thruk, OpenTSDB, Logstash/Flume, Elasticsearch, Kibana/Grafana, Zookeeper, Kafka and other core systems services
Ideal Candidate
- 7+ years of relevant work experience, or a BA/BS degree in CS, Systems Administration or a related field
- A strong background in internet service deployment, provisioning, IP networking, service infrastructure, and software deployments
- Strong Linux systems administration skills (we use CentOS)
- Experience with configuration management tools such as Puppet or Chef
- Strong organization and multi-tasking abilities
- Solid verbal and written communication skills
- Proven ability to quickly learn and implement unfamiliar technologies
- Advanced knowledge of Linux, TCP/IP and web services
- Proficiency in Python or Ruby for developing automation tools
- Troubleshooting skills that range from diagnosing low-level hardware problems to large-scale failures within datacenter clusters
- Solid experience with ITILv3 methodologies and practical ways of implementing them
Preferred Qualifications
- Experience with medium to large-scale distributed Unix/Linux systems administration and performance tuning in latency-sensitive production environments
- OS hardening, security and compliance processes, and security tools
- Experience with cloud orchestration and private/public cloud management (SaltStack, OpenStack, AWS, Google Cloud)
- Experience with MongoDB, Redis, CouchDB or Elasticsearch is a plus
- Experience with Hadoop is a plus
- Prior Java development experience is a plus
