Cloud Operations Engineer

EMS Software

(Centennial, Colorado)
Full Time Entry-Level
Job Posting Details
About EMS Software
For three decades and counting, EMS Software is committed to helping people create great meetings. EMS offers the greatest depth and breadth of functionality and the fastest and easiest way to schedule a meeting. We deliver this by anticipating market and organizational trends and listening to our customers’ needs.
Summary
As our fourth Operations Engineer, you will help continue to define our transformation from an on-premise enterprise software solution to that of a SaaS solution hosting a number of Fortune 100 clients. We’re looking for an engineer with a development background who has some operational experience and expertise spanning high availability systems in both lower and production environments, a DevOps mentality of continuously improving the system, and a firm grasp on automation and cloud architectures. You must have extensive experience supporting applications developing in at least 3 of the following: .NET, Java, JavaScript, Python, Node, GO or Ruby. You should also be passionate about solving problems and developing creative solutions leveraging automation.
Responsibilities
**Your First Three Months** In your first month, as your familiarity with the product grows, your responsibilities and influence will grow as well. You, along with your team, will be responsible for supporting the product team’s operational needs in the lower environments. You will collaborate with other members of the operations team in established patterns and continue to hone your skills as you push the design, architecture and implementation of our CI pipelines (lower and upper environments) to their next phase. Within two months, you and your team will fill in the gaps to have a well-tested, low-latency and highly available environment for all our product lines. Working with the development team, you will help figure out the gaps in creating and supporting a truly scalable product offering. Your team will be responsible for supporting production environments. Within three months, you will help drive changes to the operational and development roadmap as we inch closer to onboarding 20% of our customer base into hosted production environments by the end of 2017. **What You’ll Do** * Design, provision, configure and maintain the platform operations to handle the scale of running several application stacks in the cloud that will be consumed worldwide * Automate the deployment and maintenance of cloud platform technologies * Oversee production operations, log management, data warehouse, and database operations, including management of Splunk services * Ensure all monitoring systems (IT, development, service management, Apdex) are in place * Enforce consistency of monitoring, reporting, and alarming systems * Help drive process improvements for service management, including: outage/incident management, rollbacks and reporting * Research emerging virtualization techniques and advise management * Perform capacity management, load and scalability planning * Ensure compliance with deployment and operations documentation * Assist management in development and optimization of operational cost models * Design cloud infrastructure for high reliability and availability * Build strategic and tactical plans for continued improvement of cloud architecture and operations * Assist in the establishment of 24x7 performance monitoring and response protocols * Provide on-call support outside of normal work hours/days
Ideal Candidate
**About You** * You’re driven, humble, and autonomous * You’re a quick study, a strong communicator, and you’re able to adapt to a fast-paced environment * You have a working knowledge of Agile Development practices (e.g., SCRUM, TDD) * You are or have the mindset of a developer, but are intrigued by the operational aspects of hosting developed solutions * You are devoted to automation * You’re an expert in Windows (IIS, SQL Server) and Linux * You have at least 1 years of hands-on production experience with Amazon Web Services (AWS), Google Cloud or Microsoft Azure. This includes: * Configuration of VPCs, with VPN to corporate network * Experience setting up, maintaining and monitoring global production environments, QA and staging environments, with a strong understanding of the differing needs of such environments * At least 6 months of experience in a professional production environment * At least 6 months of experience managing networking infrastructure and monitoring at the application level * Performance optimization experience, including: troubleshooting and resolving network and server latency issues; performing hardware evaluation/selection tasks; performance vs cost vs time analysis * At least 1 year of experience with automation or scripting tools (e.g., GO, Python, Shell, PowerShell) * At least 6 months of experience with Ansible, Jenkins * You’re detail-oriented, with excellent documentation skills, and you’re someone who can successfully manage multiple priorities * Troubleshooting skills that range from diagnosing hardware/software issues to large scale failures within a complex infrastructure **Other Things We Hope You Have** * Bachelors in Computer Science or equivalent work experience * Experience with Mongo, MS SQL Server, Splunk, Grafana, Terraform and Prometheus * Experience working with Docker, Kubernetes and GO Hands-on experience with performance, load and security penetration testing * Hands-on experience with building out and maintaining a continuous integration and delivery pipeline

Questions

Answered by on
This question has not been answered
Answered by on

There are no answered questions, sign up or login to ask a question

Want to see jobs that are matched to you?

DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.