Senior DevOps Engineer
Loggly
(San Francisco, California)
Loggly is the world’s most popular cloud-based log management solution, used by more than 9,000 customers to spot problems in real time, pinpoint root causes, and resolve operational issues faster to ensure application success. Our log management service is simple to scale and consume, designed around the needs of modern DevOps teams, and purpose-built to dramatically simplify the chore of log management for organizations from start-ups to the Fortune 500.
We are looking for an experienced DevOps engineer who has built and scaled fast-growing, data-based businesses. You will be responsible for growing our infrastructure, which is built on a hybrid cloud and currently includes multiple datacenters. We are also looking for skills on the development side of DevOps, as we believe in automating everything.
As a DevOps Engineer at Loggly, you are responsible for bringing and spreading the knowledge, ideas, and hands-on implementation skills needed to deliver and run cloud-based software services.
You’ll work closely with all parts of the company to help improve our product and ensure an excellent customer experience. You will work within an Agile environment to shepherd updates from development through to production. You will revamp our automation tools to better suit our growing environments, improve our monitoring to catch and proactively repair issues before they become fires, and determine which areas of the infrastructure require improvement, upgrades, or expansion due to service growth. You will help deploy new hardware and data centers as the infrastructure grows. Participation in an on-call rotation is required.
You’re an engineer who enjoys working with developers to continuously improve the software, believes that any non-trivial task should be automated, understands and values log files, thinks that processing terabytes of data in near real-time is a good start, and enjoys squeezing the last cycle of power out of servers, VMs, and software.
- Enjoy helping others around you grow and be successful as developers
- Have excellent written and verbal communication skills
- Can be autonomous and self-driven
- Get inspired on a daily basis, thinking of new ideas and sharing them with others
- Take pride in creating new ways to do things and know that the creative process requires momentum, chaos, vibrancy, spontaneity, debate, and silence
- Maintain and enhance core or shared components of the product as well as internal tools
- Provide architectural input for system design to improve scalability, reliability, and adaptability of infrastructure
- Work closely with the software development team and other parts of the company to provide a robust, flexible, and scalable platform that enables both product development and new service offerings
- Ensure processes adapt and evolve to reflect current and future best practices
- Deploy and maintain system automation technologies to streamline operations
- Manage incident response protocol and provide hands-on quarterbacking during major service interruptions
- 5-10 years of experience administering Big Data processing infrastructures capable of handling hundreds of terabytes to petabytes of data
- Experience with private and public cloud environments (especially AWS), AWS services such as Route 53, ELB, ADX, VPC, ElastiCache, RDS, and S3, and the AWS APIs.
- Hands-on experience in designing and implementing automation systems for configuration management and code deployment
- Experience working with geographically distributed systems and complex network topologies.
- A high level of experience with open source tools and the OSS community.
- Networking knowledge (TCP/IP, firewall, load-balancing, etc.)
Technical Skills:
- Excellent Unix/Linux server administration skills, including package management, bare metal installations, and virtualization.
- Data center build-outs and management (power and HVAC calculations, rack and stack, lights out management).
- Excellent system automation experience with Ansible, Puppet, or Chef.
- Excellent scripting skills in shell, Python, Ruby, or Perl.
- Solid understanding of Java applications, memory, and JVM management.
- Knowledge of security best practices, policies, and procedures is a huge plus.
- Experience maintaining monitoring systems like Nagios and New Relic.
- MySQL administration experience.
- Systems and software agnostic: will use the best available tool for the job.
Bonus Skills and Experience:
- Experience with Kafka, ZooKeeper, Elasticsearch, Hadoop, or other NoSQL datastores is a huge plus.
- Familiar and comfortable with Apache, Nginx, Varnish, Django, MySQL, etc.
- Experience with Arista switches, Juniper, JunOS, Cisco IOS, and network analysis tools
- Experience with designing and managing networks is a huge plus
- Experience with continuous integration and version control systems (Jenkins, Git, etc.)