Site Reliability Engineer Jobs - Infrastructure Architect, 15312

at Syntagma Group
Location Boston, MA
Date Posted May 7, 2019
Category Default
Job Type Full-time

Description

Infrastructure engineers work closely with engineering teams to build and operate the services that make up the Maxwell platform. Here are the key outcomes we are looking for: • Monitoring our core services in Kubernetes and AWS • Identifying security gaps in our structure and how to fix them • Writing and reviewing Terraform for our infrastructure as code • Creating new dashboards and helping teams understand their services through monitoring • Documenting everything you discover in our docs repo in github • Taking on new backlog items in our agile process of two-week development iterations • Mentoring engineers on operational best practices for their owner-operated microservices Our Tech Stack Maxwell's products are supported by an exciting mix of modern technologies. All of our services are containerized. All the infrastructure we build out, in Terraform and Ansible, is in github. We store data in MySQL, MongoDB, Redis, and Elasticsearch. We use logz.io for logging, Datadog and New Relic for monitoring, and VictorOps for alerting. Services are deployed to Kubernetes clusters using a CI/CD pipeline and are hosted on AWS. Requirements • Experience working in operations or infrastructure in a DevOps or SRE environment • Proficiency with at least one language (Python, Ruby, Bash, Go, whatever works!) • Working knowledge of AWS (with knowledge of Elasticsearch or RDS as a plus) • Proficiency with Linux • Experience with logs and metrics collection and analysis • Expertise in troubleshooting infrastructure and performance issues • Handling of security issues, with a background in the field a plus • Bachelors Degree