Site Reliability Engineer Jobs - Site Reliability Engineer, 15100

at Agile
Location Atlanta, GA
Date Posted April 4, 2019
Category Default
Job Type Contract to Hire


Site Reliability Engineer

Our client is a data and technology company fostering on innovation, growth and collaboration. The fast-paced, team-driven environment provides the opportunity to work as a key contributor on high priority initiatives by developing new products, solutions and platforms, and supporting technology operations while maintaining the highest standards of quality.

They are seeking a Site Reliability Engineer to join their team in Alpharetta, GA!

Here's what you'll be doing:

  • Leading the configuration, optimization, documentation, and support of the infrastructure components of our application platforms
  • Ensuring that our production systems are installed and running smoothly
  • Streamlining our pipeline from development to production

Here's what our ideal candidate has:

  • Bachelor's Degree in Computer Science, Information Management or in "STEM" Majors
  • Experience with configuring, customizing, and extending monitoring tools (Appdynamics, Apica, Sensu, Grafana, Prometheus, Graphite, Splunk, Zabbix, Nagios etc.)
  • Excellent hands-on programming knowledge in Application Development
  • 3+ years' experience with all stages of an agile software development lifecycle (CI/CD) supporting Java/Javascript UI applications (ex: Angular JS) and SAAS applications
  • 5 years of experience building JavaEE applications using, build tools like Maven/ANT, Subversion, JIRA Jenkins, Bitbucket and Chef
  • 3-5 years' experience in continuous integration tools (Jenkins, SonarQube, JIRA, Nexus, Confluence, GIT-BitBucket, Maven, Gradle, RunDeck, is a plus)
  • 2+ years' experience with configuration management and automation (Ansible, Puppet, Chef, Salt)
  • 2+ years' experience deploying and managing infrastructure on public clouds (AWS, GCP, or Azure)
  • Experience working with Nginx, Tomcat, HAProxy, Redis, Elastic Search, MongoDB, and RabbitMQ, Kafka, Zookeeper
  • 3+ years' experience in Linux environments (CentOS)
  • Knowledge of TCP/IP networking, load balancers, high availability architecture, zero downtime production deployments. Comfortable with network troubleshooting (tcpdump, routing, proxies, firewalls, load balancers, etc.)
  • Demonstrated ability to script around repeatable tasks (Go, Ruby, Python, Bash)
  • Experience with large scale cluster management systems (Mesos, Kubernetes)
  • Ability to dive into any level of a modern internet service (schedulers, containers, Linux kernel, caching, object storage, distributed filesystems, RDBMS, NoSQL, etc.)
  • Ability to troubleshoot and debug applications (C, Java, Go)
  • Demonstrated ability to quickly and accurately troubleshoot system issues
  • Excellent written and verbal communication skills with the ability to communicate with team members at various levels, including business leaders
  • Experience with Docker-based containers is a plus

Benefits: Our IT consultants enjoy a wide array of benefits including: medical, dental, 401K, life insurance, and much more.