Site Reliability Engineer Jobs - Site Reliability Engineer, 15545

at TEKsystems, Inc
Location Sunnyvale, CA
Date Posted June 12, 2019
Category Default
Job Type Full-time

Description

SRE Engineer
The Platform Services SRE team's looking for an individual who has intellectual curiosity, breadth
of technical expertise & depth needed for problem solving. The team fosters self-direction and
an open environment for exchange of creative ideas with support and mentorship required to
learn and grow.
This role would be a great fit for someone with operations mindset and a natural instinct on
solving complex engineering issues by creative and innovative solutions. It's not only identifying
problems, but also develop and implement solutions that operate at scale - seeing contribution
directly improve the reliability of Maps platform and services.
The SRE Engineer's Daily Responsibilities Include
* Troubleshooting and problem solving complex issues with thorough root cause analysis..
* Engage in cross functional team discussions on design, deployment, operation and
maintenance.
* Presenting and publishing technical collateral in the form of white-papers, blogs and best
practice guides.
* Interest in designing, analyzing and troubleshooting large-scale distributed systems.
* Systematic problem-solving attitude with strong communication skills and a sense of
responsibility to own an issue and drive to a successful completion.
* Building automation scripts to validate the stability, scalability and reliability of Maps
products & services.
* Ability to debug and optimize code and automate routine tasks.
Preferred Qualifications
* Strong linux system administration and networking knowledge
* Understanding of Software Development, Linux & Networking Technologies
* System Design and Distributed Systems Architecture experience
* Prior experience working for a Site Reliability Engineering team
* One more language fluency such as Python or Ruby
* Experience with Postgres or MySQL administration
* Experience with distributed system performance analysis and optimization
* Experience with distributed (multi-tiered) systems, algorithms, and relational databases
* Ability to effectively articulate technical challenges and solutions
* Deal well with ambiguous/undefined problems; ability to think abstractly
Basic Qualifications
* BS degree in Computer Science or related technical field involving coding or equivalent
practical experience.
* Interest in designing, analyzing and troubleshooting large-scale distributed systems.
* Experience in at least one of the programming/scripting languages: Python, Shell, Java, JS
framework.
* Experience in Hadoop, Spark is a plus
* Experience in at least one of the following: Ansible, Puppet, Chef, Terraform
* Experience with at least one of the following: Kubernetes, Docker, AWS.
* Knowledge in networking protocols & technologies including BGP, MPLS, L3VPN, TCP/IP,
UDP, and SSL/TLS is highly desirable.

About TEKsystems:

We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.