#1107
About this Role:
The Site Reliability Engineering (SRE) team provides leadership, direction, and accountability for building and running large-scale software systems. As a Site Reliability Engineer, you will identify and deliver automation solutions designed to ensure high availability and resiliency using your expertise in software development, complexity analysis, and scalable system design. Strong collaboration skills will be required to work closely with other engineering teams to ensure services/systems are highly stable and performant, meeting the expectations of our business partners and end-users.
Partner with the architecture and development teams on how to make applications highly available, reliable, and performant at global scale
Collaborate with the architecture team to ensure reliability factors are accounted for in business features and enablers
Guide development teams in understanding established service level objectives and consequences, and implementing appropriate SLIs to support the objectives
Collaborate with development team members to swarm, troubleshoot, and resolve problems
Guide ad-hoc teams to brainstorm solutions and build implementation plans based on the Root Cause Analysis of production issues
Design and build automated solutions to optimize application/service/platform uptime with minimal human intervention
Be available for an on-call rotation to participate in troubleshooting and communication efforts outside of normal business hours
Implement and help create standards and best practices, and mentor other team members in order to drive adoption across development teams
QUALIFICATIONS
What makes you a dream candidate?
Knowledge and Skills
Expert in defining, implementing, and evaluating Service Level Objectives (SLO) and Service Level Indicators (SLI), and associated consequences
Software development expertise in two or more high-level programming and scripting languages
Experience in evolutionary database design, query performance analysis, and indexing as a cornerstone for delivering scalable, performant products and services
Experience in designing, building, and optimizing automated pipelines with automated testing and automated security controls
Experience in performing Root Cause Analysis and Problem Management
Experience working in Agile Scrum teams with demonstrated success leading improvements (getting better/faster/happier)
Help establish and maintain a culture of learning through the development and sharing of skills, knowledge, process and tools; combat traditional silos that create "us and them" environments
A driving passion for finding solutions to hard problems at scale and operationalizing them
Exceptional critical thinking and communication skills, with a passion for leveraging documentation as a tool for constant improvement
Pipeline Automation: Azure DevOps (YAML, ARM), Terraform, Jenkins, Chef, Octopus Deploy
Code Scanning: SonarQube, Checkmarx
Source Code repos: Git
Containerization: Azure Kubernetes Service, Kubernetes (open source), Docker
High-level programming languages: C# (.NET MVC and .NET Core)
Scripting: PowerShell, Bash
Database: Oracle, Microsoft SQL Server, NoSQL (e.g. CosmosDB)
Test Automation: Xamarin.UITest, SpecFlow, DevTest, Selenium, Test Data Manager, Postman, Maven, TestNG, JMeter
Operating systems: Windows, Linux
Cloud Platforms: Azure
Metrics and Monitoring: Splunk
Education and Experience
3-5 years of experience in software development and test automation required
3-5 years of web development experience strongly preferred
High School Diploma or equivalent required
Bachelor's Degree in related field or equivalent experience required
Master's Degree in related field preferred
What We Offer: Generous benefits package available on day one, including 401(k) matching, bonding leave for new parents (12 weeks, 100% paid), tuition assistance, training, GM employee auto discount, community service pay, and nine company holidays.
Our Culture: Our team members define and shape our culture - an environment that welcomes innovative ideas, fosters integrity, and creates a sense of community and belonging. Here we do more than work - we thrive.
Compensation: Competitive pay and bonus eligibility
Work-Life Balance: Flexible hybrid work environment (2 days onsite, 3 days remote per week) at our Arlington (AOC1), TX office.
Please note: we are unable to provide sponsorship for this role at this time.