#84566-en-us-1
non-functional requirements including SLI/SLOs. Validating requirements with Business Stakeholders
Manage SLIs/SLOs for customer-facing interfaces and backend services, and provide improvement plans for non-compliance
Develop custom dashboards in New Relic to represent a holistic view of system operational health
Improve reliability, quality, and time-to-market of our suite of software solutions
Support release engineering by providing automation and by pushing changes to production when manual intervention is needed
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Provide primary operational support and engineering for multiple large distributed software applications
Daily and Monthly Responsibilities
Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
Partner with engineering teams to improve services through rigorous testing and release procedures
Participate in system design consulting, platform management, and capacity planning
Model areas of risk to estimate latency characteristics and capacity requirements. Typically, this means either refining the workload and modelling how it applies to a set of components, or working with component suppliers to estimate capacity requirements.
Create sustainable systems and services through automation and uplifts
Balance feature development speed and reliability with well-defined service level objectives
Skills, Experience and Requirements
Bachelor's degree in computer science or another highly technical or scientific discipline
8-12 years of overall experience
Ability to program (structured and OO) in one or more high-level languages or frameworks, such as Go, Python, JavaScript, and React Native
Experience with AWS cloud services such as EC2, S3, CloudFront, and EKS, as well as dynamic resource management frameworks (Kubernetes)
Experience with at least one application performance management tool (preferably New Relic), the EFK stack, and log analysis
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
Ability to drive a collaborative approach across business functions and with external partners
Benefits