#JR010076
whether it's public, multi or hybrid cloud, or mainframe. And because we span across all mission-critical platforms, we can meet you wherever you are in your digital transformation journey, with 24/7 support when you need it. We are your relentless ally, flexing with you when challenges emerge so you don't feel stuck in place. With cross-platform certifications and decades of experience, our technology experts have become an extension of your team so you're continuously innovating - doing more with less while remaining secure. And that's just the beginning.
About Role
Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed services clients. Ensono has invested time to create templated cloud-native solutions to provide value to our clients. They have loved what we've done so far and want us to operate these applications in production on their behalf.
In response to this demand, Ensono is applying Site Reliability Engineering principles to disrupt the traditional Managed Services approach and deliver something that empowers our customers and turns technology into an efficiency, growth and innovation multiplier. The successful candidate will be reporting into the Head of SRE and will start supporting our clients immediately. New projects are in the pipeline, so you will also be working with our pre-sales and delivery teams to ensure operations are considered long before handover.
We are just starting on our journey to Site Reliability Engineering, so we are eager to continue to learn from industry leaders and your experiences in delivering Site Reliability Engineering to build a sustainable workplace that delivers a service which will delight our customers.
What you will be doing: As a Site Reliability Engineer, your overarching responsibility is to ensure we meet our clients' Service Level Objectives, and we respond to incidents in a timely and professional manner.
Responsibilities:
Monitoring our client's services using modern tools and SRE practices.
Responding to incidents originating from 2nd line support within the times set out in the SLA (being on-call).
Performing and assisting in root cause analysis and blameless post-mortems to enable incidents to be understood and avoided in the future.
Improving the testing and release procedure.
Planning for and making changes to capacity to balance the demand vs. cost saving equation better.
Undertaking improvements to the infrastructure and product.
Making changes to client's services based upon operational or business needs.
Advising and supporting the further development of Ensono Intellectual Property to ensure future projects benefit from what we learn.
Experience level - 5 to 8 yrs
Technical Key Skills (Mandatory Skills)
A comprehensive understanding of Site Reliability Engineering
Experience working with a cloud service provider (ideally Azure or AWS)
Strong examples of implementing automation/solutions by code (preferably Python, C#, Java, or Go, any other language)
Commercial experience working with compute technologies (such as Kubernetes (EKS), or Serverless)
Designed, implemented, and/or supported solutions in a production environment
Strong interpersonal and communication skills to work in a fast-paced and rapidly changing dynamic environment
Good to have skills
Experience with CI/CD pipeline tools (such as Azure DevOps, GitHub Actions, Gitlab CI)
Experience with monitoring, logging tools (such as Azure Monitor, CloudWatch or Prometheus)
Experience with ITSM tools (such as ServiceNow, OpsGenie, or PagerDuty)
Working with an Infrastructure as Code tool (Terraform, ARM, CloudFormation or Deployment Manager)
Excellent troubleshooting skills that span systems, networks (TCP/IP), and code
Expert knowledge of Linux internals and tuning
Shift Timings
Should be comfortable with any shift timings
JR010076