#210643109
on, design, deployment, and operation
Automate repeated manual tasks, develop tools and automation to improve the efficiency of the platform and infrastructure.
Analyze defects, propose improvements and drive efficiencies in systems and processes.
Helps to develop new cloud engineering strategies and implementations for the firm
As part of Site Reliability, you have the responsibility of ensuring the reliability, availability, and performance of the cloud infrastructure and platform.
Demonstrates site reliability principles and practices every day and champions the adoption of site reliability throughout your team
Develop observability and telemetry tools.
Author and improve the quality of technical engineering documentation
Debug and solve issues in a production environment
Participates in SRE on-call rotations and escalation workflows.
Required qualifications, capabilities, and skills
Formal training or certification on software engineering or site reliability engineering and 5+ years applied experience
Bachelor's Degree in Computer Science or equivalent
Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform
Expertise in building solutions with AWS cloud service, knowledge in Infrastructure as Code, tools such as Terraform and fluency in at least one programming language such as Python and Java
Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.) and troubleshooting common networking technologies and issues
Ability to identify and solve problems related to complex data structures and algorithms
Drive to self-educate and evaluate new technology and ability to teach team members
Ability to expand and collaborate across different levels and stakeholder groups. Excellent communication skills working with stakeholders and domain experts across the company to design solutions to user problems
Self-disciplined, self-managed, self-motivated and strong sense of ownership, urgency and drive
Preferred qualifications, capabilities, and skills
AWS certifications will be a bonus.
ABOUT US
J.P. Morgan is a global leader in financial services, providing strategic advice and products to the world's most prominent corporations, governments, wealthy individuals and institutional investors. Our first-class business in a first-class way approach to serving clients drives everything we do. We strive to build trusted, long-term partnerships to help our clients achieve their business objectives.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
ABOUT THE TEAM
Our professionals in our Corporate Functions cover a diverse range of areas from finance and risk to human resources and marketing. Our corporate teams are an essential part of our company, ensuring that we're setting our businesses, clients, customers and employees up for success.