As a Senior Site Reliability Operations Engineer within the Global Technology Platforms (GTP) CCC team, you will work with other CCC, TDO, SRE, DevOps and Engineering practitioners to proactively maintain the mission-critical infrastructure, cloud platforms, microservices, tools, and processes that ensure the highest levels of availability and reliability across our Global Technology platforms.
You're right for the job if you are comfortable leading our major incident response as part of a technical team of engineers laser-focused on restoring service across complex distributed systems. You'll excel if you have enthusiasm for digging deep and a flair for sharp technical communication, prioritization and organization. You will work directly with our SRE, Engineering and DevOps teams to support our next-generation “always up” cloud-based e-commerce platforms.
The CCC Senior Site Reliability Operations Engineer is responsible for proactively monitoring, detecting and resolving site issues before they affect customers or availability. Technically, you will understand the full end-to-end stack and use this knowledge to detect errors and failures and take corrective action to mitigate them. During a major incident, you will draw on your technical skills and knowledge to triage and troubleshoot, differentiating between symptom and cause, to help resolve impacting issues. Your ability to continuously challenge yourself and develop a strong network within your peer group will see you excel in this role. Our goal is to protect the customer experience and deliver outstanding levels of availability.
To do so, you will need strong skills in the following areas:
Additional responsibilities may include:
Bachelor’s degree in Computer Science and 3 years’ experience in software engineering or related field OR 5 years’ experience in software engineering or related field.
600 WEST CALIFORNIA AVENUE, SUNNYVALE, CA 94086-2486, United States of America