#0013740
RE), you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and services hosted primarily on Microsoft Azure. You will collaborate closely with development teams to design, build, and maintain the observability and alerting components of our services. Experience with Azure services and multi-tenant SQL based applications will be instrumental in optimizing our cloud architecture and driving continuous improvement in our systems.
Key Responsibilities:
Help build an SRE culture by sharing best practices, approaches, documentation, and code with other engineering teams across the organization
Design, implement, and manage the alerting and monitoring strategy for Azure based services
Monitor system performance and reliability by implementing monitoring solutions and alerts to ensure proactive response to potential issues
Collaborate with development teams to optimize Azure based applications based off of our observability strategy
Approach operational issues/problems with a software development mindset through defined feedback loops within the software delivery lifecycle
Perform root cause analysis for incidents and implement preventive measures to minimize future disruptions
Stay updated with Azure technologies and best practices; recommend and implement improvements to enhance application / system performance and efficiency
Participate in on-call rotation(s) and respond to incidents as needed, ensuring timely resolution and communication
Participate in product engineering stand-ups and related design activities
Coach other team members to ensure systems are supported by following SRE best practices
Requirements:
Proven experience as a Site Reliability Engineer (SRE) or similar role, with a strong focus on Azure cloud services
Proficiency in scripting and automation using C#, PowerShell, Python, or similar languages
Strong knowledge and proven experience in alerting and monitoring in an Azure based application
Excellent problem-solving skills with a proactive approach to identifying and resolving issues
Experience writing and modifying SQL queries and generating reports
Ability to work independently and collaboratively with Development team(s) in a fast-paced environment with a focus on continuous improvement
Ability to document solutions, SRE architectural patterns, and best practices to ensure that teams have guidance as needed
Proven ability to dig through metrics, logs, and available sources to triage and resolve an incident at any time
Azure certifications such as Azure Administrator Associate or Azure Solutions Architect
Nice to Have
Experience functioning as an SRE in maintaining reliability of the applications and infrastructure
Proficient in infrastructure as code practices
Experience building CI/CD pipelines from scratch
Able to troubleshoot complicated, cross-platform issues by handling OS, Networking, Database, and applications in cloud-based and on-premises environments
About CCC's Commitment to Employees:
CCC Intelligent Solutions understands that our employees play an integral role in our vision to shape a world where life just works. Our team is defined by our values of Integrity, Customer-Focus, Innovation, Inclusion & Diversity, Tenacity, and Connection. Through diverse perspectives, purposeful innovation, and the strength of connections, our technologies empower the people and industry relied upon to keep lives moving forward when it matters most.
At CCC, together everyone can thrive as we innovate and collaborate, creating employee experiences that just work. We are committed to providing opportunities for our people to make real-life impacts, advance in their careers, and contribute to CCC's success.
CCC offers competitive compensation and benefits to support you and your families, including:
401K Match
Paid time off
Annual Incentive Plan Performance Bonus
Comprehensive health insurance
Adoption Assistance
Tuition Reimbursement
Wellness Programs
Stock Purchase Plan options
Employee Resource Groups
For more information about our benefits, please check out our careers site.
Here, you belong. You are seen, valued, and respected. We celebrate you for who you are and all you bring. Every voice is heard and is important to our success. You can hear what employees have to say about our culture here
If you require reasonable accommodation to complete a job application, pre-employment testing, or a job interview or to otherwise participate in the hiring process, please contact (800) 621-8070.