Director - Site Reliability Engineering

UKG

4.7

(113)

Multiple Locations

Why you should apply for a job to UKG:

  • Ranked as one of the Best Companies for Women in 2023
  • 4.7/5 in overall job satisfaction
  • 4.8/5 in supportive management
  • 95% say women are treated fairly and equally to men
  • 99% would recommend this company to other women
  • 94% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Paid leave for new and adoptive parents, medical coverage for IVF services & reimbursement to help offset adoption and surrogacy expenses.
  • 45% company match on total employee 401(k) contributions up to the IRS annual limit.
  • Unlimited paid time off for you to rest, re-charge, and pursue your personal aspirations.
  • #DIRSI015106

    Position summary

    high-impact leadership role within a mature, mission-critical environment. You will inherit and lead an established SRE organization responsible for a large, complex ecosystem comprising hundreds of applications across a hybrid infrastructure.

    Success in this role calls for strong systems thinking, operational leadership at scale, and the ability to influence across boundaries. You will drive consistent reliability practices across diverse technologies, modernize how reliability is delivered, and lead globally distributed teams in service of always-on, customer-critical platforms.

    Responsibilities:

    Production Reliability & Application Behavior

    • Responsible for reliability outcomes across a large, heterogeneous application portfolio, including availability, performance, scalability, and recoverability

    • Ensure applications meet defined reliability expectations as they operate on both on-prem and cloud platforms

    • Lead and participate in major incident response, acting as a senior escalation point and ensuring effective executive communication

    • Drive post-incident learning and systemic improvements to reduce repeat issues

    Platform-Facing SRE Execution

    • Lead teams responsible for understanding how applications behave in production, including runtime performance, resource utilization, and failure modes

    • Partner with Infrastructure, Cloud, Security, and Product Engineering teams to address cross-layer reliability concerns

    • Establish standards for operational readiness, release safety, capacity planning, and disaster recovery across platforms

    SRE Practice Consistency at Scale

    • Apply Site Reliability Engineering principles pragmatically across both legacy and cloud-native systems, including:

    • SLOs and reliability targets

    • Error budgets and risk-based decision-making

    • Toil identification and reduction

    • Automation and self-healing where appropriate

    • Observability to support incident response, performance analysis, and capacity management

    • Ensure SRE practices are consistent in intent but adapted in implementation across different technologies and environments

    People Leadership & Organizational Health

    • Lead and develop SRE managers and engineers across a global organization

    • Inherit existing teams and improve clarity of ownership, execution discipline, and engagement

    • Hire and develop senior SRE leaders capable of operating across both cloud and enterprise platforms

    Strategy, Planning & Influence

    • Translate business priorities into reliability-focused technical initiatives

    • Partner with senior Product and Engineering leadership to balance delivery velocity, reliability, and operational risk

    • Own and execute against a portion of the SRE roadmap, ensuring transparency, prioritization, and measurable outcomes

    • Advocate for reliability improvements using data, production insight, and operational experience

    Qualifications

    Required Qualifications

    • 10+ years of experience in software engineering, systems engineering, SRE, or related disciplines

    • Proven experience leading established, globally distributed engineering organizations

    • Strong understanding of production systems and application behavior at scale

    • Experience operating and leading teams across hybrid environments (on-prem and public cloud)

    • Demonstrated ability to influence outcomes in a matrixed enterprise environment

    • Experience owning incident response, operational reviews, and executive-level communication

    • Excellent communication skills, with the ability to clearly articulate technical and operational concepts to varied audiences

    Preferred Qualifications

    • Experience supporting large-scale application portfolios across both Windows/.NET and cloud-native environments

    • Familiarity with Google Cloud Platform and enterprise-scale cloud operations

    • Strong understanding of observability practices across application, platform, and infrastructure layers

    • Prior experience partnering closely with Product, Infrastructure, and Cloud leadership

    UKG is the Workforce Operating Platform that puts workforce understanding to work. With the world's largest collection of workforce insights, and people-first AI, our ability to reveal unseen ways to build trust, amplify productivity, and empower talent, is unmatched. It's this expertise that equips our customers with the intelligence to solve any challenge in any industry - because great organizations know their workforce is their competitive edge. Learn more at ukg.com.

    Equal Opportunity Employer

    UKG is an equal opportunity employer. We evaluate qualified applicants without regard to race, color, disability, religion, sex, age, national origin, veteran status, genetic information, and other legally protected categories. View The EEO Know Your Rights poster UKG participates in E-Verify. View the E-Verify posters here.

    It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

    Disability Accommodation in the Application and Interview Process

    For individuals with disabilities that need additional assistance at any point in the application and interview process, please email [email protected].

    The pay range for this position is $179,800.00 to $258,500.00. The actual base pay offered may vary depending on skills, experience, job-related knowledge and work location. In addition to base pay, employees may be eligible to participate in a performance-based bonus plan and to receive restricted stock unit awards as part of total compensation. Learn more about UKG's benefits and rewards at https://https://www.ukg.com/about-us/careers/benefits

    Why you should apply for a job to UKG:

  • Ranked as one of the Best Companies for Women in 2023
  • 4.7/5 in overall job satisfaction
  • 4.8/5 in supportive management
  • 95% say women are treated fairly and equally to men
  • 99% would recommend this company to other women
  • 94% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Paid leave for new and adoptive parents, medical coverage for IVF services & reimbursement to help offset adoption and surrogacy expenses.
  • 45% company match on total employee 401(k) contributions up to the IRS annual limit.
  • Unlimited paid time off for you to rest, re-charge, and pursue your personal aspirations.