Tech Lead (SRE) - Cloud Infrastructure

TikTok

4.5

(6)

Singapore

Why you should apply for a job to TikTok:

  • 4.5/5 in overall job satisfaction
  • 4.5/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.
  • We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.
  • Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.
  • #A197588

    Position summary

    , and to grow as one team. Status quo? Never. Courage? Always.
    At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve.
    Join us.

    Team Introduction
    The Site Reliability Engineering (SRE) team is a fusion of software and systems engineering techniques used to design and operate large-scale, extensively distributed, and resilient systems. Within Infrastructure SRE at TikTok, our primary focus is to ensure that the reliability and uptime of our infrastructure services meet the needs of our users and support rapid improvement iterations. Our software development efforts are deeply committed to optimising existing systems, constructing essential infrastructure, and streamlining operations through automation.

    The Role
    In the role of a Tech Lead, you will assume responsibility for guiding and assembling a team of software and system engineers, leveraging your exceptional technical leadership skills. Your role will involve establishing efficient processes for project execution and promoting sound engineering practices. Additionally, you will maintain regular coordination and communication with other infrastructure teams and our user community.

    What you will be doing:

    1. Establish and oversee the SRE team, which encompasses tasks such as team recruitment, the training of new talent, system operation and maintenance, coordination efforts, and fostering a cohesive team culture;
    2. Oversee the acquisition and development of software systems in organisational units. Establish a comprehensive long-term technical strategy with well-defined implementation steps and milestones to continually enhance the team's competitiveness and technological capabilities;
    3. Oversee the development of Proof-of-Concept/solutions and provide technical expertise on the development of software and platform features, ensuring that appropriate security and risk factors are considered;
    4. Create protocols and strategies for critical aspects of the operating platform, including access management, configuration, disaster recovery, and fault handling;
    5. Devise and implement software platforms and monitoring frameworks that promote efficient, automated, and intelligent governance within a service-oriented architecture (SOA);
    6. Collaborate closely with the system development team to guarantee the reliability of systems from initial design through to launch. Consistently advance automated operations and maintenance facilities and platforms;
    7. Foster improved communication and collaboration with business teams, enhance cross-team coordination, and persistently refine and optimize business processes. Drive the evolution of business architecture design.

    Qualifications

    What you should have:

    • At least a Bachelor's Degree in Computer Science or a closely related technical field, along with more than 5 years of professional experience (including at least 3 years in Research and Development);
    • Demonstrates a systematic approach to operations and maintenance, with proficiency in Linux systems and networking. Brings practical expertise in managing and maintaining large-scale distributed systems;
    • Self-motivated with strong planning and summarisation skills. Possesses a track record of project and team management;
    • Exhibits a high level of responsibility, a proactive team-oriented attitude, and exceptional problem-solving abilities;
    • Prior experience with extensive cloud-computing platforms is a plus.
    • Preferred qualifications include experience in the development of large-scale distributed storage, scheduling, big data computing systems, or intelligent operations and maintenance.

    ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

    Why you should apply for a job to TikTok:

  • 4.5/5 in overall job satisfaction
  • 4.5/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.
  • We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.
  • Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.