Site Reliability Engineer - Global E-Commerce

TikTok

4.5

(6)

Singapore

Why you should apply for a job to TikTok:

  • 4.5/5 in overall job satisfaction
  • 4.5/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.
  • We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.
  • Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.
  • #7277759947123231031

    Position summary

    product engineers to combine software code and systems knowledge to ensure that TikTok e-Commerce's services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also be leveraging your software engineering expertise to develop software platforms and tools to optimise the operational and engineering efficiencies of complex systems at scale, with particular focus on improving the systems' observability, performance and maintainability.

    Responsibilities

    • Be responsible for service levels of mission critical, revenue-generating e-Commerce platform as well as all supporting infrastructure and services. This role will focus on service reliability, highly-scalable design, and release management in a cloud-native environment.
    • Define service level indicators and data-driven objectives, and develop SRE standards, processes and methodologies, to uphold and improve uptime, latency, and system health of a core global e-commerce production platform.
    • Collaborate cross-team with engineering and product to ensure that key stability and maintainability requirements, such as capacity planning and launch reviews, are performed to enable transparent service delivery to customers.
    • Design strategies for risk detection and mitigation, disaster recovery & simulation, release management, cost optimisation, engineering quality etc.
    • Automation geared towards infrastructure-as-code, scalability and service resiliency.
    • Implement best practices around incident management, post-mortems while being part of on-call rotations.
    • Research, design, and develop computer and network software or specialised utility programs.
    • Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
    • Update software, enhances existing software capabilities, and develops and direct software testing and validation procedures.

    Qualifications

    Minimum Qualifications

    • Bachelor's or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or related discipline.
    • Candidate should have at least 5 years of experience in Linux operating system internals, networking and microservices in cloud-native environments.
    • Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
    • Experience developing platform/tools using scripting languages such as Python/Bash.
    • Experience with implementing observability solutions such as monitoring, logging and tracing in complex service meshes.

    Preferred Qualifications

    • Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
    • Experience with running production-grade web services at scale in a cloud native environment.

    Why you should apply for a job to TikTok:

  • 4.5/5 in overall job satisfaction
  • 4.5/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.
  • We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.
  • Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.