Site Reliability Engineer - AML Global Recommendation - USDS

TikTok

4.5

(6)

Sydney, Australia

Why you should apply for a job to TikTok:

  • 4.5/5 in overall job satisfaction
  • 4.5/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.
  • We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.
  • Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.
  • #7358126026768435466

    Position summary

    ve. This is doubly true of the teams that make TikTok possible.
    Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

    At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
    Join us.

    About the Team:
    Site Reliability Engineering (SRE) of the AML (Applied Machine Learning) team combines system engineering and the art of machine learning to develop and run a massively distributed AI/ML recommendation system for the United States and all around the world.

    On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis, and large-scale systems operation. Join us and you'll have the chance to shape the future of AML systems and make a real, tangible impact on TikTok users.

    Responsibilities:
    Design, build, and maintain highly available, scalable, and fault-tolerant systems.
    Monitor and analyze system performance, identifying and resolving issues before causing user impact.
    Develop and maintain automated monitoring, alerting, and incident response systems.
    Collaborate closely with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind.
    Implement and maintain security best practices and ensure compliance with regulatory requirements.
    Participate in on-call rotations and respond to issues and incidents within and outside of normal business hours.
    Conduct root cause analysis of incidents, hold post-mortem reviews with stakeholders, and implement preventative measures to minimize the risk of similar incidents occurring in the future.

    Qualifications

    Basic:
    Expertise in analyzing and troubleshooting Linux-based distributed systems.
    Bachelor's/Master's degree in Computer Science, Computer Engineering, or equivalent years of experience in a SRE or software engineering role.
    Experience programming with at least one commonly used language (C, C++, Python, Go).
    Strong understanding of data structures and algorithms.
    Competent knowledge of relational database systems.

    Preferred:
    Ability to design and maintain large-scale systems.
    Strong understanding of code optimization and routine task automation.
    Proficiency in at least one machine learning framework: TensorFlow, PyTorch, MXNet or PaddlePaddle

    TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

    This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.

    In the spirit of reconciliation, TikTok acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their Elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today.

    #LI-DS4

    Why you should apply for a job to TikTok:

  • 4.5/5 in overall job satisfaction
  • 4.5/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.
  • We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.
  • Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.