, large-scale distributed infrastructure. This means you will be deeply involved in the developmental lifecycle of critical software services, collaborating closely with product engineers to combine software code and systems knowledge to ensure that TikTok Shop's services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also be leveraging your software engineering expertise to develop software platforms and tools to optimise the operational and engineering efficiencies of complex systems at scale, with particular focus on improving the systems' observability, performance and maintainability.
Responsibilities:
- Provide component stability solutions tailored to real-world business scenarios for TikTok Shop, based on collaboration mechanisms across teams, time zones, and regions.
- Continuously build component metadata and observability capabilities, and improve multi-dimensional observability solutions.
- Develop platform-based, data visualisation, and automated monitoring processes to enhance the efficiency of component operations and maintenance for TikTok Shop platforms.
- Gain deep understanding of e-commerce business to enable risk awareness and governance of components.
- Continuously follow up on the management and optimisation of components in international e-commerce.
Qualifications
Minimum Qualifications:
- Bachelor's or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or related discipline.
- Candidate should have at least 5 years of experience in one or more programming languages (such as Java, C++, Go), or scripting experience with Shell/Python.
- Familiar with component O&M (Operations and Maintenance) processes, and knowledgeable about trends in foundational component technologies.
- Familiar with the architecture of storage and computing components such as MySQL, Redis, MQ, RocksDB, MongoDB, Kubernetes (K8s), Docker, and service mesh technologies.
- Expertise in operating, deploying, and ensuring the high availability and quality assurance of large-scale distributed systems, with a strong focus on stability and performance.
- Strong sense of responsibility, proactive team spirit, and excellent analytical and problem-solving skills.
Preferred Qualifications
- 3+ years of experience in component O&M or platform development.
- Experience in e-commerce and cloud computing.