ume.
Candidates may apply to a maximum of two positions and will be considered for jobs in the order in which they apply. The application limit applies to TikTok and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.
Responsibilities
- Optimize model performance and memory efficiency on GPU-based systems.
- Collaborate with research and infra teams to deploy high-throughput training and inference pipelines.
- Develop tools and libraries to accelerate deep learning workloads at scale.
- Analyze system performance (e.g., GPU profiling, kernel analysis, throughput tuning).
Qualifications
Minimum Qualifications:
- Final-year graduate with a background in Computer Science, Electrical Engineering, or a related field.
- Solid programming skills in C++/CUDA/Triton/Python.
- Familiarity with GPU architecture and distributed training.
Preferred Qualifications:
- Experience building production-grade training and inference systems for large-scale models.
- Hands-on experience optimizing Large Language Models (LLMs), including memory efficiency, latency, and throughput improvements.
- Knowledge of distributed training frameworks (e.g., NCCL, Horovod, DeepSpeed, FSDP) is a plus.
- Familiarity with deep learning compiler frameworks such as TVM or LLVM, and understanding of their underlying principles.
- Contributions to open-source projects or relevant research publications.
By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://careers.tiktok.com/legal/privacy
If you have any questions, please reach out to us at [email protected]