Content Understanding Multimodal Model Algorithm Engineer-Global E-commerce

TikTok

4.5

(6)

Singapore

Why you should apply for a job to TikTok:

4.5/5 in overall job satisfaction

4.5/5 in supportive management

100% say women are treated fairly and equally to men

100% would recommend this company to other women

100% say the CEO supports gender diversity

Ratings are based on anonymous reviews by Fairygodboss members.

Employee well-being is supported via hybrid work, short-term counseling through our EAP and a premium subscription to Headspace.

We embrace diversity across all dimensions and provide employees with 9 employee resource groups globally, including our WOMEN ERG.

Comprehensive parental leave policy as well as fertility treatment through healthcare providers with a $20,000 lifetime maximum.

#7508757771219929362

Position summary

ontinue to evolve, various domains are encountering new risks and adversarial content, which pose fresh challenges for the application of foundation models. For example, existing open-source foundation models underperform in E-commerce moderation tasks involving PBR changes, long text, long sequences, multilingual content, few-shot scenarios, and AIGC-generated adversarial content. Consequently, there is an urgent need to develop foundation models specifically tailored for intelligent e-commerce moderation to improve their effectiveness and adaptability in e-commerce governance. In particular, we must explore high-quality data auto-generation, efficient MOE Embedding, Auto-prompt generation, high-quality COT output, and foundation model knowledge distillation. The model should also achieve high-accuracy autonomous decision-making and interpretable COT generation, significantly reducing misjudgments. For dynamic PBR changes, it should automatically retrieve similar moderation cases via RAG modules, decompose complex PBRs into simple atomic tasks, split rejection and exemption tasks, and auto-invoke corresponding tools, establishing an industry-leading intelligent review system that knows to reject and why. Ultimately, the large language model-based intelligent moderation system should approach or exceed human moderators' accuracy and evolve toward fully automated review.

Project Content:
Research on e-commerce intelligent moderation multimodal large language models includes, but is not limited to:
Modality fusion: Enhance fine-grained understanding of text, audio, image, video, and live-streaming data to enable high-accuracy autonomous decision-making and interpretable COT generation.
Few-shot capabilities: Address e-commerce multilingual, long-sequence, and few-shot challenges, strengthen Few-Shot/Zero-Shot capabilities, and enable complex instruction and auto-prompt generation for dynamic business rules.
Adversarial defense: Study AIGC image/video discrimination to enhance the review model's ability to defend against vague and abstract generated content.
Agent capabilities: Enable RAG module invocation, tool usage, and Auto-planning; improve the model's dynamic reasoning and reflection abilities.

Involved Research Directions:
Large language models, multimodal large language models, Few-shot learning, AIGC decision-making, AIGC data generation, reinforcement learning, Agent

Qualifications

Got doctor degree, preferably with a background in artificial intelligence, computer science, or mathematics.
Possess solid programming skills, a strong foundation in data structures and algorithms, and proficiency in using various algorithmic and engineering frameworks.
Prior publications in international conferences or journals (including but not limited to ACL, EMNLP, NeurIPS, ICML, ICLR, CVPR) are preferred.
Strong foundation in machine learning, with in-depth understanding and research experience in deep learning, reinforcement learning, NLP, or multimodal learning.
Demonstrate good communication and collaboration skills, with the ability to work closely with the team to explore new technologies and drive technical innovation.