Tech Lead, Intelligent Editing (Multimodality)

TikTok
Seattle, Washington

About the job

The Intelligent Creation Team is the AI, special effects, and audio-video creation technology team, responsible for the core technology and business development. It covers a variety of technical fields, including deep learning, computer vision, graphics, speech, recording and editing, special effects, client and server engineering, and provides cutting-edge content understanding, content creation, interactive experience, and consumption capabilities and industry solutions to other business lines within the company and external partners in various forms.

Responsibilities

Conduct cutting-edge research and development in computer vision and machine learning, especially in the areas of multi-modal understanding, vision and language, large-scale training, etc.

Transfer advanced technologies to the company's products;

Explore new products with artificial intelligence technology at its core.

Qualifications

Minimum

Masters or PhD in computer science, mathematics, engineering engineering with at least 5 years of research and practical experience in one or more areas of computer vision, including but not limited to:

- Experience in multimodal understanding, such as video highlight detection and slicing, audio/music understanding, etc.

- Experience in vision and language, such as image/video captioning, retrieval, VQA, and other related fields.

- Experience with language models and apply them in various downstream tasks, especially for intelligent editing.

- Experience in large-scale training and RLHF.

Experienced in implementing and optimizing complex and performance-critical systems.

Strong analytical and problem solving skills.

Preferred

Experience in managing or tech-leading a team in a fast-paced environment with record of shipping technologies to products.

Preferring candidates with publications in venues such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML or ACL, EMNLP, COLING, etc.