Research Scientist Voice AI - EMEA

Meta
Published 27 days ago

Description

The Voice AI team in EMEA, part of Meta Superintelligence Labs, is looking for a Research Scientist (Speech and Language). The Voice AI team works on large language models (LLMs) with native support for processing, understanding, and generating audio and speech as a modality alongside others such as text or vision. As part of this, we draw on knowledge in areas like speech/audio encoders and tokenizers, pre-training, post-training, (online) reinforcement learning, LLM alignment, multimodal modelling, speech and audio processing, speech recognition (ASR), speech synthesis (TTS), and multilingual modelling. Our work is focused on advancing core technologies that drive product experiences at Meta, such as video dubbing on IG/FB or Meta AI, which is available on e.g. Ray-Ban Meta glasses or within WhatsApp.

Responsibilities

Apply relevant AI and machine learning techniques to build and advance audio and speech technologies using large language models that can be applied to a wide range of Meta production use cases
Work towards long-term ambitious research and productization goals, while identifying intermediate milestones
Work with large data, and contribute to the development of large-scale foundation models
Influence progress of relevant research communities by producing publications

Qualifications

PhD degree in Artificial Intelligence (AI), computer science, or a related technical field with 1+ years of experience, or BS degree with 3+ years of industrial research experience in the related field
AI research experience in the domains of audio and speech processing
First-author publications at peer-reviewed AI conferences (e.g. Interspeech, ICASSP, ASRU, SLT, NeurIPS, CVPR, ICML, ICLR, ICCV, ACL)
Strong skills in communicating complex research to public audiences or peers
Experience developing machine learning algorithms in e.g. Python, PyTorch, C/C++
Research experience in generative AI, especially in building and optimising large language models for areas of audio/speech processing and understanding, computer vision and/or natural language understanding beyond black-box use
Additional AI research experience in computer vision and/or NLP
Previous internship(s) and/or research assistantship(s) in an AI research organization
Industry experience working on Speech, Language, and LLM related topics, and experience applying relevant AI and machine learning techniques to build intelligent, rich speech & language systems for improving product experiences
Interest in taking new research findings in this area and implementing them towards product needs