Speech Scientist Intern

Zoom

What you can expect

We are looking for a Research Scientist Intern with a solid background in speech recognition, speech synthesis, and speech processing. On this team, you will develop state-of-the-art speech understanding models on large-scale datasets for Zoom products, converting speech signals into human- and LLM-readable text to enable Zoom's vision of conversation to task completion. You will also collaborate with cross-functional teams, including product and science engineering teams, to deliver high-impact projects.

About the team

The Zoom AI Speech Team develops speech technologies to improve Zoom's conversational AI experience across Zoom AI Companion, Zoom Meetings, Zoom Contact Center, Zoom Phone, and Zoom Revenue Accelerator. As a Research Scientist Intern, you will develop novel solutions in automatic speech recognition (ASR), text-to-speech (TTS), voice agents, speech-to-speech translation, and speech LLMs to deliver a unique AI-powered collaboration platform that converts conversations into actionable tasks for users across the globe.

Responsibilities

  • Developing state-of-the-art speech understanding models on large-scale datasets for Zoom products, including ASR, TTS, voice agents, speech-to-speech translation, and speech LLMs.
  • Devising novel techniques where off-the-shelf solutions are not available.
  • Demonstrating technical judgment in model prototyping, training, optimization, and evaluation.
  • Collaborating with cross-functional teams, including product and science engineering teams, to deliver high-impact projects.
  • Contributing to research publications and technical presentations.

What we're looking for

  • Be currently pursuing a PhD in Computer Science, Electrical Engineering, or a related field.
  • Display knowledge of deep learning and hands-on programming skills in Python and shell scripting; have familiarity with ML frameworks such as PyTorch and TensorFlow.
  • Demonstrate experience in speech recognition, speech synthesis, speech processing, natural language processing or related fields in academic research.
  • Have domain expertise in one or more of the following areas: modern end-to-end ASR architectures, TTS and voice cloning, voice agents and conversational AI, speech-to-speech translation, speech LLMs, language modeling, decoding algorithms, personalization and adaptation, semi-/self-supervised learning, multilingual and robust systems, LLM-integrative speech models.
  • Experience with speech toolkits and libraries such as Kaldi/k2, ESPNet, NeMo, TorchAudio, SpeechBrain, or similar frameworks is a plus.
  • Have experience with large-scale data processing and model training.
  • Demonstrate strong collaboration and communication skills.

Salary Range or On Target Earnings:

Minimum: $66.50

Maximum: $106.50

In addition to the base salary and/or OTE listed, Zoom has a Total Direct Compensation philosophy that takes into consideration base salary, bonus, and equity value.

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location-based compensation structure; there may be a different range for candidates in this and other locations.

Ways of Working
Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role (Hybrid, Remote, or In-Person) is indicated in the job description/posting.

Benefits
As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.

About Us
Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars.
We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Find room to grow with opportunities to stretch your skills and advance your career in a collaborative, growth-focused environment.


Our Commitment

At Zoom, we believe great work happens when people feel supported and empowered. We’re committed to fair hiring practices that ensure every candidate is evaluated based on skills, experience, and potential. If you require an accommodation during the hiring process, let us know—we’re here to support you at every step.

We welcome people of different backgrounds, experiences, abilities, and perspectives, including qualified applicants with arrest and conviction records and any qualified applicants requiring reasonable accommodations, in accordance with the law.

If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form and someone from our team will reach out soon. This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.

Think of this opportunity as a marathon, not a sprint! We're building a strong team at Zoom, and we're looking for talented individuals to join us for the long haul. No need to rush your application – take your time to ensure it's a good fit for your career goals. We continuously review applications, so submit yours whenever you're ready to take the next step.