Gen AI Audio Researcher

Department Icon Data Science Analytics & Machine Learning
149+ Applicants
Posted: 2 weeks ago
0-1 years
Bengaluru / Bangalore, Karnataka
work from office

Posted: 2 weeks ago
|
Applicants: 149+
Job Description
About Company
Similar Jobs
Please verify your account first! Send OTP

Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs

Job Description

We are looking for a Gen AI Researcher for Audio to join our team and help develop next-generation voice synthesis models. Youll research and build deep learning systems that can generate expressive, natural-sounding speech from text or audio prompts, and collaborate with cross-functional teams to integrate your work into production-ready pipelines.
Key Responsibilities
  • Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech).
  • Build and fine-tune models using frameworks like PyTorch and HuggingFace.
  • Design training pipelines and datasets for scalable voice model training.
  • Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation.
  • Work closely with product and creative teams to ensure models meet quality and production constraints.
  • Stay on top of academic and industrial trends in speech synthesis and related fields.
Must Haves
  • Strong background in machine learning and deep learning, with focus on speech/audio.
  • Hands-on experience with TTS, voice cloning, or related voice synthesis tasks.
  • Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar.
  • Experience training models at scale and working with large audio datasets.
  • Familiarity with vocoders and transformer-based architectures.

    Looking to get Placed? Try our Placement Guarantee Plan

  • Strong problem-solving skills, ability to work autonomously in a remote-first environment.
Nice to Have
  • PhD degree in Computer Science/ Machine Learning and publications in top venues.
  • Contributions to open-source speech research or participation in relevant benchmarks.
  • Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control.
  • Experience with voice datasets or proprietary pipelines.

Skills

PythonDeep LearningMachine LearningAi

If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

About Company

DNEG is a global visual effects company that provides services to the film and television industry. They are known for their work on major blockbuster films and television shows.

Important dates & deadlines?

Application Deadline

30 Mar 26, 02:55 PM IST

Similar Jobs

View All
Loading...
Bag Logo
Jobaaj
Don't Miss out any Updates

Subscribe now for the latest job alerts
and never miss an update

Job Alert
Google hiring for Specific Roles Apply Now!
1 min ago
New Opportunity
Amazon is hiring freshers Apply Now!
5 min ago
Featured Jobs
Microsoft opening 50+ positions Apply Now!
10 min ago

Gen AI Audio Researcher

Share with