LeethubLeethub
JobsCompaniesBlog
Go to dashboard

Leethub

Curated tech jobs from FAANG and top companies worldwide.

Top Companies

  • Google Jobs
  • Meta Jobs
  • Amazon Jobs
  • Apple Jobs
  • Netflix Jobs
  • All Companies →

Job Categories

  • Software Engineering
  • Data, AI & Machine Learning
  • Product Management
  • Design & User Experience
  • Operations & Strategy
  • Remote Jobs
  • All Categories →

Browse by Type

  • Remote Jobs
  • Hybrid Jobs
  • Senior Positions
  • Entry Level
  • All Jobs →

Resources

  • Google Interview Guide
  • Salary Guide 2025
  • Salary Negotiation
  • LeetCode Study Plan
  • All Articles →

Company

  • Dashboard
  • Privacy Policy
  • Contact Us
© 2026 Leethub LLC. All rights reserved.
Home›Jobs›Tavus›AI Researcher (Multimodal Audio/Video Generation)
Tavus

About Tavus

Transforming video marketing with AI cloning technology

🏢 Tech👥 21-100 employees📅 Founded 2020📍 SoMa, San Francisco, CA💰 $24.3m
B2BArtificial IntelligenceMarketingAugmented RealitySaaSVideo

Key Highlights

  • Raised $24.3 million in seed funding
  • Headquartered in SoMa, San Francisco, CA
  • Scalable AI video cloning platform for personalized content
  • Remote-friendly with teams across the globe

Tavus, headquartered in SoMa, San Francisco, CA, is an AI video cloning platform that enables businesses to create personalized videos for marketing, outreach, and recruiting. With $24.3 million in funding, Tavus leverages synthetic media technology to produce thousands of customizable videos quickl...

🎁 Benefits

Tavus offers unlimited paid time off, comprehensive medical, dental, and vision coverage with 100% of premiums covered, and a yearly stipend for learn...

🌟 Culture

Tavus fosters a progressive, open-minded meritocracy where debate and feedback are integral to the culture. The company emphasizes innovation in synth...

🌐 Website💼 LinkedInAll 15 jobs →
Tavus

AI Researcher (Multimodal Audio/Video Generation)

Tavus • San Francisco

Posted 2 months ago🏛️ On-SiteAi research engineer📍 San francisco
Apply Now →

Job Description

About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a therapist anyone can afford. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series A company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role
We’re looking for an AI Researcher to join our core AI team and push forward the science of audio-visual avatar generation. If you thrive in high-speed startup environments, enjoy experimenting with generative models, and love seeing your research ship into production then you’ll feel right at home.

Your Mission 🚀

  • Research and develop audio-visual generation models for conversational agents (e.g. Neural Avatars, Talking-Heads).

  • Focus on models that are tightly coupled with conversation flow, ensuring verbal and non-verbal signals work seamlessly together.

  • Experiment with diffusion models (DDPMs, LDMs, etc.), long-video generation, and audio generation.

  • Collaborate with the Applied ML team to bring your research into real-world production.

  • Stay ahead of the latest advancements in multimodal generation — and help shape the next wave.

You’ll Be Great At This If You Have:

  • A PhD (or near completion) in a relevant field, or equivalent hands-on research experience.

  • Experience applying image/video generation models in practice.

  • Strong foundations in generative modeling and rapid prototyping.

  • Deep familiarity with diffusion models, including recent advances in efficiency.

  • Good understanding of video-language models and multimodal generation.

  • Proficiency in PyTorch and GPU-based inference.

Nice-to-Haves

  • Experience with long-video or audio generation.

  • Skills in 3D graphics, Gaussian splatting, or large-scale training setups.

  • Broader exposure to generative models and rendering.

  • Familiarity with software engineering best practices.

  • Publications in top-tier or respected venues (CVPR, NeurIPS, BMVC, ICASSP, etc.).

Location
Preferred: San Francisco (hybrid) or London (office opening soon). Remote within U.S. or Europe available for exceptional candidates.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Tavus.

Apply Now →Get Job Alerts