At Tavus, we're building the human layer of AI. Our mission is to make human-AI interaction as natural as face-to-face interaction, enabling the human touch where it has been previously unscalable. We achieve this through pioneering research in multi-modal AI models for human perception and understanding, combined with state-of-the-art human avatar rendering and communication models. Our models power everything from text-to-video AI avatars to real-time conversational video experiences across industries like healthcare, recruiting, sales, education, and more. By enabling AI to see, hear, and communicate with human-like authenticity, we're creating the foundation for the next generation of AI employees, assistants, and companions.
We're a Series A company backed by top investors, including Sequoia, Y Combinator, and Scale VC. Join us in driving the future of human-AI interaction.
At Tavus, we're building the human layer of AI. Our mission is to make human-AI interaction as natural as face-to-face interaction, enabling the human touch where it has been previously unscalable. We achieve this through pioneering research in multi-modal AI models for human perception and understanding, combined with state-of-the-art human avatar rendering and communication models. Our models power everything from text-to-video AI avatars to real-time conversational video experiences across industries like healthcare, recruiting, sales, education, and more. By enabling AI to see, hear, and communicate with human-like authenticity, we're creating the foundation for the next generation of AI employees, assistants, and companions.
We're a Series A company backed by top investors, including Sequoia, Y Combinator, and Scale VC. Join us in driving the future of human-AI interaction. Check it out for yourself 😎
We’re looking for a Perception Engineer to help advance the core visual understanding systems behind Tavus’ AI-generated video experiences. In this role, you’ll work on foundational models and systems that enable our avatars to "see" and interpret the world - from facial dynamics and motion tracking to scene understanding and multi-modal perception.
You’ll join a small, fast-moving applied ML team where experimentation is encouraged, and ownership is expected. We’re not just iterating - we’re inventing. If you’re excited about solving real-world computer vision problems and shipping production-ready models that power next-gen human-AI interaction, we want to talk.
_To learn more about our team culture, and benefits, check out _our hiring page!
Tavus is growing fast, and we’d like you to grow with us! Are you excited to get your hands dirty? Drop your resume and we’ll be in touch!
We are not looking for cultural fits, we are looking for culture creators. In fact, diversity is what drives our success – it’s at the core of how we hire, communicate, and work. We are inclusive to all and combine our diverse backgrounds, skill sets, and thinking to build the best experiences for our clients.
Python for our ML stack React/NodeJS for our customer platform
We're exploring new boundaries in voice synthesis, deep-tech and audio engineering.
Full-time
$102K–$118K
San Francisco, California
Other opportunities you might be interested in