Tavus capabilities
12 mapped capabilities, each graded and dated. The map shows what Tavus can do; the audit shows whether it’s worth consolidating — and a guide shows how to move.
Capabilities
CVI embedding — React component library, iframe, and Daily SDK
provisionalverified yesterdayFront-end options for dropping a live conversational video agent into a web app.
Conversational Video Interface (CVI) — real-time AI video agents
provisionalverified yesterdayAn API-first framework for building real-time, face-to-face conversational video agents: a lifelike digital human that sees the user, listens, and responds in a live two-way video call.
Cross-session memory
provisionalverified yesterdayLets an AI persona remember context about a user across separate conversations and devices, so interactions feel continuous and personal.
Knowledge Base (RAG grounding)
provisionalverified yesterdayGround a conversational persona in your own documents so the AI human answers from proprietary, domain-specific content via retrieval-augmented generation.
Phoenix real-time human rendering
provisionalverified yesterdayThe rendering model that synthesizes a photorealistic digital human face in real time, including lip-sync, micro-expressions, and natural listening behavior.
Plans and pricing
provisionalverified yesterdayTavus's developer (API) pricing tiers for conversational video and video generation, plus the consumer PALs tiers.
Pluggable LLM layer and tool / function calling
provisionalverified yesterdayChoose the language model that drives a conversational persona — Tavus-hosted models or your own LLM — and let it call external tools/functions mid-conversation.
Raven multimodal perception (emotion + ambient awareness)
provisionalverified yesterdayThe perception layer that lets the AI human see and interpret the user — facial expressions, gaze, emotion, posture, surroundings, and shared screen — and feeds that context to the LLM in real time.
Replica creation — custom AI digital twin and stock replicas
provisionalverified yesterdayCreate a photorealistic AI avatar (replica) of a real person that can speak, listen, and respond, or use a ready-made stock replica.
Sparrow conversational turn-taking and interruption handling
provisionalverified yesterdayThe dialogue-timing model that manages natural conversation flow — when the agent should speak, pause, or yield when the user interrupts — so the back-and-forth feels human.
Transparent / green-screen and custom backgrounds
provisionalverified yesterdayRender a replica with a transparent or green-screen background, or composite it over a website or supplied video, so the AI human can be overlaid on slides, demos, or branded scenes.