Create Your Account
Sign up for free at adai.tv/register. You receive 50 credits instantly — no credit card required. Set your username, password, and optionally configure your vision profile for a tailored experience.
Search our knowledge base or for instant assistance.
ADAI automatically generates professional audio descriptions for video — narrating actions, characters, and on-screen text so people who are blind or have low vision can fully experience your content. Upload a video, and AI does the rest.
AI-Powered
Multi-agent scene analysis
18+ Languages
Native multilingual output
WCAG 2.1 AA
ADA & Section 508 compliant
Broadcast-Ready
BWF/WAV audio export
Multiple Voice Tiers
Premium, Generative, Basic
Live Voice
Real-time visual AI assistant
Team Collaboration
Organizations & shared credits
ADAI (Audio Description AI) is an AI-powered platform that automatically generates professional audio descriptions for video content. Audio descriptions narrate the visual elements of a video — actions, settings, characters, facial expressions, on-screen text, and scene transitions — so that people who are blind or have low vision can fully experience the content independently.
ADAI is the first platform to fully automate the audio description workflow in a single agentic pipeline. You upload a video, and ADAI analyzes it frame by frame using multimodal AI (powered by Google Vertex AI and Gemini), detects scenes, extracts dialogue transcripts, reads on-screen text via OCR, identifies characters and actions, generates context-aware narration scripts, and synthesizes natural speech using Google Cloud Text-to-Speech — all without manual scripting, voice talent, or studio time.
The platform supports 18+ languages including English, Spanish, French, German, Portuguese, Italian, Japanese, Korean, Mandarin Chinese, Hindi, Arabic, Dutch, Polish, Swedish, Turkish, Vietnamese, Thai, and Indonesian. Multiple AI voice options are available across three quality tiers: Premium (highest fidelity, natural intonation), Generative (AI-styled with customizable style prompts), and Basic (clear and reliable for drafts).
Three processing modes let you match the output to your content type: Standard fits descriptions into natural pauses in dialogue, Extended pauses the video when needed for complete coverage, and Hybrid (recommended) intelligently combines both — only pausing when strictly necessary. After processing, a built-in scene editor lets you fine-tune individual descriptions, adjust timing, create snapshots, and collaborate with your team through threaded comments.
ADAI helps organizations meet accessibility requirements including WCAG 2.1 AA (Guideline 1.2.5 — Audio Description for Prerecorded Video), ADA (Americans with Disabilities Act), Section 508 (U.S. Rehabilitation Act), and FCC CVAA (21st Century Communications and Video Accessibility Act). All AI-generated content includes latent disclosure metadata as required by the California AI Transparency Act (SB 942).
Output can be exported as a complete video with embedded audio descriptions, a separate Broadcast Wave Format (BWF) audio track for professional post-production workflows, or streamed directly from ADAI's accessible video player with HLS and DASH adaptive streaming. Videos can be shared via unique links, embedded on external websites using the ADAI player or OEmbed, or published to the community showcase.
Teams can collaborate through organizations with shared credit pools, role-based access (Owner, Admin, Member), shared pronunciation lexicons, and team video sharing. A credit-based billing model charges 7 credits per minute of input video with no subscription required — credits never expire. New users receive 50 free credits (about 7 minutes of video) with no credit card required.
Additional capabilities include Live Voice, a real-time visual AI assistant powered by Gemini that describes what your camera sees and answers questions via voice conversation; a pronunciation lexicon with AI-suggested pronunciations for brand names and technical terms; a scene review system with a 7-dimension quality rubric; and a comprehensive support system including AI voice assistant, support tickets, and direct contact.
Go from upload to accessible video in four steps
Sign up for free at adai.tv/register. You receive 50 credits instantly — no credit card required. Set your username, password, and optionally configure your vision profile for a tailored experience.
Drag and drop your MP4, MOV, AVI, or MKV file (up to 2 GB / 45 min). Choose a processing mode, select a voice and language from 18+ options, and attach any custom pronunciations from your lexicon.
ADAI's multi-agent AI pipeline analyzes your video frame by frame — detecting scenes, extracting dialogue, reading on-screen text, identifying characters, and generating natural audio descriptions. Track progress in real time from your dashboard.
Watch the result in the built-in accessible player, or open the scene editor to fine-tune descriptions. Export the final video, download the separate BWF audio track, generate share links, or embed the player on your website.