Skip to Help Content
AI Support Available

How can we help you today?

Search our knowledge base or for instant assistance.

What is ADAI?

ADAI automatically generates professional audio descriptions for video — narrating actions, characters, and on-screen text so people who are blind or have low vision can fully experience your content. Upload a video, and AI does the rest.

AI-Powered

Multi-agent scene analysis

18+ Languages

Native multilingual output

WCAG 2.1 AA

ADA & Section 508 compliant

Broadcast-Ready

BWF/WAV audio export

Multiple Voice Tiers

Premium, Generative, Basic

Live Voice

Real-time visual AI assistant

Team Collaboration

Organizations & shared credits

ADAI Platform Details

ADAI (Audio Description AI) is an AI-powered platform that automatically generates professional audio descriptions for video content. Audio descriptions narrate the visual elements of a video — actions, settings, characters, facial expressions, on-screen text, and scene transitions — so that people who are blind or have low vision can fully experience the content independently.

ADAI is the first platform to fully automate the audio description workflow in a single agentic pipeline. You upload a video, and ADAI analyzes it frame by frame using multimodal AI (powered by Google Vertex AI and Gemini), detects scenes, extracts dialogue transcripts, reads on-screen text via OCR, identifies characters and actions, generates context-aware narration scripts, and synthesizes natural speech using Google Cloud Text-to-Speech — all without manual scripting, voice talent, or studio time.

The platform supports 18+ languages including English, Spanish, French, German, Portuguese, Italian, Japanese, Korean, Mandarin Chinese, Hindi, Arabic, Dutch, Polish, Swedish, Turkish, Vietnamese, Thai, and Indonesian. Multiple AI voice options are available across three quality tiers: Premium (highest fidelity, natural intonation), Generative (AI-styled with customizable style prompts), and Basic (clear and reliable for drafts).

Three processing modes let you match the output to your content type: Standard fits descriptions into natural pauses in dialogue, Extended pauses the video when needed for complete coverage, and Hybrid (recommended) intelligently combines both — only pausing when strictly necessary. After processing, a built-in scene editor lets you fine-tune individual descriptions, adjust timing, create snapshots, and collaborate with your team through threaded comments.

ADAI helps organizations meet accessibility requirements including WCAG 2.1 AA (Guideline 1.2.5 — Audio Description for Prerecorded Video), ADA (Americans with Disabilities Act), Section 508 (U.S. Rehabilitation Act), and FCC CVAA (21st Century Communications and Video Accessibility Act). All AI-generated content includes latent disclosure metadata as required by the California AI Transparency Act (SB 942).

Output can be exported as a complete video with embedded audio descriptions, a separate Broadcast Wave Format (BWF) audio track for professional post-production workflows, or streamed directly from ADAI's accessible video player with HLS and DASH adaptive streaming. Videos can be shared via unique links, embedded on external websites using the ADAI player or OEmbed, or published to the community showcase.

Teams can collaborate through organizations with shared credit pools, role-based access (Owner, Admin, Member), shared pronunciation lexicons, and team video sharing. A credit-based billing model charges 7 credits per minute of input video with no subscription required — credits never expire. New users receive 50 free credits (about 7 minutes of video) with no credit card required.

Additional capabilities include Live Voice, a real-time visual AI assistant powered by Gemini that describes what your camera sees and answers questions via voice conversation; a pronunciation lexicon with AI-suggested pronunciations for brand names and technical terms; a scene review system with a 7-dimension quality rubric; and a comprehensive support system including AI voice assistant, support tickets, and direct contact.

Quick Start Guide

Go from upload to accessible video in four steps

1

Create Your Account

Sign up for free at adai.tv/register. You receive 50 credits instantly — no credit card required. Set your username, password, and optionally configure your vision profile for a tailored experience.

Sign Up Free
2

Upload Your Video

Drag and drop your MP4, MOV, AVI, or MKV file (up to 2 GB / 45 min). Choose a processing mode, select a voice and language from 18+ options, and attach any custom pronunciations from your lexicon.

Go to Upload
3

AI Processes Your Video

ADAI's multi-agent AI pipeline analyzes your video frame by frame — detecting scenes, extracting dialogue, reading on-screen text, identifying characters, and generating natural audio descriptions. Track progress in real time from your dashboard.

View Dashboard
4

Review, Edit & Export

Watch the result in the built-in accessible player, or open the scene editor to fine-tune descriptions. Export the final video, download the separate BWF audio track, generate share links, or embed the player on your website.

View Videos