Technical Details

Looking for AI?

Build Video + AI experiences with Video + AI overview, Live Captions, and Post-Call Transcription.

The Vonage Video API lets you embed fully managed real-time sessions in web and mobile apps. Client SDKs capture and distribute media while handling signalling and layout events, and Server SDKs or REST endpoints create sessions, create tokens, and manage archives, broadcasts, SIP interconnect, and callbacks.

Architecture at a glance

Every Vonage Video app has a client, an app server, and runs inside a session in the Vonage cloud.

Client

  • Uses the client SDK (Web/iOS/Android/Desktop)
  • Obtains sessionId + token from your server
  • Connects to the session, publishes local A/V
  • Subscribes to remote streams, handles events

App Server

  • Uses a Server SDK / REST API
  • Creates sessions; generates tokens and roles
  • Implements business logic & webhooks/callbacks
  • Controls recording, broadcast, moderation, storage

Session

  • “Room” hosted in Vonage Video Cloud
  • Signaling, events, media routing
  • Publish/subscribe of streams

Authentication and session flow

  1. Create your account and application: generate your Vonage Video Application ID and Private Key.
  2. Create a session: generate a sessionId with a Server SDK or REST.
  3. Issue a token: create a token with the appropriate role and TTL.
  4. Join a session: frontend fetches {apiKey, sessionId, token} from your endpoint and connects with the client SDK.

Server-side controls

Feature overview

Real-time collaboration

Recording & production

  • Archiving to capture, store, and retrieve full session recordings.
  • Experience Composer to render a branded web layout into a composite stream for archives, broadcasts, or large-scale viewing.
  • Audio Connector to stream raw audio to AI pipelines or bring synthesized audio back into the session.

AI Integrations

Distribution

Tools

Debugging

  • Inspector to trace participants, streams, and quality events.
  • Archive Inspector for archive timelines, expected durations, and error states.

Diagnostic tools

  • Pre-Call Test to verify camera, microphone, bandwidth, and firewall readiness before a session.

Insights & analytics

Explore capabilities

  • Video Playground to prototype sessions, captions, experience composer, and more without writing code.
Vonage Video components

Starter guides & samples

  • Explore the Video sample repos on GitHub — Web, iOS, and Android — for ready-to-run projects covering captions, SIP, multi-party layouts, and more.
  • Fork the Video Reference Apps (Web, Android, iOS) to get production-ready meeting UX, CI, and testing workflows.

Note: TokBox is now Vonage Video.