Technical Details

Looking for AI?

Build Video + AI experiences with Video + AI overview, Live Captions, and Post-Call Transcription.

Before you start: see definitions in Glossary.


Architecture at a glance

Every Vonage Video app has a client, an app server, and runs inside a session in the Vonage cloud.

Client

  • Uses the client SDK (Web/iOS/Android/Desktop)
  • Obtains sessionId + token from your server
  • Connects to the session, publishes local A/V
  • Subscribes to remote streams, handles events

App Server

  • Uses a Server SDK / REST API
  • Creates sessions; generates tokens and roles
  • Implements business logic & webhooks/callbacks
  • Controls recording, broadcast, moderation, storage

Session

  • “Room” hosted in Vonage Video Cloud
  • Signaling, events, media routing
  • Publish/subscribe of streams
Vonage Video components

Developer map — where to find things

Client SDKs

Server SDKs

  • Server SDKs (overview) — create sessions, issue tokens/roles, manage archives, broadcasts, SIP, and callbacks.

Starter guides & samples

Tools


Authentication & request flow (conceptual)

  1. Credentials — Your backend holds the Vonage API Key and API Secret (never expose the Secret to clients).
  2. Create a session — Backend creates a sessionId using a Server SDK or REST.
  3. Issue a client token — Backend generates a short-lived token (optionally with a role/TTL).
  4. Client joins — Frontend fetches { apiKey, sessionId, token } from your endpoint and connects via the client SDK.

Feature overview — what you can build

Core real-time

Recording & delivery

Interconnect & moderation


Backend responsibilities

In production, your backend typically:

  • Creates sessions and issues tokens with roles,
  • Starts/stops archives and broadcasts; manages lifecycle and storage,
  • Handles webhooks/callbacks (archive completed, broadcast status),
  • Orchestrates moderation and enforces business rules,
  • Initiates SIP interconnect when needed,
  • Brokers speech/AI pipelines (e.g., via Audio Connector) and manages secrets/keys,
  • Audits usage and maps session data to your domain model.

Note: TokBox is now Vonage Video.