Official documentation for Vurbo.ai API Service (VAS). Covering real-time speech recognition, translation, text-to-speech, summarization and broadcasting.

Feature overview

    Real-time voice translation
    WebSocket streaming speech recognition and translation, supporting single-speaker, multi-speaker conversation, and bilingual interpretation modes.
    Audio import
    Upload audio for offline processing, returning transcripts, translations and summaries.
    Broadcast
    A presenter streams over WebSocket while multiple viewers receive real-time captions over SSE.
    Task management
    Query, update and export Tasks and Recordings via the REST API.
Copyright © 2026