Real-Time Audio
Real-Time Audio is a set of actions for interacting with a Workstation's live audio.
📄️ Text-to-Speech
Play voice audio into the virtual microphone using a text-to-speech model. You must provide the exact text for the agent to speak. The audio is streamed as bytes in real time to reduce latency.
📄️ Speech-to-Text
The Real-Time Speech-to-Text (RSTT) endpoint provides a live streaming URL to listen for