# Webpage Audio — How Audio Works in Webpage Modes In webpage modes (`webpage-audio`, `webpage-av`, `webpage-av-screenshare`), the bot's audio doesn't go directly to the meeting. Instead, audio is routed to your webpage, which plays it through the browser — and FirstCall (meeting infrastructure) captures the browser's audio output into the meeting. This guide explains why this matters, how to handle it, and what happens during interruptions. ## Why Audio Routing Differs by Mode ### Audio Mode (simple) ``` AgentCall TTS/GetSun (collaborative voice intelligence) generates audio (24kHz) → Server resamples to 16kHz → Server sends audio.send to FirstCall (meeting infrastructure) → FirstCall (meeting infrastructure) plays it in the meeting ``` The server handles everything. Your agent doesn't touch audio. ### Webpage Modes (your page plays audio) ``` AgentCall TTS/GetSun (collaborative voice intelligence) generates audio (24kHz) → Server sends tts.webpage_audio event to your webpage → Your webpage decodes + plays via Web Audio API → FirstCall (meeting infrastructure) captures browser audio output → Meeting participants hear it ``` **Your webpage is responsible for playing the audio.** If your page doesn't play it, nobody hears anything. ### Why This Architecture? In webpage modes, FirstCall (meeting infrastructure) loads your webpage in the rendering environment. It captures: - **Video**: whatever renders on the page (canvas, HTML, video elements) - **Audio**: whatever the page plays (Web Audio API, `