An audio processor for a video conference system receives an audio signal from content to be shared over a video conference and an audio signal from a network. The audio signal from the shared content and the audio signal from the network are mixed together for output to a speaker. The audio processor may also receive a local audio signal from a microphone. The local audio signal is mixed with the audio signal of the shared content to generate an outbound signal.
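
The two mixing paths described above can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes each signal arrives as an equal-length frame of floating-point samples in the range [-1.0, 1.0], that mixing is a sample-wise sum with hard clipping, and that the outbound signal carries only the shared content when no microphone signal is present. The names `mix` and `process_frame` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def mix(a: Sequence[float], b: Sequence[float], limit: float = 1.0) -> List[float]:
    """Sum two equal-length sample frames and clip each sample to [-limit, limit]."""
    if len(a) != len(b):
        raise ValueError("frames must have the same number of samples")
    return [max(-limit, min(limit, x + y)) for x, y in zip(a, b)]


def process_frame(
    content: Sequence[float],
    network: Sequence[float],
    mic: Optional[Sequence[float]] = None,
) -> Tuple[List[float], List[float]]:
    """Return (speaker, outbound) frames for one block of samples.

    speaker  = shared content mixed with the network signal
    outbound = microphone mixed with the shared content
               (assumption: content alone when no microphone signal is given)
    """
    speaker = mix(content, network)
    outbound = mix(mic, content) if mic is not None else list(content)
    return speaker, outbound


if __name__ == "__main__":
    speaker, outbound = process_frame(
        content=[0.25, 0.5], network=[0.25, 0.75], mic=[0.5, -0.5]
    )
    print(speaker)   # second sample exceeds 1.0 before clipping
    print(outbound)
```

Note that in this sketch the network signal is mixed only into the speaker path and never into the outbound path, so far-end audio is not sent back to the network.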