Embodiments of the present invention relate to audiovisual stream processing in videoconferences. For each audiovisual stream in a videoconference, a sound level of the audiovisual stream is detected. If the sound level exceeds a predefined threshold level, the audiovisual stream is processed with a first configuration. If the sound level is below the predefined threshold level, the audiovisual stream is processed with a second configuration. The second configuration is more resource-effective than the first configuration.