A 3D video conferencing station includes a video camera for capturing video signals and a depth map calculator for creating a depth map of a user of the 3D video conferencing station. The video signals together with the depth map are transmitted as 3D video data. A stereoscopic display device displays stereo images, which are calculated on the basis of received 3D video data. The depth map which is generated by the depth map calculator is also used for estimating the position of the user in order to control the calculation of the stereo images.