Disclosed are a video processing method and apparatus, and a storage medium. The method includes: receiving a selection instruction of having selected one or more video streams or key frames of the one or more video streams to be browsed; setting video stream thumbnails generated from the one or more video streams or key frame thumbnails generated from the key frames to a scene thumbnail to generate a picture layout stream according to the selection instruction, where the scene thumbnail is generated according to a scene displayed in an augmented reality/virtual reality (AR/VR) interface; and presenting the picture layout stream in the VR/AR interface, and providing a virtual layout interface of multiple video stream pictures.