Disclosed is a video acquisition method. The method includes: acquiring at least two existing video segments selected by a user through a video selection interface, where the video selection interface is reached by switching from a video capture interface or a detail interface; and synthesizing, based on a preset video duration, the at least two existing video segments into a target video whose duration is less than or equal to the preset video duration. Further disclosed are a video acquisition device, a terminal, and a storage medium.
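
The synthesis step can be illustrated with a minimal sketch. The abstract does not state how the duration limit is enforced, so the policy below (joining the segments in selection order and truncating at the preset duration) is only one assumed possibility. The names `Segment` and `plan_target_video` are hypothetical and do not come from the disclosure, and the sketch plans durations only; it does not decode or encode any video.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Segment:
    """Hypothetical stand-in for an existing video segment."""
    name: str
    duration: float  # seconds


def plan_target_video(segments: List[Segment], preset_duration: float) -> List[Segment]:
    """Return the portion of each segment to use so that the total
    duration does not exceed preset_duration.

    Assumed policy (not stated in the source): keep segments in the
    order the user selected them and cut off whatever exceeds the limit.
    """
    if len(segments) < 2:
        raise ValueError("at least two existing video segments are required")
    if preset_duration <= 0:
        raise ValueError("preset_duration must be positive")

    remaining = preset_duration
    plan: List[Segment] = []
    for seg in segments:
        if remaining <= 0:
            break  # the limit is already reached; later segments are dropped
        used = min(seg.duration, remaining)
        plan.append(Segment(seg.name, used))
        remaining -= used
    return plan


def total_duration(plan: List[Segment]) -> float:
    """Sum of the planned durations, in seconds."""
    return sum(seg.duration for seg in plan)
```

For example, with segments of 10 s, 8 s, and 5 s and a preset duration of 15 s, this plan uses all of the first segment, the first 5 s of the second, and none of the third. Other policies, such as trimming every segment proportionally, would satisfy the same constraint.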