A content editing apparatus (10) includes a CPU (12). When combining JPEG data and audio data forming a moving file 1 with JPEG data and audio data forming a moving file 2, the CPU (12) calculates a reproduction frequency of combined audio data based on a sampling frequency of the audio data in the movie file 1 and a sampling frequency of the audio data in the movie file 2. The calculated sampling frequency is a frequency that makes it possible to complete simultaneously the reproduction of the combined JPEG data and the combined audio data. This sampling frequency is written into the combined file.