The generation and use of combined affine merge candidates are described. In a representative aspect, a method of video processing includes: generating, during a conversion between a current block of a video and a bitstream of the video, an updated merge candidate list by adding at least one combined merge candidate to a first merge candidate list; and performing the conversion by using the updated merge candidate list, wherein the first merge candidate list includes one or more sub-block prediction based merge candidates.
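
The list-update step can be illustrated with a minimal sketch. The abstract does not say how a combined candidate is derived, so everything below is an assumption made for illustration only: the `AffineMergeCandidate` type, the `combine` rule (pairwise averaging of control-point motion vectors from two candidates that share a reference index), the duplicate check, and the `max_size` limit are hypothetical and are not taken from the described method.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

MV = Tuple[int, int]  # (horizontal, vertical) motion vector components


@dataclass(frozen=True)
class AffineMergeCandidate:
    """Hypothetical sub-block (affine) merge candidate."""
    cpmvs: Tuple[MV, ...]  # control-point motion vectors
    ref_idx: int           # reference picture index


def combine(a: AffineMergeCandidate,
            b: AffineMergeCandidate) -> Optional[AffineMergeCandidate]:
    """Assumed combination rule: average the control-point MVs pairwise.

    Returns None when the two candidates are not compatible under this
    illustrative rule (different reference index or CPMV count).
    """
    if a.ref_idx != b.ref_idx or len(a.cpmvs) != len(b.cpmvs):
        return None
    cpmvs = tuple(
        ((ax + bx) // 2, (ay + by) // 2)
        for (ax, ay), (bx, by) in zip(a.cpmvs, b.cpmvs)
    )
    return AffineMergeCandidate(cpmvs, a.ref_idx)


def update_merge_list(first_list: List[AffineMergeCandidate],
                      max_size: int) -> List[AffineMergeCandidate]:
    """Build the updated list: the first list plus combined candidates."""
    updated = list(first_list)
    for i in range(len(first_list)):
        for j in range(i + 1, len(first_list)):
            if len(updated) >= max_size:
                return updated
            candidate = combine(first_list[i], first_list[j])
            # Skip incompatible pairs and candidates already in the list.
            if candidate is not None and candidate not in updated:
                updated.append(candidate)
    return updated


# Usage: two sub-block candidates yield one additional combined candidate.
first = [
    AffineMergeCandidate(((0, 0), (4, 0)), ref_idx=0),
    AffineMergeCandidate(((2, 2), (6, 2)), ref_idx=0),
]
updated = update_merge_list(first, max_size=5)
```

In this sketch the first list is left unchanged and the combined candidates are appended after it, so existing merge indices keep their meaning; an actual codec could order, prune, or derive the candidates differently.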