Gemini is a family of multimodal AI models developed at Google that have been trained jointly across image, audio, video, and text data.