Imagine almost anything โ€“ then create it. Generate detailed, playful, realistic, or whimsical images. Or anything in-between.

The Gemini Image model uses deep language understanding to capture the nuance of your prompts โ€” bridging the gap between what you say and what you envision.

Capabilities

Multimodal understanding

Upload images and share text instructions with Gemini to create complex and detailed images.

Conversational inputs

Use everyday language while creating images, and keep the conversation going to refine what the model generates.

Real-world knowledge

Generate images that follow real-world logic, thanks to Geminiโ€™s advanced reasoning capabilities.


Model family

Gemini Image models are natively multimodal, and respond effectively and efficiently to even the most detailed prompts.

Nano Banana Pro (Gemini 3 Pro Image)

Create and edit images with studio-quality levels of precision and control.


Try Nano Banana (Gemini Image)