Powered by Gemini 3.1 Flash with multi-image input, Google Search grounding, and up to 4K resolution.
Nano Banana 2 is built on Google's Gemini 3.1 Flash architecture, bringing multimodal understanding to image generation. It accepts up to 14 reference images alongside a text prompt, enabling powerful composition, style matching, and character consistency across generated outputs.
A standout feature is Google Search grounding, which allows the model to incorporate real-time information from the web into its generations. This is especially useful for creating images of current events, trending styles, or specific real-world locations and products.
The model supports output resolutions up to 4K and offers a wide selection of aspect ratios including ultrawide 21:9. Combined with strong text rendering inherited from the Gemini architecture and flexible output format options, Nano Banana 2 is one of the most versatile image generation models available on the platform.