Google Whisk is a new way to create AI images using image prompts. Here’s how to try it

  • Google Whisk uses images as input instead of text-based directions
  • It is built on Google’s generative AI model Imagen 3
  • The experimental tool is free to try for users in the US

Google’s new AI tool makes it easier to create and remix visual concepts. Instead of asking you to describe what you have in mind, Whisk lets you supply three image prompts: one for the subject, one for the scene, and one for the style. It then combines them into a new image, which makes it a more intuitive way to experiment with different ideas.

While most of the best AI image generators require you to write a detailed text prompt, Whisk handles that step behind the scenes. When you place images into the web-based Whisk interface for inspiration, Google’s Gemini model automatically analyzes them and writes a detailed caption for each. Those captions are then fed into the Imagen 3 model, which generates a new image based on them.
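
For readers who want to picture that two-stage flow, here is a minimal Python sketch of the idea: caption each image, join the captions into one text prompt, then hand that prompt to an image generator. It is an illustration only, not Google's implementation. Every name in it (`ImagePrompts`, `build_prompt`, `run_pipeline`, and the two stand-in functions) is hypothetical, and the stand-ins simply fake the roles that Gemini and Imagen 3 play in Whisk. How Whisk actually words or combines its captions has not been described in this much detail.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical types: a captioner turns (image bytes, role) into text,
# and a generator turns a text prompt into image bytes.
Captioner = Callable[[bytes, str], str]
Generator = Callable[[str], bytes]


@dataclass
class ImagePrompts:
    """The three image inputs the article describes."""
    subject: bytes
    scene: bytes
    style: bytes


def build_prompt(inputs: ImagePrompts, caption_image: Captioner) -> str:
    """Caption each image, then join the captions into one text prompt."""
    subject = caption_image(inputs.subject, "subject")
    scene = caption_image(inputs.scene, "scene")
    style = caption_image(inputs.style, "style")
    # Assumption: the captions are simply labelled and concatenated.
    return f"Subject: {subject}. Scene: {scene}. Style: {style}."


def run_pipeline(
    inputs: ImagePrompts,
    caption_image: Captioner,
    generate_image: Generator,
) -> bytes:
    """Stage 1: images to captions. Stage 2: captions to a new image."""
    prompt = build_prompt(inputs, caption_image)
    return generate_image(prompt)


# Stand-ins so the sketch runs without any real model behind it.
def fake_captioner(image: bytes, role: str) -> str:
    return f"caption for {role} ({len(image)} bytes)"


def fake_generator(prompt: str) -> bytes:
    # A real generator would return image data; this one echoes the prompt.
    return prompt.encode("utf-8")
```

Passing the captioner and generator in as plain functions keeps the sketch independent of any particular model or service, so either stage could be swapped out without touching the rest.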