Google has announced a new AI tool called Whisk that lets you generate images using other images as prompts instead of requiring a long text prompt.
With Whisk, you can offer images to suggest what you’d like as the subject, the scene, and the style of your AI-generated image, and you can prompt Whisk with multiple images for each of those three things. (If you want, you can fill in text prompts, too.) If you don’t have images on hand, you can click a dice icon to have Google fill in some images for the prompts (though those images also appear to be AI-generated). You can also enter some text into a text box at the end of the process if you want to add extra detail about the image you’re looking for, but it’s not required.
Whisk will then generate images and a text prompt for each image. You can favorite or download the image if you’re happy with the results, or you can refine an image by entering more text into the text box or clicking the image and editing the text prompt.
In a blog post, Google stresses that Whisk is designed to be for “rapid visual exploration, not pixel-perfect edits.” The company also says that Whisk may “miss the mark,” which is why it lets you edit the underlying prompts.
In the few minutes I’ve used the tool while writing this story, it’s been entertaining to tinker with. Images take a few seconds to generate, which is annoying, and while the images have been a little strange, everything I’ve generated has been fun to iterate on.
Google says Whisk uses the “latest” iteration of its Imagen 3 image generation model, which it announced today. Google also introduced Veo 2, the next version of its video generation model, which the company says has an understanding of “the unique language of cinematography” and hallucinates things like extra fingers “less frequently” than other models (one of those other models is probably OpenAI’s Sora). Veo 2 is coming first to Google’s VideoFX, which you can join the Google Labs waitlist for, and it will be expanded to YouTube Shorts “and other products” sometime next year.