Rounding up a barrage of "GenAI for visuals" announcements

Visual 1st Perspectives

February 19, 2025

Since our last issue there's been a barrage of news and perspectives published around generative AI solutions for creating or editing photos and/or videos. As a number of these GenAI announcements are to some extent related, I’ll focus specifically on GenAI in this issue.

What’s a real photo? One that isn’t fake, right? Or is that a backwards approach? In any case, are tech standards enough?

Galaxy S25. Content Credentials icon. When launching its Galaxy S25 smartphones a few weeks ago, Samsung heralded that when using the phones’ AI-powered Generative Edit features, you’ll see the C2PA Content Credentials icon, which can then reveal the metadata that indicate whether an image contains AI-generated content. Samsung is the first smartphone manufacturer to adopt this standard.

Google Magic Editor. SynthID watermark. Soon after, Google Photos announced that photos that have been edited using Google’s Magic Editor “reimagine” GenAI features, will now carry Google’s SynthID watermark. This watermark was already applied to AI images created from scratch using Google’s Imagen text-to-image model.

[SynthID is an invisible watermark specifically aimed at identifying AI-generated images and other media types (audio, text, video).

Content Credentials is visibly displayed as an icon that links to C2PA metadata about the source and history of any images. I.e. it is also used for real (i.e. camera-captured) photos].

So what’s wrong with the Galaxy S25 approach? It’s a backward way of telling you whether an image is a real photo or not, according to this excellent PetaPixel article: Samsung’s Image Authenticity Approach is Backward and It’s AI’s Fault.

The backwards part? [Samsung is] “responding to the public’s demand to know when an image they’re seeing isn’t real. But while that is what the public is saying, it’s not what they actually want. What the public really wants to know is when a photo isn’t AI — when it’s real.”

Source: PetaPixel

And guess what? “It is remarkably difficult to prove a photo is fake or was edited in some way, but it’s very easy to prove a photo is real and unchanged.”

So the approach that makes most sense is to not stray away from C2PA’s roots: promote the use of image metadata to prove that a photo is real and unedited – any other images should then be viewed with a healthy dose of suspicion.

Needed to (re-)trust photos: a multi-layered infrastructure. But there’s more to adhering to an industry tech standard like C2PA to let us trust real photos or to adding invisible watermarks that identify GenAI images. Industry friend Paul Melcher champions establishing a multi-layered infrastructure aimed at the survival of light-based photography—and its value as a truthful record of the world around us.

He advocates intertwining tech initiatives (tamper-evident metadata, cryptographic seals, AI forensics), in addition to behavioral standards (ISO-style ethics, trust-level systems, peer oversight), and legislative frameworks (legal mandates, licensing, global agreements) to establish an ecosystem where:

Authentic photographs are immediately verifiable,
Ethical accountability is enforced, not just encouraged, and
Clear legal consequences exist for those who exploit fabricated imagery to deceive.

Read his whole analysis on Kaptur.

Google. Whisk’s motto: Prompt less, Play more. Whisk, Google’s GenAI image remixing tool is now available outside the US – in more than 100 countries that is. By now we all know that creating images the way you want them solely from text prompts is pretty darn hard. Google Whisk tries to make things easier by letting you provide separate (text or image) prompts for subject, scene, and style. Whisk then uses Google’s Imagen 3 tech to remix all these inputs into a single image. Not quite what you wanted? Type something like “change pants to pink-striped pajamas” and Whisk takes it from there.

Adobe. GenAI video: Sora, here we come. Get your wallet out. A few months after OpenAI launched Sora, its text-to-video generator, Adobe’s Generate Video is now in public beta, allowing anyone to generate five-second video clips at 1080p. Based on either text or image prompts, Generate Video also includes various options to refine or guide the results, such as simulating styles, camera angles, motion, and shooting distances.

Did I say “anyone”? I meant: anyone who’s willing to fork over between $10-$30 per month for this beta – “introductory” prices that will change to unspecified new prices on March 15. Alas, it's a beta of a quality that is so bad Adobe shouldn’t be charging money according to Fstoppers. Ouch.

ByteDance. Pushing the GenAI video quality. GenAI video not quite ready for prime time? While that might be the case for Adobe, how about for TikTok owner ByteDance? Researchers from this company have demoed a new AI system, OmniHuman-1, which can generate perhaps the most realistic deepfake videos to date, according to this TechCrunch report.

Descript. From text-based video editing to video generation. Past Visual 1st presenter Descript, best known for its text-based video editing features (it transforms audio and video into editable text, allowing you to edit your media as easily as editing a document) launches AI Video Maker, aimed at turning initial ideas into a fully drafted video by means of Chat GPT. How does it work? It all starts with a chat in which you tell the AI of what you’re thinking of. It then will generate a script and set the video generation in motion.

Synthesia. Selfie Avatars with "real" backgrounds. You’ve probably seen the GenAI apps that let you create selfie avatars, some even with movement and audio. Past Visual 1st presenter Synthesia goes one step further with its new Selfie Avatars tool. Upload 5-10 selfies, enter a text prompt that describes the setting (say, “Skydiving wearing sunglasses”, record a voice over, and Synthesia generates your avatar, letting your animated self speak in your own voice, and situate you in the setting of your choice that you’d described in your text prompt. Put your synthetic self in a super market, a coffee shop, in the jungle, or piloting a plane - you get the idea.

PiktID. Next generation of GenAI face editing. Former Visual 1st presenter PiktId announces Erase ID 3.0, the most advanced version of its AI tech for creating synthetic identities at scale by replacing faces in original photos with GenAI-created versions. The beauty of it: take one photo and make many variations featuring different "people". Or alter their facial expressions, ethnicity, age or hair styles.

And a few more things ...

Looking Glass. Camera store robbery. Berkeley-based photo retailer Looking Glass, whose owner Jen Waicukauski shared her perspectives as a forward-looking photo retailer at Visual 1st in the past, has suffered a violent break-in after robbers rammed a car through the shop front. The store suffered significant damage to the building and at least $50K worth of merchandise was stolen.

Help them to rebuild their store! Donations are appreciated: GoFundMe.

Photo Imaging Connect. It's time to register. I’ll have the pleasure of giving a keynote at Gary Pageau’s Photo Imaging Connect conference, March 16-17, 2025, held in conjunction with WPPI in Las Vegas.

Check out the complete program and buy your ticket now! $50-off Early Bird tickets are still available as of this writing. Conference tickets also includes a full WPPI conference pass, which provides access to the trade show, classes, and WPPI parties, March 18-20, 2025.

IPC. May 14 awards luncheon. The International Photographic Council (IPC) will host its annual IPC Hall of Fame and Professional Photographer Achievement Awards Luncheon on May 14, 2025. This special event will take place in the Delegates Dining Room at the United Nations and celebrate some of the world’s most recognizable photographers.

This year’s IPC Hall of Fame recipient is Kurt Nowak. Nowak was born and raised in Vienna, Austria and is the founder of Colenta Labortechnik and a visionary in photographic and imaging technology.

Visit IPCPhoto.org to learn more or to secure your reservation to the May luncheon. Contact Kathy Jello, IPC Vice President for more information on sponsorship opportunities for 2025 or to donate to the James L. Chung Scholarship program.

Save the date: Visual 1st 2025!

When: Tuesday Oct. 28 (PM) + Wednesday Oct. 29 (AM + PM)

Where: Public TV & Radio station KQED’s state-of-the art headquarters, San Francisco

Secure your spot: Buy your $150 off Super Early Bird discount ticket now!

Hope to see you all in San Francisco, Oct. 28-29!

Hans Hartman

Archive & Subscribe | Share your news with us | Connect on LinkedIn

Visual 1st | Suite 48 Analytics