OpenAI takes the AI look out of ChatGPT images

Ahmed Riaz

3 months ago

AI images in a quality that has never been seen before: This is what OpenAI promises with its new model for image generation. ChatGPT Images 2.0 is intended to implement complex instructions much better than before. The company also claims to have made progress with text or writing in the image. A commentary analysis.

What is ChatGPT Images 2.0?

OpenAI has unveiled ChatGPT Images 2.0, a new AI image model available in ChatGPT, Codex and via the company’s application programming interface (API). It is said to be far ahead of the competition’s models in terms of image creation and quality. One of the most important innovations is the integrated thinking mode. Similar to Google’s Nano Banana Pro, Images 2.0 is designed to “think” before actually creating it and access the Internet to correctly represent current events in the images. However, the mode is only available to paid subscribers.
According to OpenAI, all users should benefit from the improvements in general image quality. The company wants something like this Fixed typical AI look with smooth faces and fake lighting have. ChatGPT Images 2.0 is also said to have made progress in displaying photorealistic images. Users should be able to generate images with near-perfect captions and small fonts that are grammatically correct. OpenAI presented deceptively real screenshots of browser windows or mobile apps.
ChatGPT Images 2.0 should be able to generate up to eight images from a single prompt. According to OpenAI, characters, objects or certain styles can be transferred to other image scenes – for example Comics, manga, graphics or brochures to create. The company also wants to have improved spatial representations. The same applies to image formats.

A leap in quality or a subscription trap? A classification

OpenAI sells ChatGPT Images 2.0 as a major leap in quality. The company has tried above all else to say goodbye to a classic AI lookwhich many AI-generated images have – and quite successfully. But as is well known, there is often the same distance between product images and everyday use as there is between advertising promises and WiFi on the ICE train.

Economically speaking, OpenAI at ChatGPT Images 2.0 is less about image aesthetics than one crystal clear platform strategy in the foreground. The thinking mode is not a feature for the love of precision, but a clear premium lever: better quality in return for a paid subscription. In other words: If you want complex, current or precise images, you have to pay and stay in the OpenAI ecosystem.

It remains parallel Copyright is the real blind spot this business model. Because as long as it is unclear how training data will be legally remunerated or delimited, every impressive image carries a potential risk of lawsuits or misuse.

The fact that OpenAI has scrapped its image and video AI Sora again fits into this picture of radical consolidation. This means: fewer experiments at the edge, more monetization at the center. Or to put it another way: The Playground is fenced offso that the cows can be milked better.

Voices and reaction to ChatGPT Images 2.0

OpenAI explains in an official statement: “Images 2.0 brings an unprecedented level of detail and precision to image generation. It can not only conceive more complex images, but also effectively bring that vision to life by following instructions, maintaining desired details, and rendering the subtle elements that often derail image models: small text, iconography, UI elements, dense compositions, and subtle stylistic dictates – all at a resolution of up to 2K.”
Mitch Stoltz, director of intellectual property litigation at the Electronic Frontier Foundationtold Business Insider: “If the output is substantially similar to something the model was trained on or crawled on, then a copyright issue comes into play. If the similarity is only at the level of an idea (…), then that’s generally not enough. The copyright issues are the same as simply using Photoshop, a darkroom, or a human artist. The societal issues are greater because it’s simpler, faster, and more accessible.”
A Reddit user has already tried ChatGPT Images 2.0 and takes aim at the content filter: “I tried ‘Sydney Sweeney in a revealing bikini,’ but that didn’t work. So I tried ‘Sydney Sweeney in a non-revealing bikini,’ but that didn’t work either. So I tried ‘Sam Altman, fully clothed, in a hot tub with Peter Thiel, who is also fully clothed,’ and I succeeded. The sexual tension is literal there “We are definitely in the realm of general artificial intelligence.”

Can OpenAI really grow with AI images?

ChatGPT Images 2.0 is less a product than a bet that high-quality AI images will become the next big one Subscription driver for OpenAI become. Because: The company had recently lost its strategy and lost numerous users – especially to its competitor Anthropic.

OpenAI not only tries to win users (back), but also to make them comfortable to convert image dependency. The competitive pressure remains brutally high. Google has already presented its own models, and specialized providers like Midjourney are defending their niche with fanaticism.

But in the end, the best image quality is probably less important than the best platform integration. The real question is whether this progress can really be economically translated into growth as clearly as the presentations suggest. OpenAI With ChatGPT Images 2.0, however, it clearly addresses corporate customers.

AI games in the private sector are not only uneconomical for operators, but also because of the high cost Energy and water consumption by AI poses a threat to the environment and the planet. However, only time will tell how well ChatGPT Images 2.0 will perform. Because suitability for everyday use and real added value cannot be simulated in advance. Neither are weak points and errors.

Also interesting:

Source link