ChatGPT image-generation feature gets an upgrade
In a recent livestream, OpenAI CEO Sam Altman introduced a significant update to ChatGPT’s image-generation capabilities. It is the first major change to the feature in over a year, and it alters how the platform handles image creation and editing.
ChatGPT now uses OpenAI’s GPT-4o model to generate and modify images natively. Until now, GPT-4o could generate and edit only text inside ChatGPT; it can now create detailed images as well, giving users a single experience that combines text and visual content.
The new functionality first went live for subscribers on the $200-a-month Pro plan, in both ChatGPT and Sora, OpenAI’s AI-powered video tool. OpenAI plans to extend the upgrade to Plus and free users, as well as to developers using its API service, in the near future.
Improved Accuracy and Detailed Image Edits
Compared with DALL-E 3, the image model it effectively replaces in ChatGPT, GPT-4o “thinks” a bit longer to produce more refined and accurate images. Users can also edit existing images, transform their content, or “inpaint” details such as foreground objects and background elements.
According to OpenAI, the new image-generation capability was trained on publicly available data alongside proprietary content provided by partners, including stock image platforms.
Respecting Creative Integrity and Artist Rights
OpenAI emphasizes its commitment to protecting the rights of artists. The company has put strict policies in place to ensure that the generated images do not directly mimic any living artist’s work. Creators also have an opt-out option, allowing them to request their works be removed from training datasets if they prefer.
The company also says it honors requests from website owners to block its web-scraping bots from gathering training data.
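For site owners, such a request is usually expressed in a robots.txt file. The following is a minimal sketch, not OpenAI’s own tooling: it assumes the crawler identifies itself as GPTBot (the user-agent name OpenAI documents for its crawler) and uses a hypothetical site, example.com, with a hypothetical image path. It relies on Python’s standard urllib.robotparser module to check what the rules permit.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an artist's portfolio site.
# "GPTBot" is assumed to be the crawler's user-agent name.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    image_url = "https://example.com/gallery/painting.png"  # hypothetical path
    for agent in ("GPTBot", "ExampleBot"):
        print(agent, is_allowed(ROBOTS_TXT, agent, image_url))
```

A robots.txt rule is only a request: it works to the extent that the crawler chooses to respect it, which is what OpenAI says its bots do.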
Looking at the Competitive Landscape
This upgrade for ChatGPT comes shortly after Google introduced its own experimental native image output for Gemini 2.0 Flash. Google’s feature quickly went viral, but it also drew criticism because it could be used to remove watermarks and to create images depicting copyrighted characters.
OpenAI positions its latest feature as a more refined alternative, with an emphasis on data curation and detailed image output. In a statement to the Wall Street Journal, the company attributed the improvements to the model’s training on a diverse range of data sources.
Conclusion
The introduction of native image generation in ChatGPT, powered by GPT-4o, is a major step for the platform. With its new capabilities for creating and editing images, ChatGPT is set to give users a richer, more versatile AI experience. As the feature becomes more widely available, both casual and professional users can expect more detail and accuracy in AI-generated visuals.