The Times Australia
The Times World News

.
The Times Real Estate

.

AI art is everywhere right now. Even experts don't know what it will mean

  • Written by Rodolfo Ocampo, PhD student, Human–AI Creative Collaboration, UNSW Sydney
AI art is everywhere right now. Even experts don't know what it will mean

An art prize at the Colorado State Fair was awarded[1] last month to a work that – unbeknown to the judges – was generated by an artificial intelligence (AI) system.

Social media have also seen an explosion of weird images generated by AI from text descriptions, such as “the face of a shiba inu blended into the side of a loaf of bread on a kitchen bench, digital art”.

Or perhaps “A sea otter in the style of ‘Girl with a Pearl Earring’ by Johannes Vermeer”:

‘A sea otter in the style of ‘Girl with a Pearl Earring’ by Johannes Vermeer.’ OpenAI[2]

You may be wondering what’s going on here. As somebody who researches creative collaborations between humans and AI, I can tell you that behind the headlines and memes a fundamental revolution is under way – with profound social, artistic, economic and technological implications.

How we got here

You could say this revolution began in June 2020, when a company called OpenAI achieved a big breakthrough in AI with the creation of GPT-3[3], a system that can process and generate language in much more complex ways than earlier efforts. You can have conversations with it about any topic, ask it to write a research article or a story, summarise text, write a joke, and do almost any imaginable language task.

Read more: Robots are creating images and telling jokes. 5 things to know about foundation models and the next generation of AI[4]

In 2021, some of GPT-3’s developers turned their hand to images. They trained a model on billions of pairs of images and text descriptions, then used it to generate new images from new descriptions. They called this system DALL-E, and in July 2022 they released a much-improved new version, DALL-E 2[5].

Like GPT-3, DALL-E 2 was a major breakthrough. It can generate highly detailed images from free-form text inputs, including information about style and other abstract concepts.

For example, here I asked it to illustrate the phrase “Mind in Bloom” combining the styles of Salvador Dalí, Henri Matisse and Brett Whiteley.

An image generated by DALL-E from the prompt “Mind in Bloom’ combining the styles of Salvador Dali, Henri Matisse and Brett Whiteley’. Rodolfo Ocampo / DALL-E

Competitors enter the scene

Since the launch of DALL-E 2, a few competitors have emerged. One is the free-to-use but lower-quality DALL-E Mini (developed independently and now renamed Craiyon[6]), which was a popular source of meme content.

Images generated by Craiyon from the prompt ‘Darth Vader riding a tricycle outside on a sunny day’. Craiyon[7]

Around the same time, a smaller company called Midjourney[8] released a model that more closely matched DALL-E 2’s capabilities. Though still a little less capable than DALL-E 2, Midjourney has lent itself to interesting artistic explorations. It was with Midjourney that Jason Allen generated the artwork that won the Colorado State Art Fair competition.

Google too has a text-to-image model, called Imagen[9], which supposedly produces much better results than DALL-E and others. However, Imagen has not yet been released for wider use so it is difficult to evaluate Google’s claims.

Images generated by the Imagen text-to-image model, together with the text that produced them. Google / Imagen[10]

In July 2022, OpenAI began to capitalise on the interest in DALL-E, announcing[11] that 1 million users would be given access on a pay-to-use basis.

However, in August 2022 a new contender arrived: Stable Diffusion[12].

Stable Diffusion not only rivals DALL-E 2 in its capabilities, but more importantly it is open source. Anyone can use, adapt and tweak the code as they like.

Already, in the weeks since Stable Diffusion’s release, people have been pushing the code to the limits of what it can do.

To take one example: people quickly realised that, because a video is a sequence of images, they could tweak Stable Diffusion’s code to generate video from text.

Another fascinating tool built with Stable Diffusion’s code is Diffuse the Rest[13], which lets you draw a simple sketch, provide a text prompt, and generate an image from it. In the video below, I generated a detailed photo of a flower from a very rough sketch.

In a more complicated example below, I am starting to build software that lets you draw with your body, then use Stable Diffusion to turn it into a painting or photo.

The end of creativity?

What does it mean that you can generate any sort of visual content, image or video, with a few lines of text and a click of a button? What about when you can generate a movie script with GPT-3 and a movie animation with DALL-E 2?

And looking further forward, what will it mean when social media algorithms not only curate content for your feed, but generate it? What about when this trend meets the metaverse in a few years, and virtual reality worlds are generated in real time, just for you?

These are all important questions to consider.

Some speculate[14] that, in the short term, this means human creativity and art are deeply threatened.

Perhaps in a world where anyone can generate any images, graphic designers as we know them today will be redundant. However, history shows human creativity finds a way. The electronic synthesiser did not kill music, and photography did not kill painting. Instead, they catalysed new art forms.

I believe something similar will happen with AI generation. People are experimenting with including models like Stable Diffusion as a part of their creative process.

Or using DALL-E 2 to generate fashion-design prototypes:

A new type of artist is even emerging in what some call “promptology”, or “prompt engineering[15]”. The art is not in crafting pixels by hand, but in crafting the words that prompt the computer to generate the image: a kind of AI whispering.

Collaborating with AI

The impacts of AI technologies will be multidimensional: we cannot reduce them to good or bad on a single axis.

New artforms will arise, as will new avenues for creative expression. However, I believe there are risks as well.

Read more: So this is how it feels when the robots come for your job: what GitHub's Copilot 'AI assistant' means for coders[16]

We live in an attention economy that thrives on extracting screen time from users; in an economy where automation drives corporate profit but not necessarily higher wages, and where art is commodified as content; in a social context where it is increasingly hard to distinguish real from fake; in sociotechnical structures that too easily encode biases in the AI models we train. In these circumstances, AI can easily do harm.

How can we steer these new AI technologies in a direction that benefits people? I believe one way to do this is to design AI[17] that collaborates with, rather than replaces, humans.

References

  1. ^ awarded (arstechnica.com)
  2. ^ OpenAI (twitter.com)
  3. ^ GPT-3 (arxiv.org)
  4. ^ Robots are creating images and telling jokes. 5 things to know about foundation models and the next generation of AI (theconversation.com)
  5. ^ DALL-E 2 (arxiv.org)
  6. ^ Craiyon (www.craiyon.com)
  7. ^ Craiyon (www.craiyon.com)
  8. ^ Midjourney (www.midjourney.com)
  9. ^ Imagen (imagen.research.google)
  10. ^ Google / Imagen (imagen.research.google)
  11. ^ announcing (openai.com)
  12. ^ Stable Diffusion (stability.ai)
  13. ^ Diffuse the Rest (huggingface.co)
  14. ^ Some speculate (twitter.com)
  15. ^ prompt engineering (en.wikipedia.org)
  16. ^ So this is how it feels when the robots come for your job: what GitHub's Copilot 'AI assistant' means for coders (theconversation.com)
  17. ^ design AI (research.rodolfoocampo.com)

Read more https://theconversation.com/ai-art-is-everywhere-right-now-even-experts-dont-know-what-it-will-mean-189800

The Times Features

How to buy a coffee machine

For coffee lovers, having a home coffee machine can transform your daily routine, allowing you to enjoy café-quality drinks without leaving your kitchen. But with so many optio...

In the Digital Age, Online Promotion Isn't Just an Option for Small Businesses – It's a Necessity

The shift to an online-first consumer landscape means small businesses must embrace digital promotion to not only survive but thrive in 2025. From expanding reach to fostering cu...

Sorbet Balls by bubbleme Bring Bite-Sized Cool Spin to Frozen Snacking

A cool new frozen treat is rolling into the ice-cream aisle at Woolworths stores nationwide. Dairy-free, gluten-free and free from artificial colours, bubbleme Sorbet Balls ar...

Mind-Body Balance: The Holistic Approach of Personal Training in Moonee Ponds

Key Highlights Discover the benefits of a holistic approach to personal training in Moonee Ponds and nearby Maribyrnong, including residents from Strathmore. Learn how mind-b...

How Online Platforms Empower You to Find Affordable Removalists and Electricity Plans

When you move into a new home, you have many tasks to do. You need to hire removalists and set up your electricity.  In this article, we discuss how online platforms empower you ...

IS ROSEMARY OIL THE SECRET TO BETTER HAIR DAYS? HERE’S WHAT IT CAN DO

Rosemary hair oil is a straightforward natural solution that delivers exceptional results for anyone who wants to enhance their haircare process. It maintains its status in herba...

Times Magazine

CNC Machining Meets Stage Design - Black Swan State Theatre Company & Tommotek

When artistry meets precision engineering, incredible things happen. That’s exactly what unfolded when Tommotek worked alongside the Black Swan State Theatre Company on several of their innovative stage productions. With tight deadlines and intrica...

Uniden Baby Video Monitor Review

Uniden has released another award-winning product as part of their ‘Baby Watch’ series. The BW4501 Baby Monitor is an easy to use camera for keeping eyes and ears on your little one. The camera is easy to set up and can be mounted to the wall or a...

Top Benefits of Hiring Commercial Electricians for Your Business

When it comes to business success, there are no two ways about it: qualified professionals are critical. While many specialists are needed, commercial electricians are among the most important to have on hand. They are directly involved in upholdin...

The Essential Guide to Transforming Office Spaces for Maximum Efficiency

Why Office Fitouts MatterA well-designed office can make all the difference in productivity, employee satisfaction, and client impressions. Businesses of all sizes are investing in updated office spaces to create environments that foster collaborat...

The A/B Testing Revolution: How AI Optimized Landing Pages Without Human Input

A/B testing was always integral to the web-based marketing world. Was there a button that converted better? Marketing could pit one against the other and see which option worked better. This was always through human observation, and over time, as d...

Using Countdown Timers in Email: Do They Really Increase Conversions?

In a world that's always on, where marketers are attempting to entice a subscriber and get them to convert on the same screen with one email, the power of urgency is sometimes the essential element needed. One of the most popular ways to create urg...

LayBy Shopping