AI image generators appear to propagate gender and race stereotypes

By Fionna Agomuoh Published November 1, 2022

Experts have claimed that popular AI image generators such as Stable Diffusion are not so adept at picking up on gender and cultural biases when using machine learning algorithms to create art.

Many text-to-art generators allow you to input phrases and draft up a unique image on the other end. However, these generators can often be based on stereotypical biases, which can affect how machine learning models manufacture images Images can often be Westernized, or show favor to certain genders or races, depending on the types of phrases used, Gizmodo noted.

Recommended Videos

What's the difference between these two groups of people? Well, according to Stable Diffusion, the first group represents an 'ambitious CEO' and the second a 'supportive CEO'.
I made a simple tool to explore biases ingrained in this model: https://t.co/l4lqt7rTQj pic.twitter.com/xYKA8w3N8N

— Sasha Luccioni, PhD 🦋🌎✨🤗 (@SashaMTL) October 31, 2022

Sasha Luccioni, artificial intelligence researcher for Hugging Face, created a tool that demonstrates how the AI bias in text-to-art generators works in action. Using the Stable Diffusion Explorer as an example, inputting the phrase “ambitious CEO” garnered results for different types of men, while the phrase “supportive CEO” gave results that showed both men and women.

Similarly, the DALL-E 2 generator, which was created by the brand OpenAI has shown male-centric biases for the term “builder” and female-centric biases for the term “flight attendant” in image results, despite there being female builders and male flight attendants.

While many AI image generators appear to just take a few words and machine learning and out pops an image, there is a lot more that goes on in the background. Stable Diffusion, for example, uses the LAION image set, which hosts “billions of pictures, photos, and more scraped from the internet, including image-hosting and art sites,” Gizmodo noted.

Racial and cultural bias in online image searches has already been an ongoing topic long before the increasing popularity of AI image generators. Luccioni told the publication that systems, such as the LAION dataset ,are likely to home in on 90% of the images related to a prompt and use it for the image generator.

Topics

Computing Writer

Fionna Agomuoh is a Computing Writer at Digital Trends. She covers a range of topics in the computing space, including…

Computing

I tried out Google’s latest AI tool that generates images in a fun, new way

Google's Whisk AI tool being used with images.

Google’s latest AI tool helps you automate image generation even further. The tool is called Whisk, and it's based on Google’s latest Imagen 3 image generation model. Rather than relying solely on text prompts, Whisk helps you create your desired images using other images as the base prompt.

Whisk is currently in an experimental phase, but once set up it's fairly easy to navigate. Google detailed in a blog post introducing Whisk that it is intended for “rapid visual exploration, not pixel-perfect edits.”

Computing

OpenAI could release its next-generation model by December

ChatGPT giving a response about its knowledge cutoff.

OpenAI plans to release its next-generation frontier model, code-named Orion and rumored to actually be GPT-5, by December, according to an exclusive report from The Verge. However, OpenAI boss Sam Altman is already pushing back.

According to "sources familiar with the plan," Orion will not initially be released to the general public, as the previous GPT-4 variants were. Instead, the company intends to hand the new model over to select businesses and partners, who will then use it as a platform to build their own products and services. This is the same strategy that Nvidia is pursuing with its NVLM 1.0 family of large language models (LLMs).

Computing

Midjourney’s AI image editing reimagines your uploaded photos

The new web UI for Midjourney.

Midjourney released its External Editor on Thursday, "a powerful new tool for unleashing your imagination." Available to select users, the AI tool will enable users to upload their own images, then adjust, modify and retexture them in a wide variety of artistic styles.

Previously, users could upload a reference image to Midjourney, either through the alpha web app or its discord server, then have the generation model use that as a reference to create a new image. You could not, however, make any edits to the source image itself. That's changing with the new External Editor. With it, you'll be able to add, modify, move, resize, remove, and restore specific assets within the image, as well as reskin it as a whole in an entirely new style — shifting it from, say, a photograph to pointillism to impressionist to anime. The system reportedly works on doodles and line drawings as well.

nproxy.org