Helping students understand the biases in generative AI

Bias is an inherent part of who we are, what we know, and what we have experienced. There is no way to eliminate it. By better understanding the many facets of bias and considering how it influences our actions, though, we can improve our ability to lessen its effects. The materials on this page are intended to help instructors and students analyze and discuss the biases and ethics of generative artificial intelligence.

Generative artificial intelligence has much potential in teaching and learning. It also has a dark side that students and faculty need to understand and explore. This includes biases in language, data, and algorithms; a tendency to fabricate; a potential to take on a threatening persona, and an origin story with disturbing elements. As with other technologies, these biases are amplified when an automated system replaces human judgment, as in such things as screening in job applications and college admission, and security screenings and searches.

ChatGPT, Microsoft Copilot, and other AI chatbots are trained on an enormous amount of publicly available online text. As a result, they have the same biases as those sources and the society that produced them. These include:

They are dominated by a white, male perspective.
They are highly influenced by American culture, American capitalism, and the English language.
They can and will invent sources, people, and events. (The industry term is “hallucinate.”)
They can generate offensive responses, although filtering has reduced the likelihood that will happen.
Similarly, AI detectors are more likely to flag the work of writers for whom English is not a first language.

None of that means we should avoid AI tools. Rather, we need to remember that human judgment is crucial. Chatbots cannot reason or make decisions. The writing they produce is based on predictions of word order gleaned from the sources they have analyzed. As with any other information, we need to evaluate AI-created material with a critical eye.

Behind the biases

Programmers are well aware of the biases that emerge in generative AI. As Emilio Ferrara writes, though, it is difficult to determine where bias in large language models originates: The data used for training? The algorithms used for training? Human reviewers who provide feedback during testing? Audiences that programmers prioritize (corporate over general, for instance)? Or did the “guardrails” that companies created to cut down on bias and offensiveness lead to different types of biases? Any or all of those things could be at fault.

Addressing bias in generative AI systems is both challenging and problematic, Ferrara says. Training data and model configurations can be changed, but the results are impossible to predict. Additional programming that directs the AI model to avoid certain words or phrases could lead to misleading responses or make systems less useful.

Language itself contains biases, Ferrara says, and it is difficult to separate appropriate use in one cultural context from inappropriate use in another. Language patterns “are often deeply ingrained in the way people express themselves and the structures of language itself,” Ferrara writes. ChatGPT and other large language models are trained on text from websites, books, social media, and online chats, so biases in that material are inevitably reflected in outputs from chatbots.

Another challenge in reducing biases in generative AI is determining the values AI should follow. Jackie Davalos and Nate Lanxon of Bloomberg News explain it this way: “We’re building systems and we’re saying they’re aligned to whose values? What values?” The author Brian Christian calls this an “alignment problem.”

Ferrara argues that instead of trying to eliminate biases in generative AI, we should work toward fairness and an alignment with human values. Of course, the terms fairness and bias are ambiguous in their own right, given the wide range of perspectives and beliefs in societies.

Categorizing biases

This list of biases is aggregated from Ferrara's article and a report from the National Institutes of Standards and Technology. It is not intended to be comprehensive but to provide perspectives for discussing bias in AI.

Cognitive and societal bias. These include biases in decisions about which data to use in developing AI, how AI will be used, who should pay for it, and how it will be made available – and to whom.
Confirmation bias. Generative AI may reinforce flawed ideas or biases by providing only the views or representations a user expects.
Cultural biases, including prejudices and stereotypes.
Dataset biases.It is easiest to use available data than to seek out data that better represents a general population.
Demographic biases. Training of language models may over-represent or under-represent characteristics of race, gender, ethnicity, and social groups.
Human biases. These are inherent in all other biases, including perceptual bias, anchoring bias, confirmation bias, and framing effects.
Ideological and political biases. The responses from generative AI will reflect political and ideological biases from training materials.
Linguistic biases. English and a few other languages are most prevalent in online content, so generative AI models mostly reflect those languages.
Statistical biases: All statistical models and technological systems require assumptions from researchers and system designers. “These decisions affect who and what gets counted, and who and what does not get counted,” the NITS report says (p. 19).
Systemic bias. Long-used procedures and practices that favor some people and disadvantage others. “This need not be the result of any conscious prejudice or discrimination but rather of the majority following existing rules and norms.”
Techno-solutionism. A belief that technology is crucial to solving many of the world’s problems.
Temporal biases. Generative AI models are trained on material from particular time periods, so they may lack material before or after a certain point in time.'

Individualism and collectivism

Two other social and cultural aspects to consider with bias are individualism and collectivism.

Individualism. This is the foundation of American culture, capitalism, and academia, with an emphasis on personal freedom, personal choice, recognition of individual achievement, and the accumulation of wealth.
Collectivism. This is common in East Asia and other societies. It values conformity and the well-being of society over the individual.

Because we live in an individualistic culture, we often overlook the way this view shapes how we approach many aspects of life, including technology use. For instance, many Americans object to companies training generative AI on online digital materials, something that isn’t an issue in collectivist societies.

The charts below and another on the last page provide a good synopsis of the types of biases to consider. Both are from the National Institute of Standards and Technology report. The report’s glossary includes other biases to consider.

(Updated March 2024)

From Towards a Standard for Identifying and Managing Bias in Artificial Intelligence, National Institutes of Standards and Technology, 2022.

Here are a few ideas for discussions and activities in which students explore the biases and other problems in generative AI.

Biases in language

1. Compare chatbots. Create a prompt that all students must use to generate text with a chatbot, but have them try different bots: ChatGPT, Copilot, Gemini, Claude.

What similarities and differences do they see in what the bots produced? What strengths and weaknesses?
What types of biases are reflected?
How would they rate the quality of the writing?
Did the bots provide sources? If so, are they real? Accurate?

2. Compare prompts. Give students a variety of prompts or have them create their own prompts related to class material. Alternatively, you can use generative AI to create material for students to evaluate.

What biases or stereotypes, if any, can students identify in the bot-produced work?
How could students change their prompts to avoid the types of bias they see?

3. Compare personas. One group of researchers found that having ChatGPT assume a personality increases the chances of toxic responses substantially. That toxicity varies widely depending on the persona. For instance, a persona of a journalist is likely to produce more toxic results than a persona of a businessperson. The persona of Richard Nixon is more toxic than that of John F. Kennedy. The researchers warn that “ChatGPT can be nudged to generate harmful content, especially when assigned a persona.”

Have students assign different personalities to ChatGPT and other chatbots and then ask similar questions.
- How do the responses differ?
- Are there noticeable differences in tone or toxicity?
- What might cause these types of responses from generative AI?

4. Screen for bias. Lynne Walsh, a journalist and assistant director of the nonprofit site Trusting News, suggests using generative AI to spot biases in unpublished work. She focuses on journalistic writing, but the same approach could be used in academic writing or any other type of writing.

Have students use a generative chatbot to evaluate an article for biases. It could be their own work or an article they find online.
- Possible prompt: Read this article and identify potential problems in (identify the types of biases you want to focus on). Create a list of those problems, explaining what each problem is and how it might be resolved.
What sorts of biases, if any, do the chatbots flag?
Do students see value in trying this with their own work in the future?

5. Compare names. A test by the news organization Reuters found that AI-powered job-screening systems have the same types of biases that humans have in evaluating candidates. Reuters submitted the same credentials with different names to the screening systems, but the systems ranked candidates differently depending on the gender and ethnicity of their names. Applications with names most associated with Black men were least likely to be ranked highest.

Students could conduct similar tests. They would not have access to job-screening tools, but they could put different names on three or four short resumes, copy the resumes into a chatbot, and ask the bot to rank the qualifications of the candidates for various jobs.
Similarly, they could ask different chatbots to associate names with professions. Or ask the bot for job advice using a variety of names.

The biases, assumptions, and stereotypes of generative AI are easy to see in AI-produced images. Look at this example. Professors are depicted as white and male. A prompt for nurses as superheroes returned images of buxom women (mostly white) in short skirts and tight tops. Those are just two examples. Here are some other ways of helping students explore biases of AI through images.

Exercises and class discussions

Create a prompt that all students must use in creating images with AI image creators: Copilot Designer, Dall-E, Stable Diffusion, Night Café, Catbird (all are free but require creation of an account).

What types of biases or stereotypes do they see in the images?
How can you alter the prompts to reduce or eliminate the stereotypes or biases?

Have students use different AI models to create images of different types of people or professions. For example: students, nurses, professors, cooks, chefs.

What types of biases do you see in the representations? In their surroundings?

Have students create images of different races, ethnicities, and genders.

How are people represented?
How well are the images rendered? (Many AI image models have difficulties with non-white faces.)
How could they change prompts to create more accurate representations?

Have students view these AI-generated images of people from each U.S. state.

What types of stereotypes or biases, if any, do they see in the images?
Where do these stereotypes originate?
If these are stereotypes, how would students render the same types of images?
Compare these to other representations created by the same site:
- Couples from the year 3000.
- Couples from each state if drawn by Pixar.

Have students read the article below.

AI and the American Smile: How AI misrepresents culture through a facial expression, by Jenka. Medium (26 March 2023).

Do they agree with the author’s premise that the images in the article represent an “American smile”? Why or why not?
Have students use generate AI to create similar types of group images. Do they see similar representations of what the author calls “a lying smile”?
What other cultural distortions have they seen in AI-generated images?

(Updated March 2024)

This is a highly selective list of resources you might draw on in helping students discuss the biases and stereotypes they are likely to encounter in using generative artificial intelligence.

Generative AI for Higher Education, resources from the Adobe Content Authenticity Initiative. Adobe has similar guides on Media Literacy, Visual Literacy, and other topics.
A Blueprint for an AI Bill of Rights for Education, by Kathryn Conrad. Critical AI (17 July 2023).
This is how AI image generators see the world, by Nitasha Tiku, Kevin Schaul, and Szu Yu Chen. The Washington Post (1 November 2023).
AI language models are rife with different political bias, by Melissa Heikkila. MIT Technology Review (7 August 2023).
Artificial Intelligence: Trust Issues, by Kerry Johnson. LinkedIn (13 April 2023).
AI writing assistants can cause biased thinking in their users, by Elizabeth Rayne. Ars Technica (26 May 2023).
Can ChatGPT and AI help us prevent bias and polarization in our reporting, by Lynn Walsh. Trusting News newsletter (27 June 2023).
ChatGPT’s Fluent BS Is Compelling Because Everything Is Fluent BS, by Amit Katwala. Wired (9 December 2022). “ChatGPT was trained on real-world text, and the real world essentially runs on fluent bullshit,” from media to business to politics to education.
Concepts of Ethics and Their Application to AI, by Bernd Carsten Stahl. (2021). In Artificial Intelligence for a Better Future. SpringerBriefs in Research and Innovation Governance. Springer, Cham.
The Ethics of ChatGPT: Ensuring AI Responsibly Serves Humanity, by Micheal Chukwube. ReadWrite (24 June 2023).
Even artificial intelligence can acquire biases against race and gender, by Matthew Hutson. Science (13 April 2017).
Fresh Evidence of ChatGPT’s Political Bias Revealed by Comprehensive New Study. University of East Anglia (17 August 2023).
Gender Bias in Futuristic Technologies: A Probe Into AI & Inclusive Solutions, by Mona. Observer Research Foundation (17 January 2022).
GPT detectors are biased against non-native English writers, by Weixin Liang, et al. Computation and Language (6 April 2023).
Humans Are Biased. Generative AI Is Even Worse, by Leonardo Nicoletti and Dina Bass. Bloomberg, (n.d.). An excellent analysis of AI-generated images of people. The interactive graphics make the data easy to comprehend. If you run into a paywall, try this version of the article without the graphics
Rise of the racist robots – how AI is learning our worst impulses, by Stephen Buranyi. The Guardian (8 August 2017).
Teaching AI Ethics, by Leon Furze. Blog post (26 January 2023). A good resource with many questions and sources. It includes a graphic about the ethical considerations: bias, environment, truth, copyright, privacy, datafication, affect recognition, human labor, power. (Also see a PDF handout on the site about ethics.)
These robots were trained on AI. They became racist and sexist, by Pranshu Verma. Washington Post (16 July 2022).
Towards a Standard for Identifying and Managing Bias in Artificial Intelligence, by Reva Schwartz, Apostol Vassilev, Kristen Greene, Lori Perine, Andrew Burt, and Patrick Hall. NIST Special Publication 1270. National Institute of Standards and Technology (March 2022).
Two Truths and a Lie, by Josh Kubicki. The Brainacts 114 (2 August 2023). The relevant information is in the first part of the post.
Why algorithms can be racist and sexist, by Rebecca Heilweil. Vox (18 February 2020).

Resources

Diffusion Bias Explorer. A free interactive tool for looking at how the visual-creation models Stable Diffusion and DALL-E represent more than 100 professions. You can also combine the professions with about 20 different adjectives.
Gender Shades. Joy Buolamwini of the MIT Media Lab talks about the gender and racial biases in facial recognition systems. You can also read a related research article and a multimedia document.

This document as a handout

You can download all the material on this page as a handout in Word.