Helping students understand the biases in generative AI

Bias is an inherent part of who we are, what we know, and what we have experienced. There is no way to eliminate it. By better understanding the many facets of bias and considering how it influences our actions, though, we can improve our ability to lessen its effects. The materials on this page are intended to help instructors and students analyze and discuss the biases and ethics of generative artificial intelligence.

Generative artificial intelligence has much potential in teaching and learning. It also has a dark side that students and faculty need to understand and explore. This includes biases in language, data, and algorithms; a tendency to fabricate; a potential to take on a threatening persona, and an origin story with disturbing elements. As with other technologies, these biases are amplified when an automated system replaces human judgment, as in such things as screening in job applications and college admission, and security screenings and searches.

ChatGPT, Microsoft Copilot, and other AI chatbots are trained on an enormous amount of publicly available online text. As a result, they have the same biases as those sources and the society that produced them. These include:

  • They are dominated by a white, male perspective.
  • They are highly influenced by American culture, American capitalism, and the English language.
  • They can and will invent sources, people, and events. (The industry term is “hallucinate.”)
  • They can generate offensive responses, although filtering has reduced the likelihood that will happen.
  • Similarly, AI detectors are more likely to flag the work of writers for whom English is not a first language. 

None of that means we should avoid AI tools. Rather, we need to remember that human judgment is crucial. Chatbots cannot reason or make decisions. The writing they produce is based on predictions of word order gleaned from the sources they have analyzed. As with any other information, we need to evaluate AI-created material with a critical eye.

Behind the biases

Programmers are well aware of the biases that emerge in generative AI. As Emilio Ferrara writes, though, it is difficult to determine where bias in large language models originates: The data used for training? The algorithms used for training? Human reviewers who provide feedback during testing? Audiences that programmers prioritize (corporate over general, for instance)? Or did the “guardrails” that companies created to cut down on bias and offensiveness lead to different types of biases? Any or all of those things could be at fault.

Addressing bias in generative AI systems is both challenging and problematic, Ferrara says. Training data and model configurations can be changed, but the results are impossible to predict. Additional programming that directs the AI model to avoid certain words or phrases could lead to misleading responses or make systems less useful.

Language itself contains biases, Ferrara says, and it is difficult to separate appropriate use in one cultural context from inappropriate use in another. Language patterns “are often deeply ingrained in the way people express themselves and the structures of language itself,” Ferrara writes. ChatGPT and other large language models are trained on text from websites, books, social media, and online chats, so biases in that material are inevitably reflected in outputs from chatbots. 

Another challenge in reducing biases in generative AI is determining the values AI should follow. Jackie Davalos and Nate Lanxon of Bloomberg News explain it this way: “We’re building systems and we’re saying they’re aligned to whose values? What values?” The author Brian Christian calls this an “alignment problem.”

Ferrara argues that instead of trying to eliminate biases in generative AI, we should work toward fairness and an alignment with human values. Of course, the terms fairness and bias are ambiguous in their own right, given the wide range of perspectives and beliefs in societies.

Categorizing biases

This list of biases is aggregated from Ferrara's article and a report from the National Institutes of Standards and Technology. It is not intended to be comprehensive but to provide perspectives for discussing bias in AI.

  • Cognitive and societal bias. These include biases in decisions about which data to use in developing AI, how AI will be used, who should pay for it, and how it will be made available – and to whom. 
  • Confirmation bias. Generative AI may reinforce flawed ideas or biases by providing only the views or representations a user expects.
  • Cultural biases, including prejudices and stereotypes.
  • Dataset biases.It is easiest to use available data than to seek out data that better represents a general population.
  • Demographic biases. Training of language models may over-represent or under-represent characteristics of race, gender, ethnicity, and social groups.
  • Human biases. These are inherent in all other biases, including perceptual bias, anchoring bias, confirmation bias, and framing effects.
  • Ideological and political biases. The responses from generative AI will reflect political and ideological biases from training materials.
  • Linguistic biases. English and a few other languages are most prevalent in online content, so generative AI models mostly reflect those languages.
  • Statistical biases: All statistical models and technological systems require assumptions from researchers and system designers. “These decisions affect who and what gets counted, and who and what does not get counted,” the NITS report says (p. 19).
  • Systemic bias. Long-used procedures and practices that favor some people and disadvantage others. “This need not be the result of any conscious prejudice or discrimination but rather of the majority following existing rules and norms.”
  • Techno-solutionism. A belief that technology is crucial to solving many of the world’s problems.
  • Temporal biases. Generative AI models are trained on material from particular time periods, so they may lack material before or after a certain point in time.'

Individualism and collectivism

Two other social and cultural aspects to consider with bias are individualism and collectivism.

  • Individualism. This is the foundation of American culture, capitalism, and academia, with an emphasis on personal freedom, personal choice, recognition of individual achievement, and the accumulation of wealth.
  • Collectivism. This is common in East Asia and other societies. It values conformity and the well-being of society over the individual.

Because we live in an individualistic culture, we often overlook the way this view shapes how we approach many aspects of life, including technology use. For instance, many Americans object to companies training generative AI on online digital materials, something that isn’t an issue in collectivist societies.

The charts below and another on the last page provide a good synopsis of the types of biases to consider. Both are from the National Institute of Standards and Technology report. The report’s glossary includes other biases to consider.

(Updated March 2024)

Chart showing systemic biases, statistical and computational biases, and human biases
From Towards a Standard for Identifying and Managing Bias in Artificial Intelligence, National Institutes of Standards and Technology, 2022.
Diagram showing types of systemic bias, human bias, and statistical and computational bias
From Towards a Standard for Identifying and Managing Bias in Artificial Intelligence, National Institutes of Standards and Technology, 2022.

Here are a few ideas for discussions and activities in which students explore the biases and other problems in generative AI.

Biases in language

1. Compare chatbots. Create a prompt that all students must use to generate text with a chatbot, but have them try different bots: ChatGPT, Copilot, Gemini, Claude.

  • What similarities and differences do they see in what the bots produced? What strengths and weaknesses?
  • What types of biases are reflected?
  • How would they rate the quality of the writing?
  • Did the bots provide sources? If so, are they real? Accurate?

2. Compare prompts. Give students a variety of prompts or have them create their own prompts related to class material. Alternatively, you can use generative AI to create material for students to evaluate.

  • What biases or stereotypes, if any, can students identify in the bot-produced work?
  • How could students change their prompts to avoid the types of bias they see?

3. Compare personasOne group of researchers found that having ChatGPT assume a personality increases the chances of toxic responses substantially. That toxicity varies widely depending on the persona. For instance, a persona of a journalist is likely to produce more toxic results than a persona of a businessperson. The persona of Richard Nixon is more toxic than that of John F. Kennedy. The researchers warn that “ChatGPT can be nudged to generate harmful content, especially when assigned a persona.”

  • Have students assign different personalities to ChatGPT and other chatbots and then ask similar questions.
    • How do the responses differ?
    • Are there noticeable differences in tone or toxicity?
    • What might cause these types of responses from generative AI?

4. Screen for bias. Lynne Walsh, a journalist and assistant director of the nonprofit site Trusting News, suggests using generative AI to spot biases in unpublished work. She focuses on journalistic writing, but the same approach could be used in academic writing or any other type of writing.

  • Have students use a generative chatbot to evaluate an article for biases. It could be their own work or an article they find online.
    • Possible prompt: Read this article and identify potential problems in (identify the types of biases you want to focus on). Create a list of those problems, explaining what each problem is and how it might be resolved.
  • What sorts of biases, if any, do the chatbots flag?
  • Do students see value in trying this with their own work in the future?

5. Compare names. A test by the news organization Reuters found that AI-powered job-screening systems have the same types of biases that humans have in evaluating candidates. Reuters submitted the same credentials with different names to the screening systems, but the systems ranked candidates differently depending on the gender and ethnicity of their names. Applications with names most associated with Black men were least likely to be ranked highest.   

  • Students could conduct similar tests. They would not have access to job-screening tools, but they could put different names on three or four short resumes, copy the resumes into a chatbot, and ask the bot to rank the qualifications of the candidates for various jobs.
  • Similarly, they could ask different chatbots to associate names with professions. Or ask the bot for job advice using a variety of names.

The biases, assumptions, and stereotypes of generative AI are easy to see in AI-produced images. Look at this example. Professors are depicted as white and male. A prompt for nurses as superheroes returned images of buxom women (mostly white) in short skirts and tight tops. Those are just two examples. Here are some other ways of helping students explore biases of AI through images.

Exercises and class discussions

  1. Create a prompt that all students must use in creating images with AI image creators: Copilot DesignerDall-E, Stable Diffusion, Night Café, Catbird (all are free but require creation of an account).
  • What types of biases or stereotypes do they see in the images?
  • How can you alter the prompts to reduce or eliminate the stereotypes or biases?
  1. Have students use different AI models to create images of different types of people or professions. For example: students, nurses, professors, cooks, chefs.
  • What types of biases do you see in the representations? In their surroundings? 
  1. Have students create images of different races, ethnicities, and genders.
  • How are people represented?
  • How well are the images rendered? (Many AI image models have difficulties with non-white faces.)
  • How could they change prompts to create more accurate representations?
  1. Have students view these AI-generated images of people from each U.S. state.
  1. Have students read the article below.
  • Do they agree with the author’s premise that the images in the article represent an “American smile”? Why or why not?
  • Have students use generate AI to create similar types of group images. Do they see similar representations of what the author calls “a lying smile”?
  • What other cultural distortions have they seen in AI-generated images?

(Updated March 2024)

This is a highly selective list of resources you might draw on in helping students discuss the biases and stereotypes they are likely to encounter in using generative artificial intelligence.

Resources

  • Diffusion Bias Explorer. A free interactive tool for looking at how the visual-creation models Stable Diffusion and DALL-E represent more than 100 professions. You can also combine the professions with about 20 different adjectives.
  • Gender Shades. Joy Buolamwini of the MIT Media Lab talks about the gender and racial biases in facial recognition systems. You can also read a related research article and a multimedia document.

This document as a handout

You can download all the material on this page as a handout in Word.