List · 20 entries · power

20 Landmark Studies on Gender Bias You Should Know

The empirical case against 'it's just how things are' is built on specific studies — audit experiments, meta-analyses, and longitudinal tracking that made particular forms of bias legible. This is a working index of the ones that shifted the field.

  1. 1

    Zimmerman & West (1975) — Interruption in mixed-sex conversation

    In 11 mixed-sex conversations, men interrupted women 33 times for every 7 times women interrupted men. The foundational study of conversational dominance.

    Zimmerman & West, Language and Sex: Difference and Dominance (1975)

  2. 2

    Goldberg (1968) — Are women prejudiced against women?

    Participants rated identical academic papers significantly higher when bylined with male names. The first experimental demonstration of implicit gender bias in competence judgment.

    Goldberg, Transaction (1968)

  3. 3

    Steinpreis, Anders & Ritzke (1999) — CV audit for tenure-track psychology

    Faculty evaluated the same CV, labeled either 'Karen Miller' or 'Brian Miller.' 'Brian' was more likely to be recommended for hire; evaluators suggested 'Karen' needed more experience.

    Sex Roles (1999)

  4. 4

    Moss-Racusin et al. (2012) — Science faculty gender bias

    127 biology, chemistry, and physics faculty rated the same lab-manager CV as more competent, hirable, and worthy of $4,000 more in starting salary when the name was male.

    PNAS (2012)

  5. 5

    Correll, Benard & Paik (2007) — Motherhood penalty

    Matched résumés revealed mothers were recommended for hire at 47% the rate of equivalent childless women. Fathers were recommended more than childless men — the 'fatherhood bonus.'

    American Journal of Sociology (2007)

  6. 6

    Bowles, Babcock & Lai (2007) — Negotiation backlash

    Women who negotiated starting salaries using the same scripts as men were rated 'difficult to work with' and were less likely to be hired. Men using identical scripts received no penalty.

    Organizational Behavior and Human Decision Processes (2007)

  7. 7

    Heilman et al. (2004) — Likability penalty for agentic women

    Successful women in male-typed roles were rated as competent but disliked. Equivalent successful men were rated competent and likable. The double-bind quantified.

    Journal of Applied Psychology (2004)

  8. 8

    Goldin & Rouse (2000) — Blind auditions

    Symphony orchestras that adopted blind auditions (candidates played behind a screen) increased the probability of women advancing by 50%. The paper that launched hiring-blind as a policy lever.

    American Economic Review (2000)

  9. 9

    Spencer, Steele & Quinn (1999) — Stereotype threat in math

    Women performed worse on a hard math test when reminded the test showed gender differences. Framing the test as 'not diagnostic of gender differences' eliminated the gap.

    Journal of Experimental Social Psychology (1999)

  10. 10

    Terrell et al. (2017) — GitHub pull-request gender bias

    Across 3M+ pull requests: women's code was accepted at higher rates than men's when gender was not visible, and lower rates when it was.

    PeerJ Computer Science (2017)

  11. 11

    Hoffmann & Tarzian (2001) — 'The Girl Who Cried Pain'

    The landmark review documenting that women with identical pain complaints receive less and slower analgesia than men. Transformed how medical bias research is framed.

    Journal of Law, Medicine & Ethics (2001)

  12. 12

    Tsugawa et al. (2017) — Mortality and female physicians

    In 1.58M Medicare hospitalizations, patients treated by female internists had lower 30-day mortality (11.07% vs 11.49%) than comparable patients of male internists.

    JAMA Internal Medicine (2017)

  13. 13

    Hyde (2005) — Gender similarities hypothesis

    Meta-analysis of meta-analyses: most psychological sex differences are small or zero. The paper that reframed the scientific default from 'men and women are different' to 'show the effect size.'

    American Psychologist (2005)

  14. 14

    Daminger (2019) — The cognitive dimension of household labor

    Identified four dimensions of mental load (anticipating, identifying, deciding, monitoring) and measured women doing 70% of it in dual-earner households.

    American Sociological Review (2019)

  15. 15

    Salerno & Peter-Hagene (2015) — Anger and persuasion in juries

    In mock jury deliberations, men's anger *increased* persuasive weight; women's anger *decreased* it — even among female jurors. The gendered discount on anger, measured cleanly.

    Law and Human Behavior (2015)

  16. 16

    Paustian-Underdahl, Walker & Woehr (2014) — Leadership effectiveness meta

    Meta-analysis of 99 samples: women rated equal to or higher than men on leadership effectiveness across nearly all contexts, contrary to the male-leader-as-default assumption.

    Journal of Applied Psychology (2014)

  17. 17

    Glick & Fiske (1996) — Ambivalent Sexism Inventory

    Introduced the hostile/benevolent sexism distinction and the measurement instrument that remains standard. Showed 'positive' sexism correlates with worse outcomes for women globally.

    Journal of Personality and Social Psychology (1996)

  18. 18

    Mehl et al. (2007) — Who talks more?

    EAR-device recording of 396 participants found men and women each spoke about 16,000 words a day — statistically indistinguishable. Debunked the popular 20,000-vs-7,000 figure.

    Science (2007)

  19. 19

    Pope et al. (2000) — Missed heart attack diagnoses in ER

    Women under 55 were 7× more likely than men to be discharged from the ER mid-heart-attack. The study that forced cardiology to reckon with gender-atypical symptom training.

    New England Journal of Medicine (2000)

  20. 20

    Goldin (2014) — Grand gender convergence

    Claudia Goldin's Nobel-winning analysis of why the gender earnings gap persists despite converging educational attainment. The gap lives in non-linear work schedules — jobs that punish flexibility.

    American Economic Review (2014)

Sources

Frequently asked

Which study started the stereotype threat literature?

Steele & Aronson 1995 on race; Spencer, Steele & Quinn 1999 applied the concept to gender and math. Both have generated hundreds of replications and extensions.

What's the most-replicated gender bias finding?

The identical-CV effect — rating the same credentials higher when labeled with a male name. First demonstrated by Goldberg (1968), replicated across fields, countries, and decades.

What is the motherhood penalty?

Correll, Benard & Paik (2007) showed matched résumés with motherhood cues (PTA membership) reduced callback rates by 50% relative to identical childless résumés. Fathers showed the opposite pattern.

Did Claudia Goldin win the Nobel Prize for this work?

Yes — the 2023 Nobel Prize in Economic Sciences, for her historical analysis of women's participation in the labor market over centuries.

What's the easiest way to access these papers?

Google Scholar (scholar.google.com) will find open-access versions of most. ResearchGate and the authors' university profiles are also reliable sources for PDF access.