Humor as a Mirror: The New Yorker Captions as Reflections of Society and Stereotypes

Abstract

Humor is a universal yet complex form of expression. The more we think about it and the more complexes it becomes! An example of humor purposes could be in the way of dealing with more ‘serious’ matters [1] or easing social tensions and establishing social bonds [2].

With this in mind, this project aim to investigate how jokes reflect societal traits and values. We will begin by examining what is generally perceived as “funny” and “unfunny,” and then explore how humor operates within two social domains : professions and gender. Finally, we will synthesize our findings to understand what humor reveals about the norms of contemporary society.

Research Questions

Axis 1 – What Is Considered Funny
- 1.1 : What makes captions funny ?
- 1.2 : Are some topics inherently funnier than others and do they increase your chances of winning ?
Axis 2 – How professions are laughed about
- 2.1 : Where do occupations appear and how frequent are they ?
- 2.2 : Are some categories of occupation funnier than others ?
- 2.3 : Do we appreciate work of other people ? Temporal & topic analysis.
Axis 3 – Gender Roles and stereotypes
- 3.1 : How are men and women depicted in cartoons and captions, and do these depictions reflect traditional gender roles or stereotypes?
- 3.2 : How does audience response relate to gendered content - do captions about one gender receive more positive attention than the other?

Additional Datasets

Five datasets are used to construct an exhaustive list of occupations:
The main purpose of these datasets is to construct a comprehensive list of occupations. Other data in said datasets is not used.
Dictionary of gendered words:

This dictionary was constructed based on Danielle Sucher's "Jailbreak the Patriarchy" (https://github.com/DanielleSucher/Jailbreak-the-Patriarchy)

Methods

Axis 1: What Is Considered Funny

This part aims to understand which characteristics of a caption explain its perceived funniness.

Data preparation : Each caption is associated with:

definition of a new funny score : Built a new funniness metric combining (i) vote-type proportions (funny / somewhat_funny / not_funny) and (ii) caption popularity (number of votes), then scaled scores for comparability across contests.
A set of linguistic and semantic features, including: Length, Punctuation, Lexical diversity, Sentiment polarity (computed using TextBlob). take well into account the numbers of votes. The different group of funninness will then be compared in boxplot and their difference in mena will be assesd with a Student't t test.

Topic modelling : Applied BERTopic to captions; selected a suitable min_topic_size using quality criteria (e.g., coherence/diversity/silhouette/outlier-rate trade-offs), then compared humor-score distributions across topics (density/boxplot views + distribution tests). Long-tail distribution effect: Quantified whether topics were over-represented among top captions using a percentile stratification + enrichment score approach.

Axis 2: Professions in Jokes - Dynamics of Humour and Work

This section looks at frequency, distribution of funniness scores of occupations and occupational categories, performs topic and sentiment analysis of captions with occupations.

Data Preparation

Created comprehensive occupations list
Built TF-IDF matrix
Extracted occupation mentions from the dataset with attributes like number of occurences, contest ids, funniness scores.

Analysis

Identified where occupations/categories appear, their frequency, and compared difference in funniness scores (statistical tests)
Performed topic modelling with BERTopic on captions with occupations.
Performed sentiment analysis with VADER on captions with different categories of occupations and compared them.

Visualisations

Bar charts for frequency of occupations topics, median funniness scores.
Box plots for category distributions.
treemaps topics co-occuring with occupations
Histograms for funniness score/sentiment distribution
Heatmaps for statistical test results.

Axis 3: Gender Roles

This section investigates how men and women are depicted in cartoons and captions, the language patterns associated with each gender, and how audience responses relate to these depictions.

Data Preparation

Gender annotation: Identify men, women, or gender-neutral characters in cartoons and captions using a gendered dictionary.

Analysis

Language Patterns: Analyze word usage by generating word clouds, look at topics using BERTopic
Audience Response: Compare the funny score of each gender of the whole dataset, then only at the top and worst captions. Use statistical testing to support our hypothesis.

Visualizations

Bar charts for gender frequencies and mentions.
Word clouds for words occurences
Treemap for topics
Histogram for distribution of funny score

Website

URL : https://adacore42-website2025.streamlit.app/

Members contribution

General Statistical necessities: Cyrielle
Axis 1: Katia & Cyrielle
Axis 2: Andras
Axis 3: Amélie
Data preprocessing & website: Dominic & Katia
Datastory : All members wrote the part of the website that corresponds to their analysis.

Sources

[1] Powell, C. (1988). A Phenomenological Analysis of Humour in Society. In: Powell, C., Paton, G.E.C. (eds) Humour in Society. Palgrave Macmillan, London. https://doi.org/10.1007/978-1-349-19193-2_5

[2] Palagi, E., Caruana, F., & de Waal, F. B. M. (2022). The naturalistic approach to laughter in humans and other animals: Towards a unified theory. Philosophical Transactions of the Royal Society B: Biological Sciences, 377(1863), 20210175. https://doi.org/10.1098/rstb.2021.0175

Name		Name	Last commit message	Last commit date
Latest commit History 538 Commits
.vscode		.vscode
_ ChangeLog		_ ChangeLog
_LatexRepports		_LatexRepports
_Other		_Other
_webappp		_webappp
data		data
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
pip_requirements.txt		pip_requirements.txt
repo_structure.txt		repo_structure.txt
results.ipynb		results.ipynb
results_MS2.ipynb		results_MS2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Humor as a Mirror: The New Yorker Captions as Reflections of Society and Stereotypes

Abstract

Research Questions

Additional Datasets

Methods

Axis 1: What Is Considered Funny

Axis 2: Professions in Jokes - Dynamics of Humour and Work

Axis 3: Gender Roles

Website

Members contribution

Sources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Humor as a Mirror: The New Yorker Captions as Reflections of Society and Stereotypes

Abstract

Research Questions

Additional Datasets

Methods

Axis 1: What Is Considered Funny

Axis 2: Professions in Jokes - Dynamics of Humour and Work

Axis 3: Gender Roles

Website

Members contribution

Sources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages