Skip to content

Week 8 Memo #14

@ShiyangLai

Description

@ShiyangLai

Post a memo that explores the readings and class topics relevant to your final project by Thursday, Feb 26, 11:59 PM. The memo should be 300–500 words and include the following: (1) state your research question (for the imagined final project) succinctly in a single sentence at the beginning (this can and should evolve across the weeks of the quarter as the project becomes more concrete); (2) propose a research design that helps to address that question, and not proposed in any prior memo, informed by at least one of the required readings and one of the supplemental readings from this week; (3) a visual figure or diagram that draws upon pilot (nonhallucinated) data, simulated data with defensible assumptions, or provides a clear conceptual illustration that persuades the reader (aka James and TAs) of the appropriateness and fruitfulness of the design to address your question. You will then pilot this design as the final question in the Week 8 Homework due the following Wednesday.

By 10AM Friday, each student will up-vote (“thumbs up”) what they think are the five most interesting memos for the week!

Required
AI Foundations: “Convolutional Neural Networks” and “Diffusion Models” in Deep Learning: Foundations and Concepts, chapters 10 and 20.
“REACT: Synergizing Reasoning and Acting in Large Language Models.” Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao. 2025.
AI Designs: “GPT-4V(ision) System Card.” OpenAI. 2023.
“Embodied Task Planning with Large Language Models.” Song, C. H., Wu, J., Washington, C., et al. arXiv. 2023.
Social Designs: “Machine Learning as a Tool for Hypothesis Generation”, Jens Ludwig, Sendhil Mullainathan. The Quarterly Journal of Economics 2024.
“Age and gender distortion in online media and large language models”. Douglas Guilbeault, Solène Delecourt & Bhargav Srinivasa Desikan. 2025.

Supplemental
AI Foundations: “Deep Computer Vision Using Convolutional Neural Networks”, “Representation Learning and Generative Learning Using Autoencoders and GANs”, Hands-On Machine Learning with Scikit-Learn, Keras & Tensorflow, chapters 14, 17.
“Multi-modal Transformers”, Deep Learning: Foundations and Concepts, chapter section 12.4.
AI Perspectives: “Explainable Deep Learning: Concepts, Methods, and New Developments” by Wojciech Samek in Explainable Deep Learning. 2023.
AI Designs: “Guiding a Diffusion Model with a Bad Version of Itself.” Karras et al, NeurIPS. 2024.
“Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction.” 2024. Tian et al. NeurIPS.
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes”. 2020.
Social Designs: “Online images amplify gender bias,” Guilbeault, Douglas, Solène Delecourt, Tasker Hull, Bhargav Srinivasa Desikan, Mark Chu, and Ethan Nadler. Nature. 2024.
“Sixteen facial expressions occur in similar contexts worldwide” 2020.
Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States” 2017.
Computer vision uncovers predictors of physical urban change.” 2017.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions