11---
2- title : " PSYC 51.17 : Models of Language and Conversation "
2+ title : " PSYC 51.07 : Models of Language and Communication "
33geometry : margin=1in
44header-includes :
55 - \usepackage{fontspec}
66 - \usepackage{booktabs}
7- - \setmainfont{Berkeley Mono}
7+ - \directlua{luaotfload.add_fallback("emojifallback", {"NotoColorEmoji:mode=harf"})}
8+ - \defaultfontfeatures{RawFeature={fallback=emojifallback}}
9+ - \setmainfont{Fira Code}
810output : pdf
911---
1012
@@ -109,22 +111,26 @@ We strive to create an inclusive learning environment where all students feel su
109111### Week 1: Introduction & String Manipulation (January 5--9)
110112
111113** Monday, January 5** (Lecture 1): Course Introduction, Is ChatGPT Conscious?
114+
112115 - Topics: Course overview, capabilities of LLMs, consciousness debate
113116 - Discussion: What is consciousness? Can machines be conscious?
114117 - Reading: [ Fedorenko et al. (2024)] ( https://www.nature.com/articles/s41593-024-01711-5 ) ; [ Schrimpf et al. (2021)] ( https://www.pnas.org/doi/10.1073/pnas.2105646118 )
115118 - Slides: [ \href{https://contextlab.github.io/llm-course/slides/week1/lecture1.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides/week1/lecture1.html}{HTML} ]
116119
117120** Wednesday, January 7** (Lecture 2): Pattern Matching & ELIZA
121+
118122 - Topics: String operations in Python, regular expressions, pattern matching
119123 - Reading: [ Weizenbaum (1966)] ( https://dl.acm.org/doi/10.1145/365153.365168 )
120124 - Slides: [ \href{https://contextlab.github.io/llm-course/slides/week1/lecture2.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides/week1/lecture2.html}{HTML} ]
121125
122126** Thursday, January 8** (X-hour 1): ELIZA Deep Dive
127+
123128 - Topics: Extended discussion of pattern matching, implementation strategies
124129 - Hands-on: Start Assignment 1
125130 - Demo: [ \href{https://contextlab.github.io/llm-course/slides/week1/xhour_eliza_demo.html}{Interactive Notebook}]
126131
127132** Friday, January 9** (Lecture 3): ELIZA Implementation & The ELIZA Effect
133+
128134 - Topics: Implementing ELIZA from scratch, psychological implications
129135 - ** 📝 Assignment 1 Released:** [ \href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%201\%3A\%20ELIZA/README.md}{Building the ELIZA Chatbot}]
130136 - Reading: [ Natale (2021)] ( https://www.tandfonline.com/doi/full/10.1080/24701475.2020.1814847 )
@@ -133,21 +139,25 @@ We strive to create an inclusive learning environment where all students feel su
133139### Week 2: Computational Linguistics (January 12--16)
134140
135141** Monday, January 12** (Lecture 4): Data Cleaning & Preprocessing
142+
136143 - Topics: Web scraping with Beautiful Soup, data cleaning, text normalization
137144 - Reading: HuggingFace NLP Course Chapter 2
138145 - Slides: [ \href{https://contextlab.github.io/llm-course/slides/week2/lecture4.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides/week2/lecture4.html}{HTML} ]
139146
140147** Wednesday, January 14** (Lecture 5): Tokenization
148+
141149 - Topics: Byte-Pair Encoding (BPE), WordPiece, SentencePiece
142150 - Reading: [ Sennrich et al. (2016)] ( https://aclanthology.org/P16-1162/ ) ; [ Kudo & Richardson (2018)] ( https://aclanthology.org/D18-2012/ )
143151 - Slides: [ \href{https://contextlab.github.io/llm-course/slides/week2/lecture5.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides/week2/lecture5.html}{HTML} ]
144152
145153** Thursday, January 15** (X-hour 2): Text Classification Workshop
154+
146155 - Topics: Building classifiers, feature engineering for text
147156 - Hands-on: Explore different classification approaches
148157 - Demo: [ \href{https://contextlab.github.io/llm-course/slides/week2/xhour_classification_demo.html}{Interactive Notebook}]
149158
150159** Friday, January 16** (Lecture 6): POS Tagging & Sentiment Analysis
160+
151161 - Topics: Part-of-speech tagging, named entity recognition, sentiment analysis
152162 - ** 📝 Assignment 2 Released:** [ \href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%202\%3A\%20SPAM\%20classifier/README.md}{SPAM Classifier}]
153163 - ** ✅ Assignment 1 Due**
@@ -158,16 +168,19 @@ We strive to create an inclusive learning environment where all students feel su
158168** Monday, January 19** : Martin Luther King Jr. Day (No Class)
159169
160170** Wednesday, January 21** (Lecture 7): Classic Embeddings
171+
161172 - Topics: Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA)
162173 - Reading: [ Landauer & Dumais (1997)] ( https://psycnet.apa.org/record/1997-02478-006 ) ; [ Blei et al. (2003)] ( https://www.jmlr.org/papers/v3/blei03a )
163174 - Slides: [ \href{https://contextlab.github.io/llm-course/slides/week3/lecture7.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides/week3/lecture7.html}{HTML} ]
164175
165176** Thursday, January 22** (X-hour 3): Embeddings Workshop
177+
166178 - Topics: Implementing classic embeddings (LSA, LDA)
167179 - Hands-on: Compare embedding methods on real data
168180 - Demo: [ \href{https://contextlab.github.io/llm-course/slides/week3/xhour_embeddings_demo.html}{Interactive Notebook}]
169181
170182** Friday, January 23** (Lecture 8): Word Embeddings
183+
171184 - Topics: Word2Vec (CBOW and Skip-gram), GloVe, FastText
172185 - ** 📝 Assignment 3 Released:** [ \href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%203\%3A\%20Wikipedia/README.md}{Wikipedia Embeddings Comparison}]
173186 - ** ✅ Assignment 2 Due**
@@ -177,71 +190,83 @@ We strive to create an inclusive learning environment where all students feel su
177190### Week 4: Text Embeddings II (January 26--30)
178191
179192** Monday, January 26** (Lecture 9): Contextual Embeddings
193+
180194 - Topics: ELMo, Universal Sentence Encoder, BERT embeddings
181195 - Reading: [ Peters et al. (2018)] ( https://aclanthology.org/N18-1202/ ) ; [ Cer et al. (2018)] ( https://arxiv.org/abs/1803.11175 )
182- - Slides: [ \href{https://contextlab.github.io/llm-course/week3-4/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week3-4/lecture.html}{HTML} ]
196+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week3-4/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week3-4/lecture.html}{HTML} ]
183197
184198** Wednesday, January 28** (Lecture 10): Dimensionality Reduction
199+
185200 - Topics: PCA, t-SNE, UMAP for visualizing embeddings
186201 - Reading: [ McInnes et al. (2018)] ( https://arxiv.org/abs/1802.03426 )
187- - Slides: [ \href{https://contextlab.github.io/llm-course/week3-4/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week3-4/lecture.html}{HTML} ]
202+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week3-4/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week3-4/lecture.html}{HTML} ]
188203
189204** Friday, January 30** (Lecture 11): Cognitive Models of Semantic Representation
205+
190206 - Topics: Distributional semantics, neural representation of meaning
191207 - Reading: [ Anderson et al. (2016)] ( https://www.jneurosci.org/content/36/45/11444 )
192- - Slides: [ \href{https://contextlab.github.io/llm-course/week3-4/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week3-4/lecture.html}{HTML} ]
208+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week3-4/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week3-4/lecture.html}{HTML} ]
193209
194210### Week 5: Transformers & Attention (February 2--6)
195211
196212** Monday, February 2** (Lecture 12): Attention Mechanisms
213+
197214 - Topics: Sequence-to-sequence models, attention mechanism fundamentals
198215 - ** ✅ Assignment 3 Due**
199216 - Reading: [ Bahdanau et al. (2015)] ( https://arxiv.org/abs/1409.0473 ) ; [ Vaswani et al. (2017)] ( https://arxiv.org/abs/1706.03762 )
200- - Slides: [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML} ]
217+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week5-6/lecture.html}{HTML} ]
201218
202219** Wednesday, February 4** (Lecture 13): Transformer Architecture
220+
203221 - Topics: Multi-head attention, positional encoding, transformer blocks
204222 - Reading: [ Vaswani et al. (2017)] ( https://arxiv.org/abs/1706.03762 ) ; HuggingFace NLP Course Chapter 3
205- - Slides: [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML} ]
223+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week5-6/lecture.html}{HTML} ]
206224
207225** Friday, February 6** (Lecture 14): Training Transformers
226+
208227 - Topics: Pre-training objectives, masked language modeling, next token prediction
209228 - ** 📝 Assignment 4 Released:** [ \href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%204\%3A\%20Customer\%20Service\%20Chatbot/README.md}{Context-Aware Customer Service Chatbot}]
210- - Slides: [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML} ]
229+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week5-6/lecture.html}{HTML} ]
211230
212231### Week 6: Encoder Models (February 9--13)
213232
214233** Monday, February 9** (Lecture 15): BERT Deep Dive
234+
215235 - Topics: BERT architecture, bidirectional pre-training, fine-tuning
216236 - Reading: [ Devlin et al. (2019)] ( https://aclanthology.org/N19-1423/ ) ; HuggingFace NLP Course Chapter 4
217- - Slides: [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML} ]
237+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week5-6/lecture.html}{HTML} ]
218238
219239** Wednesday, February 11** (Lecture 16): BERT Variants
240+
220241 - Topics: RoBERTa, ALBERT, DistilBERT, and other encoder models
221242 - Reading: [ Liu et al. (2019)] ( https://arxiv.org/abs/1907.11692 ) ; [ Sanh et al. (2019)] ( https://arxiv.org/abs/1910.01108 )
222- - Slides: [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML} ]
243+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week5-6/lecture.html}{HTML} ]
223244
224245** Friday, February 13** (Lecture 17): Applications of Encoder Models
246+
225247 - Topics: Classification, NER, question answering with BERT
226248 - ** ✅ Assignment 4 Due**
227- - Slides: [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML} ]
249+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week5-6/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week5-6/lecture.html}{HTML} ]
228250
229251### Week 7: Decoder Models & GPT (February 16--20)
230252
231253** Monday, February 16** (Lecture 18): GPT Architecture
254+
232255 - Topics: Autoregressive language models, GPT-1 and GPT-2
233256 - Reading: [ Radford et al. (2018)] ( https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf ) ; [ Radford et al. (2019)] ( https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf )
234- - Slides: [ \href{https://contextlab.github.io/llm-course/week7/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week7/lecture.html}{HTML} ]
257+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week7/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week7/lecture.html}{HTML} ]
235258
236259** Wednesday, February 18** (Lecture 19): Scaling Up to GPT-3 and Beyond
260+
237261 - Topics: GPT-3, in-context learning, few-shot prompting, GPT-4 and Claude
238262 - ** 📝 Assignment 5 Released:** [ \href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%205\%3A\%20GPT/README.md}{Build and Train a GPT Model}]
239263 - Reading: [ Brown et al. (2020)] ( https://arxiv.org/abs/2005.14165 ) ; [ OpenAI (2023)] ( https://arxiv.org/abs/2303.08774 )
240- - Slides: [ \href{https://contextlab.github.io/llm-course/week7/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week7/lecture.html}{HTML} ]
264+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week7/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week7/lecture.html}{HTML} ]
241265
242266** Friday, February 20** (Lecture 20): Implementing GPT from Scratch
267+
243268 - Topics: Building GPT architecture, training considerations
244- - Slides: [ \href{https://contextlab.github.io/llm-course/week7/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week7/lecture.html}{HTML} ]
269+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week7/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week7/lecture.html}{HTML} ]
245270
246271### Week 8: No Classes (February 23--27)
247272
@@ -250,24 +275,28 @@ We strive to create an inclusive learning environment where all students feel su
250275### Week 9: Advanced Topics (March 2--6)
251276
252277** Monday, March 2** (Lecture 21): Retrieval Augmented Generation (RAG)
278+
253279 - Topics: Vector databases, retrieval mechanisms, RAG architectures
254280 - ** ✅ Assignment 5 Due**
255281 - Reading: [ Lewis et al. (2020)] ( https://arxiv.org/abs/2005.11401 )
256- - Slides: [ \href{https://contextlab.github.io/llm-course/week9/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week9/lecture.html}{HTML} ]
282+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week9/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week9/lecture.html}{HTML} ]
257283
258284** Wednesday, March 4** (Lecture 22): Mixture of Experts & Efficiency
285+
259286 - Topics: MoE architectures, model compression, distillation
260287 - Reading: [ Fedus et al. (2022)] ( https://arxiv.org/abs/2101.03961 ) ; [ Jiang et al. (2024)] ( https://arxiv.org/abs/2401.04088 )
261- - Slides: [ \href{https://contextlab.github.io/llm-course/week9/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week9/lecture.html}{HTML} ]
288+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week9/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week9/lecture.html}{HTML} ]
262289
263290** Friday, March 6** (Lecture 23): Ethics, Bias, and Safety
291+
264292 - Topics: Bias in LLMs, alignment, safety considerations
265293 - Reading: [ Bender et al. (2021)] ( https://dl.acm.org/doi/10.1145/3442188.3445922 )
266- - Slides: [ \href{https://contextlab.github.io/llm-course/week9/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/week9/lecture.html}{HTML} ]
294+ - Slides: [ \href{https://contextlab.github.io/llm-course/slides/ week9/lecture.pdf}{PDF} ] [ \href{https://contextlab.github.io/llm-course/slides /week9/lecture.html}{HTML} ]
267295
268296### Week 10: Final Projects (March 9)
269297
270298** Monday, March 9** (Lecture 24): Final Project Presentations & Wrap-up
299+
271300 - Final project presentations (all teams)
272301 - Course wrap-up and reflections
273302 - Last day of classes
0 commit comments