Commit 2fae50b

Merge pull request #23 from ContextLab/fix/slides-refactoring-issue-17
Fix/slides refactoring issue 17
2 parents 4e0c42b + 42c50c3 commit 2fae50b

118 files changed: +55138 -2972 lines changed

.github/workflows/build-slides.yml

Lines changed: 166 additions & 134 deletions
Large diffs are not rendered by default.

.gitignore

Lines changed: 13 additions & 0 deletions
@@ -172,3 +172,16 @@ demos/11-analogies/data/glove.6B.*.txt
 node_modules/
 package-lock.json
 demos/15-chatbot-evolution/alice-aiml-original/
+
+# LaTeX auxiliary files
+*.aux
+*.log
+*.nav
+*.out
+*.snm
+*.toc
+*.vrb
+*.fdb_latexmk
+*.fls
+*.synctex.gz
+luatex.*/

admin/DartmouthRuzicka-Bold.ttf

223 KB
Binary file not shown.
217 KB
Binary file not shown.

admin/DartmouthRuzicka-Regular.ttf

217 KB
Binary file not shown.
206 KB
Binary file not shown.

admin/syllabus.md

Lines changed: 46 additions & 17 deletions
@@ -1,10 +1,12 @@
 ---
-title: "PSYC 51.17: Models of Language and Conversation"
+title: "PSYC 51.07: Models of Language and Communication"
 geometry: margin=1in
 header-includes:
 - \usepackage{fontspec}
 - \usepackage{booktabs}
-- \setmainfont{Berkeley Mono}
+- \directlua{luaotfload.add_fallback("emojifallback", {"NotoColorEmoji:mode=harf"})}
+- \defaultfontfeatures{RawFeature={fallback=emojifallback}}
+- \setmainfont{Fira Code}
 output: pdf
 ---
 
@@ -109,22 +111,26 @@ We strive to create an inclusive learning environment where all students feel su
 ### Week 1: Introduction & String Manipulation (January 5--9)
 
 **Monday, January 5** (Lecture 1): Course Introduction, Is ChatGPT Conscious?
+
 - Topics: Course overview, capabilities of LLMs, consciousness debate
 - Discussion: What is consciousness? Can machines be conscious?
 - Reading: [Fedorenko et al. (2024)](https://www.nature.com/articles/s41593-024-01711-5); [Schrimpf et al. (2021)](https://www.pnas.org/doi/10.1073/pnas.2105646118)
 - Slides: [\href{https://contextlab.github.io/llm-course/slides/week1/lecture1.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week1/lecture1.html}{HTML}]
 
 **Wednesday, January 7** (Lecture 2): Pattern Matching & ELIZA
+
 - Topics: String operations in Python, regular expressions, pattern matching
 - Reading: [Weizenbaum (1966)](https://dl.acm.org/doi/10.1145/365153.365168)
 - Slides: [\href{https://contextlab.github.io/llm-course/slides/week1/lecture2.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week1/lecture2.html}{HTML}]
 
 **Thursday, January 8** (X-hour 1): ELIZA Deep Dive
+
 - Topics: Extended discussion of pattern matching, implementation strategies
 - Hands-on: Start Assignment 1
 - Demo: [\href{https://contextlab.github.io/llm-course/slides/week1/xhour_eliza_demo.html}{Interactive Notebook}]
 
 **Friday, January 9** (Lecture 3): ELIZA Implementation & The ELIZA Effect
+
 - Topics: Implementing ELIZA from scratch, psychological implications
 - **📝 Assignment 1 Released:** [\href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%201\%3A\%20ELIZA/README.md}{Building the ELIZA Chatbot}]
 - Reading: [Natale (2021)](https://www.tandfonline.com/doi/full/10.1080/24701475.2020.1814847)
@@ -133,21 +139,25 @@ We strive to create an inclusive learning environment where all students feel su
 ### Week 2: Computational Linguistics (January 12--16)
 
 **Monday, January 12** (Lecture 4): Data Cleaning & Preprocessing
+
 - Topics: Web scraping with Beautiful Soup, data cleaning, text normalization
 - Reading: HuggingFace NLP Course Chapter 2
 - Slides: [\href{https://contextlab.github.io/llm-course/slides/week2/lecture4.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week2/lecture4.html}{HTML}]
 
 **Wednesday, January 14** (Lecture 5): Tokenization
+
 - Topics: Byte-Pair Encoding (BPE), WordPiece, SentencePiece
 - Reading: [Sennrich et al. (2016)](https://aclanthology.org/P16-1162/); [Kudo & Richardson (2018)](https://aclanthology.org/D18-2012/)
 - Slides: [\href{https://contextlab.github.io/llm-course/slides/week2/lecture5.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week2/lecture5.html}{HTML}]
 
 **Thursday, January 15** (X-hour 2): Text Classification Workshop
+
 - Topics: Building classifiers, feature engineering for text
 - Hands-on: Explore different classification approaches
 - Demo: [\href{https://contextlab.github.io/llm-course/slides/week2/xhour_classification_demo.html}{Interactive Notebook}]
 
 **Friday, January 16** (Lecture 6): POS Tagging & Sentiment Analysis
+
 - Topics: Part-of-speech tagging, named entity recognition, sentiment analysis
 - **📝 Assignment 2 Released:** [\href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%202\%3A\%20SPAM\%20classifier/README.md}{SPAM Classifier}]
 - **✅ Assignment 1 Due**
@@ -158,16 +168,19 @@ We strive to create an inclusive learning environment where all students feel su
 **Monday, January 19**: Martin Luther King Jr. Day (No Class)
 
 **Wednesday, January 21** (Lecture 7): Classic Embeddings
+
 - Topics: Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA)
 - Reading: [Landauer & Dumais (1997)](https://psycnet.apa.org/record/1997-02478-006); [Blei et al. (2003)](https://www.jmlr.org/papers/v3/blei03a)
 - Slides: [\href{https://contextlab.github.io/llm-course/slides/week3/lecture7.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week3/lecture7.html}{HTML}]
 
 **Thursday, January 22** (X-hour 3): Embeddings Workshop
+
 - Topics: Implementing classic embeddings (LSA, LDA)
 - Hands-on: Compare embedding methods on real data
 - Demo: [\href{https://contextlab.github.io/llm-course/slides/week3/xhour_embeddings_demo.html}{Interactive Notebook}]
 
 **Friday, January 23** (Lecture 8): Word Embeddings
+
 - Topics: Word2Vec (CBOW and Skip-gram), GloVe, FastText
 - **📝 Assignment 3 Released:** [\href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%203\%3A\%20Wikipedia/README.md}{Wikipedia Embeddings Comparison}]
 - **✅ Assignment 2 Due**
@@ -177,71 +190,83 @@ We strive to create an inclusive learning environment where all students feel su
 ### Week 4: Text Embeddings II (January 26--30)
 
 **Monday, January 26** (Lecture 9): Contextual Embeddings
+
 - Topics: ELMo, Universal Sentence Encoder, BERT embeddings
 - Reading: [Peters et al. (2018)](https://aclanthology.org/N18-1202/); [Cer et al. (2018)](https://arxiv.org/abs/1803.11175)
-- Slides: [\href{https://contextlab.github.io/llm-course/week3-4/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week3-4/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week3-4/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week3-4/lecture.html}{HTML}]
 
 **Wednesday, January 28** (Lecture 10): Dimensionality Reduction
+
 - Topics: PCA, t-SNE, UMAP for visualizing embeddings
 - Reading: [McInnes et al. (2018)](https://arxiv.org/abs/1802.03426)
-- Slides: [\href{https://contextlab.github.io/llm-course/week3-4/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week3-4/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week3-4/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week3-4/lecture.html}{HTML}]
 
 **Friday, January 30** (Lecture 11): Cognitive Models of Semantic Representation
+
 - Topics: Distributional semantics, neural representation of meaning
 - Reading: [Anderson et al. (2016)](https://www.jneurosci.org/content/36/45/11444)
-- Slides: [\href{https://contextlab.github.io/llm-course/week3-4/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week3-4/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week3-4/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week3-4/lecture.html}{HTML}]
 
 ### Week 5: Transformers & Attention (February 2--6)
 
 **Monday, February 2** (Lecture 12): Attention Mechanisms
+
 - Topics: Sequence-to-sequence models, attention mechanism fundamentals
 - **✅ Assignment 3 Due**
 - Reading: [Bahdanau et al. (2015)](https://arxiv.org/abs/1409.0473); [Vaswani et al. (2017)](https://arxiv.org/abs/1706.03762)
-- Slides: [\href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.html}{HTML}]
 
 **Wednesday, February 4** (Lecture 13): Transformer Architecture
+
 - Topics: Multi-head attention, positional encoding, transformer blocks
 - Reading: [Vaswani et al. (2017)](https://arxiv.org/abs/1706.03762); HuggingFace NLP Course Chapter 3
-- Slides: [\href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.html}{HTML}]
 
 **Friday, February 6** (Lecture 14): Training Transformers
+
 - Topics: Pre-training objectives, masked language modeling, next token prediction
 - **📝 Assignment 4 Released:** [\href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%204\%3A\%20Customer\%20Service\%20Chatbot/README.md}{Context-Aware Customer Service Chatbot}]
-- Slides: [\href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.html}{HTML}]
 
 ### Week 6: Encoder Models (February 9--13)
 
 **Monday, February 9** (Lecture 15): BERT Deep Dive
+
 - Topics: BERT architecture, bidirectional pre-training, fine-tuning
 - Reading: [Devlin et al. (2019)](https://aclanthology.org/N19-1423/); HuggingFace NLP Course Chapter 4
-- Slides: [\href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.html}{HTML}]
 
 **Wednesday, February 11** (Lecture 16): BERT Variants
+
 - Topics: RoBERTa, ALBERT, DistilBERT, and other encoder models
 - Reading: [Liu et al. (2019)](https://arxiv.org/abs/1907.11692); [Sanh et al. (2019)](https://arxiv.org/abs/1910.01108)
-- Slides: [\href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.html}{HTML}]
 
 **Friday, February 13** (Lecture 17): Applications of Encoder Models
+
 - Topics: Classification, NER, question answering with BERT
 - **✅ Assignment 4 Due**
-- Slides: [\href{https://contextlab.github.io/llm-course/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week5-6/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week5-6/lecture.html}{HTML}]
 
 ### Week 7: Decoder Models & GPT (February 16--20)
 
 **Monday, February 16** (Lecture 18): GPT Architecture
+
 - Topics: Autoregressive language models, GPT-1 and GPT-2
 - Reading: [Radford et al. (2018)](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf); [Radford et al. (2019)](https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf)
-- Slides: [\href{https://contextlab.github.io/llm-course/week7/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week7/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week7/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week7/lecture.html}{HTML}]
 
 **Wednesday, February 18** (Lecture 19): Scaling Up to GPT-3 and Beyond
+
 - Topics: GPT-3, in-context learning, few-shot prompting, GPT-4 and Claude
 - **📝 Assignment 5 Released:** [\href{https://github.com/ContextLab/llm-course/blob/main/assignments/Assignment\%205\%3A\%20GPT/README.md}{Build and Train a GPT Model}]
 - Reading: [Brown et al. (2020)](https://arxiv.org/abs/2005.14165); [OpenAI (2023)](https://arxiv.org/abs/2303.08774)
-- Slides: [\href{https://contextlab.github.io/llm-course/week7/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week7/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week7/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week7/lecture.html}{HTML}]
 
 **Friday, February 20** (Lecture 20): Implementing GPT from Scratch
+
 - Topics: Building GPT architecture, training considerations
-- Slides: [\href{https://contextlab.github.io/llm-course/week7/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week7/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week7/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week7/lecture.html}{HTML}]
 
 ### Week 8: No Classes (February 23--27)
 
@@ -250,24 +275,28 @@ We strive to create an inclusive learning environment where all students feel su
 ### Week 9: Advanced Topics (March 2--6)
 
 **Monday, March 2** (Lecture 21): Retrieval Augmented Generation (RAG)
+
 - Topics: Vector databases, retrieval mechanisms, RAG architectures
 - **✅ Assignment 5 Due**
 - Reading: [Lewis et al. (2020)](https://arxiv.org/abs/2005.11401)
-- Slides: [\href{https://contextlab.github.io/llm-course/week9/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week9/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week9/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week9/lecture.html}{HTML}]
 
 **Wednesday, March 4** (Lecture 22): Mixture of Experts & Efficiency
+
 - Topics: MoE architectures, model compression, distillation
 - Reading: [Fedus et al. (2022)](https://arxiv.org/abs/2101.03961); [Jiang et al. (2024)](https://arxiv.org/abs/2401.04088)
-- Slides: [\href{https://contextlab.github.io/llm-course/week9/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week9/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week9/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week9/lecture.html}{HTML}]
 
 **Friday, March 6** (Lecture 23): Ethics, Bias, and Safety
+
 - Topics: Bias in LLMs, alignment, safety considerations
 - Reading: [Bender et al. (2021)](https://dl.acm.org/doi/10.1145/3442188.3445922)
-- Slides: [\href{https://contextlab.github.io/llm-course/week9/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/week9/lecture.html}{HTML}]
+- Slides: [\href{https://contextlab.github.io/llm-course/slides/week9/lecture.pdf}{PDF}][\href{https://contextlab.github.io/llm-course/slides/week9/lecture.html}{HTML}]
 
 ### Week 10: Final Projects (March 9)
 
 **Monday, March 9** (Lecture 24): Final Project Presentations & Wrap-up
+
 - Final project presentations (all teams)
 - Course wrap-up and reflections
 - Last day of classes
