Skip to content

Commit 4cfb0ed

Browse files
committed
Added LLM resource.
1 parent 0282900 commit 4cfb0ed

File tree

2 files changed

+39
-8
lines changed

2 files changed

+39
-8
lines changed

slides/0a.qmd

Lines changed: 39 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -16,20 +16,23 @@ format:
1616
[github.com/jtkiley](https://github.com/jtkiley/)
1717
:::
1818

19+
1920
## Related
2021

2122
- CARMA 2020 (overlaps with this course): Introduction to Python and Content Analysis of Text. ([Github](https://github.com/jtkiley/2020_carma_python))
2223
- Seminar materials (overlaps with this course): Text Analysis: Planning to Publication. ([Github](https://github.com/jtkiley/text_seminar))
2324
- Text analysis and machine learning workshop at WU (Oct. 2018) and RSM (Oct. 2019).
2425
- AOM Big Data workshop with Tim Hannigan, Hovig Tchalian, and Laura Nelson. ([Github](https://github.com/jtkiley/curation_workshop))
2526

27+
2628
## Course Agenda
2729

2830
- Tools: Python, packages and environments
2931
- Basics: Python syntax and conventions, Jupyter Notebooks
3032
- Data handling and project planning
3133
- Data gathering and assembly
3234

35+
3336
# Overview
3437

3538
## Overview
@@ -38,6 +41,7 @@ format:
3841
- Resources for learning.
3942
- A brief R comparison.
4043

44+
4145
# What do I really need to know about Python?
4246

4347
## Why Python?
@@ -47,13 +51,15 @@ format:
4751
- Learning resources: wide popularity in academia and practice means that there are extensive resources.
4852
- Scalability: from your computer, to the cloud, to a computing cluster, you can use largely the same tools.
4953

54+
5055
## Python Fluency
5156

5257
- [Basics]{style="color:lightblue;"}
5358
- [Data Preparation]{style="color:lightgreen;"}
5459
- [Good-enough Programming]{style="color:lightyellow;"}
5560
- [Software Engineering]{style="color:pink;"}
5661

62+
5763
## [Basics]{style="color:lightblue;"}
5864

5965
- Skills
@@ -123,36 +129,55 @@ Comparison with Stata
123129
See more: [pandas documentation](https://pandas.pydata.org/docs/getting_started/comparison/comparison_with_stata.html)
124130
:::
125131

126-
## Stack Overflow
132+
133+
## Python for Data Analysis
127134

128135
::: columns
129136
::: {.column width="50%" #vcenter}
130-
Search for what you are trying to do, merging on multiple columns with different names, in this case.
137+
Wes McKinney is the creator of pandas and other open source projects.
131138
:::
132139

133140
::: {.column width="50%" #vcenter}
134-
![](_img/0a_so.png)
141+
![](_img/0a_book.png){width="80%"}
135142
:::
136143
:::
137144

138145
::: footer
139-
See more: [Stack Overflow](https://stackoverflow.com/questions/41815079/pandas-merge-join-two-data-frames-on-multiple-columns)
146+
For more: [Wes McKinney](https://wesmckinney.com/book/)
140147
:::
141148

142-
## Python for Data Analysis
149+
150+
## Large Language Models
143151

144152
::: columns
145153
::: {.column width="50%" #vcenter}
146-
Wes McKinney is the creator of pandas and other open source projects.
154+
- Ask it to explain code or a concept.
155+
- Ask it to rewrite code to be more efficient or readable.
156+
- Give it code and an error message, and ask it to fix the code.
157+
- **Careful:** Don't undercut your learning. LLMs are best when you already know the topic.
147158
:::
148159

149160
::: {.column width="50%" #vcenter}
150-
![](_img/0a_book.png){width="80%"}
161+
![](_img/0a_llm.png)
162+
:::
163+
:::
164+
165+
166+
## Stack Overflow
167+
168+
::: columns
169+
::: {.column width="50%" #vcenter}
170+
- Search for what you are trying to do, merging on multiple columns with different names, in this case.
171+
- These kinds of sites are getting displaced by LLMs.
172+
:::
173+
174+
::: {.column width="50%" #vcenter}
175+
![](_img/0a_so.png)
151176
:::
152177
:::
153178

154179
::: footer
155-
For more: [Wes McKinney](https://wesmckinney.com/book/)
180+
See more: [Stack Overflow](https://stackoverflow.com/questions/41815079/pandas-merge-join-two-data-frames-on-multiple-columns)
156181
:::
157182

158183

@@ -162,6 +187,7 @@ For more: [Wes McKinney](https://wesmckinney.com/book/)
162187
- Self-study tracks from my seminar. Includes resources for data handling, data retrieval, machine learning.
163188
- YouTube. Has many content creators, covering Python, data science, and software development.
164189

190+
165191
# Python and R
166192

167193
## What About R?
@@ -173,6 +199,7 @@ For more: [Wes McKinney](https://wesmckinney.com/book/)
173199
- Gather and prep data in Python.
174200
- If needed, use R for analyses.
175201

202+
176203
## Stack Overflow - Most Popular
177204

178205
![](_img/0a_r.png)
@@ -181,6 +208,7 @@ For more: [Wes McKinney](https://wesmckinney.com/book/)
181208
See more: [Stack Overflow](https://survey.stackoverflow.co/2023/#programming-scripting-and-markup-languages)
182209
:::
183210

211+
184212
## Stack Overflow - Most Desired
185213

186214
![](_img/0a_r2.png)
@@ -189,6 +217,7 @@ See more: [Stack Overflow](https://survey.stackoverflow.co/2023/#programming-scr
189217
See more: [Stack Overflow](https://survey.stackoverflow.co/2023/#section-admired-and-desired-programming-scripting-and-markup-languages)
190218
:::
191219

220+
192221
# Getting Started
193222

194223
## Getting Started
@@ -198,6 +227,7 @@ See more: [Stack Overflow](https://survey.stackoverflow.co/2023/#section-admired
198227
- VS Code, Docker, local container
199228
- You'll see me use both.
200229

230+
201231
# Hands on
202232

203233
## Summary
@@ -206,4 +236,5 @@ See more: [Stack Overflow](https://survey.stackoverflow.co/2023/#section-admired
206236
- Capturing all of our work in code that runs is a best practice that promotes reproducibility, and that helps us most of all.
207237
- We will start using our container in the next segment, so make sure it is set up and ready (or ask for help).
208238

239+
209240
# Break

slides/_img/0a_llm.png

438 KB
Loading

0 commit comments

Comments
 (0)