Skip to content

Commit 4cc00b0

Browse files
Merge pull request #265 from UBC-DSCI/additional-resources-py
Additional resources conversion to Py
2 parents bd60eb0 + 9821339 commit 4cc00b0

File tree

7 files changed

+27
-38
lines changed

7 files changed

+27
-38
lines changed

source/classification2.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2134,7 +2134,7 @@ and guidance that the worksheets provide will function as intended.
21342134
two chapters, you'll learn about another kind of predictive modeling setting,
21352135
so it might be worth visiting the website only after reading through those
21362136
chapters.
2137-
- *An Introduction to Statistical Learning* {cite:p}`james2013introduction` provides
2137+
- [*An Introduction to Statistical Learning*](https://www.statlearning.com/) {cite:p}`james2013introduction` provides
21382138
a great next stop in the process of
21392139
learning about classification. Chapter 4 discusses additional basic techniques
21402140
for classification that we do not cover, such as logistic regression, linear

source/inference.md

Lines changed: 1 addition & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1340,17 +1340,8 @@ and guidance that the worksheets provide will function as intended.
13401340

13411341
## Additional resources
13421342

1343-
- Chapters 7 to 10 of *Modern Dive* {cite:p}`moderndive` provide a great
1344-
next step in learning about inference. In particular, Chapters 7 and 8 cover
1345-
sampling and bootstrapping using `tidyverse` and `infer` in a slightly more
1346-
in-depth manner than the present chapter. Chapters 9 and 10 take the next step
1347-
beyond the scope of this chapter and begin to provide some of the initial
1348-
mathematical underpinnings of inference and more advanced applications of the
1349-
concept of inference in testing hypotheses and performing regression. This
1350-
material offers a great starting point for getting more into the technical side
1351-
of statistics.
13521343
- Chapters 4 to 7 of *OpenIntro Statistics* {cite:p}`openintro`
1353-
provide a good next step after *Modern Dive*. Although it is still certainly
1344+
provide a good next step in learning about inference. Although it is still certainly
13541345
an introductory text, things get a bit more mathematical here. Depending on
13551346
your background, you may actually want to start going through Chapters 1 to 3
13561347
first, where you will learn some fundamental concepts in probability theory.

source/reading.md

Lines changed: 8 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1566,15 +1566,14 @@ and guidance that the worksheets provide will function as intended.
15661566
## Additional resources
15671567

15681568
- The [`pandas` documentation](https://pandas.pydata.org/docs/getting_started/index.html)
1569-
provides the documentation for many of the reading functions we cover in this chapter.
1570-
It is where you should look if you want to learn more about the functions in this
1571-
chapter, the full set of arguments you can use, and other related functions.
1572-
The site also provides a very nice cheat sheet that summarizes many of the data
1573-
wrangling functions from this chapter.
1574-
- Sometimes you might run into data in such poor shape that none of the reading
1575-
functions we cover in this chapter work. In that case, you can consult the
1576-
[data loading chapter](https://wesmckinney.com/book/accessing-data.html#io_flat_files) from *Python for Data Analysis* {cite:p}`mckinney2012python`, which goes into a lot more detail about how Python parses
1577-
text from files into data frames.
1569+
provides the documentation for the functions we cover in this chapter.
1570+
It is where you should look if you want to learn more about these functions, the
1571+
full set of arguments you can use, and other related functions.
1572+
- Sometimes you might run into data in such poor shape that the reading
1573+
functions we cover in this chapter do not work. In that case, you can consult the
1574+
[data loading chapter](https://wesmckinney.com/book/accessing-data.html#io_flat_files)
1575+
from [*Python for Data Analysis*](https://wesmckinney.com/book/) {cite:p}`mckinney2012python`, which goes into a lot
1576+
more detail about how Python parses text from files into data frames.
15781577
- A [video](https://www.youtube.com/embed/ephId3mYu9o) from the Udacity
15791578
course *Linux Command Line Basics* provides a good explanation of absolute versus relative paths.
15801579
- If you read the subsection on obtaining data from the web via scraping and

source/regression2.md

Lines changed: 0 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1357,14 +1357,6 @@ and guidance that the worksheets provide will function as intended.
13571357
useful [tutorials](https://scikit-learn.org/stable/tutorial/index.html) and [an extensive list
13581358
of more advanced examples](https://scikit-learn.org/stable/auto_examples/index.html#general-examples)
13591359
that you can use to continue learning beyond the scope of this book.
1360-
- *Modern Dive* {cite:p}`moderndive` is another textbook that uses the
1361-
`tidyverse` / `tidymodels` framework. Chapter 6 complements the material in
1362-
the current chapter well; it covers some slightly more advanced concepts than
1363-
we do without getting mathematical. Give this chapter a read before moving on
1364-
to the next reference. It is also worth noting that this book takes a more
1365-
"explanatory" / "inferential" approach to regression in general (in Chapters 5,
1366-
6, and 10), which provides a nice complement to the predictive tack we take in
1367-
the present book.
13681360
- *An Introduction to Statistical Learning* {cite:p}`james2013introduction` provides
13691361
a great next stop in the process of
13701362
learning about regression. Chapter 3 covers linear regression at a slightly

source/version-control.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1170,7 +1170,7 @@ you can expand your knowledge through the resources listed below:
11701170
learning about Git and GitHub.
11711171
- [Good enough practices in scientific
11721172
computing](https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005510#sec014)
1173-
[@wilson2014best] provides more advice on useful workflows and "good enough"
1173+
{cite:p}`wilson2014best` provides more advice on useful workflows and "good enough"
11741174
practices in data analysis projects.
11751175
- In addition to [GitHub](https://github.com), there are other popular Git
11761176
repository hosting services such as [GitLab](https://gitlab.com) and
@@ -1181,3 +1181,11 @@ you can expand your knowledge through the resources listed below:
11811181
token](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/creating-a-personal-access-token)
11821182
is an excellent additional resource to consult if you need help
11831183
generating and using personal access tokens.
1184+
1185+
## References
1186+
1187+
+++
1188+
1189+
```{bibliography}
1190+
:filter: docname in docnames
1191+
```

source/viz.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2025,17 +2025,19 @@ and guidance that the worksheets provide will function as intended.
20252025

20262026
## Additional resources
20272027

2028-
- The [altair documentation](https://altair-viz.github.io/){cite:p}`altair` is
2028+
- The [altair documentation](https://altair-viz.github.io/) {cite:p}`altair` is
20292029
where you should look if you want to learn more about the functions in this
20302030
chapter, the full set of arguments you can use, and other related functions.
20312031
The site also provides a very nice cheat sheet that summarizes many of the data
20322032
wrangling functions from this chapter.
2033-
- The *Fundamentals of Data Visualization* {cite:p}`wilkeviz` has
2033+
- The [*Fundamentals of Data Visualization*](https://clauswilke.com/dataviz/) {cite:p}`wilkeviz` has
20342034
a wealth of information on designing effective visualizations. It is not
20352035
specific to any particular programming language or library. If you want to
20362036
improve your visualization skills, this is the next place to look.
2037-
- The [dates and times](https://wesmckinney.com/book/time-series.html){cite:p}`mckinney2012python`
2038-
chapter is where you should look if you want to learn about `date` and `time`, including how to create them, and how to use them to effectively handle durations, etc
2037+
- The [dates and times](https://wesmckinney.com/book/time-series.html) chapter
2038+
of [*Python for Data Analysis*](https://wesmckinney.com/book/) {cite:p}`mckinney2012python`
2039+
is where you should look if you want to learn about `date` and `time`, including
2040+
how to create them, and how to use them to effectively handle durations, etc
20392041

20402042
## References
20412043

source/wrangling.md

Lines changed: 2 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1900,9 +1900,7 @@ and guidance that the worksheets provide will function as intended.
19001900
- The [`pandas` package documentation](https://pandas.pydata.org/docs/reference/index.html) is
19011901
another resource to learn more about the functions in this
19021902
chapter, the full set of arguments you can use, and other related functions.
1903-
The site also provides a very nice cheat sheet that summarizes many of the
1904-
data wrangling functions from this chapter.
1905-
- *Python for Data Analysis* {cite:p}`mckinney2012python` has a few chapters related to
1903+
- [*Python for Data Analysis*](https://wesmckinney.com/book/) {cite:p}`mckinney2012python` has a few chapters related to
19061904
data wrangling that go into more depth than this book. For example, the
19071905
[data wrangling chapter](https://wesmckinney.com/book/data-wrangling.html) covers tidy data,
19081906
`melt` and `pivot`, but also covers missing values
@@ -1911,8 +1909,7 @@ and guidance that the worksheets provide will function as intended.
19111909
`groupby`, aggregating functions, `apply`, etc.
19121910
- You will occasionally encounter a case where you need to iterate over items
19131911
in a data frame, but none of the above functions are flexible enough to do
1914-
what you want. In that case, you may consider using [a for
1915-
loop](https://wesmckinney.com/book/python-basics.html#control_for) {cite:p}`mckinney2012python`.
1912+
what you want. In that case, you may consider using [a for loop](https://wesmckinney.com/book/python-basics.html#control_for) {cite:p}`mckinney2012python`.
19161913

19171914

19181915
## References

0 commit comments

Comments
 (0)