Skip to content

Commit 9b0b44d

Browse files
trevorcampbelllheagyjoelostblom
authored
Front matter (#96)
* preface python * remove foreword * added editors page * fix appendix,references * added py acks * minor ed * Update editors.md add Lindsey bio * Add joels bio Co-authored-by: Lindsey Heagy <[email protected]> Co-authored-by: Joel Ostblom <[email protected]>
1 parent 2b75c31 commit 9b0b44d

File tree

8 files changed

+109
-24
lines changed

8 files changed

+109
-24
lines changed

source/_toc.yml

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,10 +4,12 @@ parts:
44
- caption: Front Matter
55
chapters:
66
- file: preface-text.md
7-
- file: foreword-text.md
7+
#- file: foreword.md
88
- file: acknowledgements.md
9+
- file: acknowledgements-python.md
910
- file: authors.md
10-
- file: setup.md
11+
- file: editors.md
12+
#- file: setup.md
1113
- caption: Chapters
1214
numbered: 3
1315
chapters:
@@ -24,4 +26,4 @@ parts:
2426
- caption: Appendix
2527
chapters:
2628
- file: appendixA.md
27-
- file: references.md
29+
#- file: references.md

source/acknowledgements-python.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
---
2+
jupytext:
3+
cell_metadata_filter: -all
4+
formats: py:percent,md:myst,ipynb
5+
text_representation:
6+
extension: .md
7+
format_name: myst
8+
format_version: 0.13
9+
jupytext_version: 1.13.8
10+
kernelspec:
11+
display_name: Python 3 (ipykernel)
12+
language: python
13+
name: python3
14+
---
15+
16+
# Acknowledgments for the Python Edition
17+
18+
We'd like to thank everyone that has contributed to the development of
19+
[*Data Science: A First Introduction (Python Edition)*](https://ubc-dsci.github.io/introduction-to-datascience-python/).
20+
This is an open source Python translation of the original [*Data Science: A First Introduction*](https://datasciencebook.ca);
21+
the original focused on the R programming language. Both of these books are
22+
used to teach DSCI 100, a new introductory data science course
23+
at the University of British Columbia (UBC).
24+
25+
We will finalize this acknowledgements section after the book is complete!

source/acknowledgements.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# Acknowledgments -- TBD
16+
# Acknowledgments
1717

1818
We'd like to thank everyone that has contributed to the development of
1919
[*Data Science: A First Introduction*](https://datasciencebook.ca).

source/appendixA.md

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,7 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# Appendix
17-
18-
# Downloading files from JupyterHub {#appendixA}
16+
# Downloading files from JupyterHub
1917

2018
This section will help you
2119
save your work from a JupyterHub web-based platform to your own computer.

source/authors.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# About the authors -- TBD
16+
# About the authors
1717

1818
**Tiffany Timbers** is an Assistant Professor of Teaching in the Department of
1919
Statistics and Co-Director for the Master of Data Science program (Vancouver

source/editors.md

Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
---
2+
jupytext:
3+
cell_metadata_filter: -all
4+
formats: py:percent,md:myst,ipynb
5+
text_representation:
6+
extension: .md
7+
format_name: myst
8+
format_version: 0.13
9+
jupytext_version: 1.13.8
10+
kernelspec:
11+
display_name: Python 3 (ipykernel)
12+
language: python
13+
name: python3
14+
---
15+
16+
# About the editors of the Python Edition
17+
18+
**Trevor Campbell** is an Assistant Professor in the Department of Statistics at
19+
the University of British Columbia. His research focuses on automated, scalable
20+
Bayesian inference algorithms, Bayesian nonparametrics, streaming data, and
21+
Bayesian theory. He was previously a postdoctoral associate advised by Tamara
22+
Broderick in the Computer Science and Artificial Intelligence Laboratory
23+
(CSAIL) and Institute for Data, Systems, and Society (IDSS) at MIT, a Ph.D.
24+
candidate under Jonathan How in the Laboratory for Information and Decision
25+
Systems (LIDS) at MIT, and before that he was in the Engineering Science
26+
program at the University of Toronto.
27+
28+
+++
29+
30+
**Lindsey Heagy** is an Assistant Professor in the Department of Earth, Ocean, and Atmospheric
31+
Sciences and director of the Geophysical Inversion Facility at the University of British Columbia.
32+
Her research combines computational methods in numerical simulations, inversions, and machine
33+
learning to answer questions about the subsurface of the Earth. Primary applications include
34+
mineral exploration, carbon sequestration, groundwater and environmental studies. She
35+
completed her BSc at the University of Alberta, her PhD at the University of British Columbia,
36+
and held a Postdoctoral research position at the University of California Berkeley prior to
37+
starting her current position at UBC.
38+
39+
+++
40+
41+
**Joel Ostblom** is an Assistant Professor of Teaching in the Department of
42+
Statistics at the University of British Columbia.
43+
During his PhD, Joel developed a passion for data science and reproducibility
44+
through the development of quantitative image analysis pipelines for studying
45+
stem cell and developmental biology. He has since co-created or lead the
46+
development of several courses and workshops at the University of Toronto and
47+
is now an assistant professor of teaching in the statistics department at the
48+
University of British Columbia. Joel cares deeply about spreading data literacy
49+
and excitement over programmatic data analysis, which is reflected in his
50+
contributions to open source projects and data science learning resources. You
51+
can read more about Joel on his [personal page](https://joelostblom.com/).

source/preface-text.md

Lines changed: 22 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -13,11 +13,16 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# Preface -- TBD
16+
# Preface
17+
18+
```{index} data science, auditable, reproducible
19+
```
20+
21+
1722

1823
This textbook aims to be an approachable introduction to the world of data science.
19-
In this book, we define **data science** \index{data science!definition} as the process of generating
20-
insight from data through **reproducible** \index{reproducible} and **auditable** \index{auditable} processes.
24+
In this book, we define **data science** as the process of generating
25+
insight from data through **reproducible** and **auditable** processes.
2126
If you analyze some data and give your analysis to a friend or colleague, they should
2227
be able to re-run the analysis from start to finish and get the same result you did (*reproducibility*).
2328
They should also be able to see and understand all the steps in the analysis, as well as the history of how
@@ -29,19 +34,17 @@ At a high level, in this book, you will learn how to
2934
(1) identify common problems in data science, and
3035
(2) solve those problems with reproducible and auditable workflows.
3136

32-
Figure \@ref(fig:img-chapter-overview) summarizes what you will learn in each chapter
33-
of this book.
34-
Throughout, you will learn how to use the R programming language [@Rlanguage] to perform
37+
{numref}`preface-overview-fig` summarizes what you will learn in each chapter
38+
of this book. Throughout, you will learn how to use the [Python programming language](https://www.python.org/) to perform
3539
all the tasks associated with data analysis. You will
36-
spend the first four chapters learning how to use R to load, clean, wrangle
40+
spend the first four chapters learning how to use Python to load, clean, wrangle
3741
(i.e., restructure the data into a usable format) and visualize data
3842
while answering descriptive and exploratory data analysis questions. In the next
3943
six chapters, you will learn how to answer predictive, exploratory, and inferential
4044
data analysis questions with common methods in data science, including
4145
classification, regression, clustering, and estimation.
4246
In the final chapters
43-
(\@ref(getting-started-with-jupyter)&ndash;\@ref(move-to-your-own-machine)),
44-
you will learn how to combine R code, formatted text, and images
47+
you will learn how to combine Python code, formatted text, and images
4548
in a single coherent document with Jupyter, use version control for
4649
collaboration, and install and configure the software needed for data science
4750
on your own computer. If you are reading this book as part of a course that you are
@@ -51,20 +54,26 @@ But if you are reading this independently, you may want to jump to these last th
5154
early before going on to make sure your computer is set up in such a way that you can
5255
try out the example code that we include throughout the book.
5356

54-
```{r img-chapter-overview, echo = FALSE, message = FALSE, warning = FALSE, fig.cap = "Where are we going?", out.width="100%", fig.retina = 2, fig.align = "center"}
55-
knitr::include_graphics("img/chapter_overview.jpeg")
57+
```{figure} img/chapter_overview.jpeg
58+
---
59+
height: 400px
60+
name: preface-overview-fig
61+
---
62+
Where are we going?
5663
```
5764

65+
66+
5867
Each chapter in the book has an accompanying worksheet that provides exercises
5968
to help you practice the concepts you will learn. We strongly recommend that you
6069
work through the worksheet when you finish reading each chapter
6170
before moving on to the next chapter. All of the worksheets
6271
are available at
63-
[https://github.com/UBC-DSCI/data-science-a-first-intro-worksheets#readme](https://github.com/UBC-DSCI/data-science-a-first-intro-worksheets#readme);
72+
[https://github.com/UBC-DSCI/data-science-a-first-intro-python-worksheets#readme](https://github.com/UBC-DSCI/data-science-a-first-intro-python-worksheets#readme);
6473
the "Exercises" section at the end of each chapter points you to the right worksheet for that chapter.
6574
For each worksheet, you can either launch an interactive version of the worksheet in your browser by clicking the "launch binder" button,
6675
or preview a non-interactive version of the worksheet by clicking "view worksheet."
6776
If you instead decide to download the worksheet and run it on your own machine,
6877
make sure to follow the instructions for computer setup
69-
found in Chapter \@ref(move-to-your-own-machine). This will ensure that the automated feedback
78+
found in the {ref}`move-to-your-own-machine` chapter. This will ensure that the automated feedback
7079
and guidance that the worksheets provide will function as intended.

source/references.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,6 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
`r if (knitr:::is_html_output()) '
17-
# References -- TBD
18-
'`
16+
# References
17+
18+

0 commit comments

Comments
 (0)