You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+23-6Lines changed: 23 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,18 +15,31 @@ There are two objectives for this syllabus:
15
15
1. To ensure students have a comprehensive grasp of a data-driven research process. *Data as a Science* guides learners to confidence in the ethics, curation, analysis, and presentation of data, integrating each of these topics into each lesson.
16
16
2. To support the growing desire for universities around the world, but especially in emerging-market countries, to offer Data Science degree courses, by providing a free, openly-licenced core curriculum for adoption and adaptation by their degree programs.
17
17
18
-
## Lesson structure and approach
18
+
## Pedagogy
19
19
20
-
The course is based on the [Sloyd model](https://en.wikipedia.org/wiki/Sloyd) of technical training. Each lesson is discrete, building on the previous lesson, and provides a functional and holistic understanding of the scientific method as it applies to data. It is not about learning an algorithm and applying it to abstract, arbitrary data.
20
+
The course is based on the [Sloyd model](https://en.wikipedia.org/wiki/Sloyd) of technical training. Each lesson is discrete, building on the previous lesson, and provides a functional and holistic understanding of the scientific method as it applies to data. It is not about learning an algorithm and applying it to abstract, arbitrary data. The course has the objective of training complete data scientists, you will learn how research works and apply tools to a specific case-study.
21
21
22
22
Each lesson starts with a research question, and progresses by teaching a complete, and practical, set of skills allowing students to learn at their own pace and in an order which suites their current understanding. Case-studies and tutorials are drawn from public health, economics and social issues, and the course is accessible to anyone with an interest in data. Course materials, case studies and guided tutorials are presented in Jupyter Notebooks permitting learners to test running code and gain hands-on understanding of the techniques discussed.
23
23
24
-
Each research question is guided by the following four topics:
24
+
### Lesson structure and approach
25
25
26
-
-**Ethics**: determining the social and behavioural challenges posed by a research question;
26
+
Each lesson is guided by the following four topics:
27
+
28
+
-**Ethics**: determine the social and behavioural challenges posed by a research question;
27
29
-**Curation**: establish the research requirements for data collection and management;
28
-
-**Analysis**: Investigate, explore and analyse research data;
29
-
-**Presentation**: Prepare and present the results of analysis to promote a response;
30
+
-**Analysis**: investigate, explore and analyse research data;
31
+
-**Presentation**: prepare and present the results of analysis to promote a response;
32
+
33
+
### Case-studies: review and replicate
34
+
35
+
Science is a set of defined methods that stands up to scrutiny, supports replication, and is supported by ethical measurement data acquired during the study process. The way to gain confidence in these methods is to review the work of others.
36
+
37
+
Each lesson will guide you through review of published scholarly work in the following ways:
38
+
39
+
-**Review**: apply learned techniques to open access published research, and review and reflect on the methodology, analysis and results presented;
40
+
-**Replication**: using source- or synthetic data, reproduce the methodology used in open access published research to test whether claimed analysis and results are replicable;
41
+
42
+
Synthetic data will include lessons in dependent randomisation, as well as agent-based modelling.
30
43
31
44
On completion of each lesson, students gain useful and meaningful skills, and are not left stranded. This means that even partial completion of the material permits students to be productive members of a research team. The first lesson will ensure students can become professional data wranglers, and – on completion of the first ten lessons – graduates will be capable of taking on a responsible data research role.
32
45
@@ -73,6 +86,10 @@ My name is [Gavin Chait](https://gavinchait.com), and I am an independent data s
73
86
74
87
I have extensive experience in leading research projects, implementing open source software initiatives, and developing and leading seminars and workshops. I have taught for 25 years, including for undergraduates, adult education, and technical and analytical teaching at all levels.
75
88
89
+
This pedagogy and syllabus structure was developed with support from the Gates Foundation and WHO. Initial research into the need for education capacity building arose as a result of research supported by the Hewlett Foundation, Wellcome Trust and Public Health Research Data Forum.
90
+
91
+
> Chait, Gavin; Sujith, Eramangalath; Grzywinska, Dominika; Wainwright, Mark (2018): Supporting capacity and skills development for public health data research management in low- and medium income countries. Wellcome Trust. Journal contribution. https://doi.org/10.6084/m9.figshare.6087161.v1
92
+
76
93
## Licensing and release
77
94
78
95
Course content, materials and approach are copyright Gavin Chait, and released under both the [Creative Commons Attribution-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-sa/4.0/) and the [GNU Affero General Public License](https://www.gnu.org/licenses/agpl-3.0.en.html) licences.
0 commit comments