Code and Description for Visualizations for the Master in Computational Biology Statistic Data Analysis and Visualization course in 2020 Spain Mortality INE datasets(2022)
Authors : Lucía Martín Fernandez, Ana Solbas Casajús, Ángela Gómez Sacristán and Natalia García Sánchez
Data corresponds to the following datasets and come from the following corresponding sources:
| Filename | Description | Used in code | Source |
|---|---|---|---|
Provincias_datos.csv |
Mortality rate in Spanish provinces (2020) - manually preprocessed | BarplotsINEDeaths.Rmd, mortalityRatePerYear.Rmd, Spain_maps.Rmdlinks |
datos.gob.es |
crudeMortality.csv |
Crude mortality rate (per year) | BarplotsINEDeaths.Rmd, mortalityRatePerYear.Rmd, Spain_maps.Rmd links |
INE |
mortalityRate.csv |
Mortality rate per year, age and sex | mortalityRatePerYear.Rmd links |
|
causas_muerte.csv |
Death causes from 2010 to 2020 | mortalityRatePerYear.Rmd links |
INE |
ccaa_provincia.csv |
File mapping Autonomous Community to Province - manually preprocessed | BarplotsINEDeaths.Rmd links |
INE |
CCAA_DR_Mortality_Population.csv |
Crude and per 100,000 Mortality rates per Autonomous Community in 2020 | Spain_maps.Rmdlink |
crafted in BarplotsINEDeaths.Rmd |
In addition, there are three R markdown files that have been used to generate the visualizations for the assignment.
BarplotsINEDeaths.Rmd: script used to generate the barplot and boxplots charts.Spain_maps.Rmd: script used to generate the Spain map plots containing information about the 2020 death rates by province and Autonomous Community.mortalityRatePerYear.Rmd: script used to generate the time series plots for death rate and life expectancy in Spain as well as the death rate per age and sex in 2020 barplot.
Dependencies : mapSpain, reshape2, ggplot2, sf, dplyr, tidyr, tidyverse should be preinstalled in an R environment