Commit a4efeb3

Merge pull request #7 from jon-fuller-ukhsa/main
fix hyperlink
2 parents cb1e107 + 639dd3b commit a4efeb3

1 file changed

+1
-2
lines changed

README.md

Lines changed: 1 addition & 2 deletions
@@ -41,8 +41,7 @@ If you are on macOS please add the flag ```--platform=linux/amd64```
### The UK COVID-19 Vocal Audio Dataset
The full UK COVID-19 Vocal Audio Dataset is not publicly available as it is classed as 'Special Category Personal Data'. Access may be requested from UKHSA (DataAccess@ukhsa.gov.uk), and will be granted subject to approval and a data sharing contract. To learn how to apply for UKHSA data, visit:
-[https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukh]{https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukhsa-protected-data}
-
+[https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukhsa-protected-data](https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukhsa-protected-data)
We understand that this may not be practical for many users interested in our work, so we have created a new curated dataset classed as 'Open Access' data (it will be available through a download link that anyone can use, with no registration required). To achieve this, the 'sentence' modality has been removed, leaving the 'cough', 'three cough' and 'exhalation' modalities. In addition, to meet open access requirements, selected metadata attributes have been aggregated (so that no selection of attributes singles out a group of fewer than 3 individuals). This means that neither the 'sentence' modality results nor the creation of the train-test splits can be replicated with the open access data. This applies only to the open access version of the data; our full stack is replicable with the original dataset, which can be accessed by following the instructions above. We provide the train-test splits in _.csv_ form so that the machine learning experiments can be replicated with the open access data. The open access dataset has been created but is awaiting final UKHSA approval before we upload it to Zenodo.
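As a rough illustration of the small-group rule described above, the following sketch counts how many records share each combination of attribute values and flags combinations held by fewer than 3 records. It is not the procedure used to build the open access dataset: the function name `small_groups`, the column names `age_band` and `region`, and the example values are all hypothetical.

```python
from collections import Counter


def small_groups(rows, attributes, min_size=3):
    """Return attribute-value combinations shared by fewer than min_size rows."""
    counts = Counter(tuple(row[a] for a in attributes) for row in rows)
    return {combo: n for combo, n in counts.items() if n < min_size}


# Hypothetical metadata rows; column names and values are illustrative only.
rows = (
    [{"age_band": "18-44", "region": "North"}] * 3
    + [{"age_band": "45-64", "region": "South"}] * 2
    + [{"age_band": "65+", "region": "South"}]
)

# Two combinations are held by fewer than 3 records and would be flagged.
flagged = small_groups(rows, ["age_band", "region"])

# Aggregating the two sparse age bands into one coarser band (an assumed
# remedy, chosen only for this example) removes the small groups.
coarsened = [
    {**row, "age_band": "45+"} if row["age_band"] in ("45-64", "65+") else row
    for row in rows
]
remaining = small_groups(coarsened, ["age_band", "region"])
```

Here `flagged` holds the two sparse 'South' combinations, while `remaining` is empty once the age bands are merged. Real metadata would have more attributes, so every combination that a user could select on would need the same check.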
