Commit cb6e8e7

Adding license information for Openbookcorpus (#3525)
* Update README.md
* Update datasets/bookcorpusopen/README.md

Co-authored-by: Quentin Lhoest <[email protected]>
1 parent cd3ce34 commit cb6e8e7

1 file changed: +4 -2 lines changed
datasets/bookcorpusopen/README.md

Lines changed: 4 additions & 2 deletions
@@ -160,7 +160,9 @@ The data fields are the same among all splits.
 
 ### Licensing Information
 
-[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
+The books have been crawled from smashwords.com, see their [terms of service](https://www.smashwords.com/about/tos) for more information.
+
+A data sheet for this dataset has also been created and published in [Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus](https://arxiv.org/abs/2105.05241)
 
 ### Citation Information
 
@@ -178,4 +180,4 @@ The data fields are the same among all splits.
 
 ### Contributions
 
-Thanks to [@vblagoje](https://github.com/vblagoje) for adding this dataset.
+Thanks to [@vblagoje](https://github.com/vblagoje) for adding this dataset.
