contrib — June 6, 2022 dataset
·
38705 commits
to main
since this release
This dataset consolidates the contractual documents of 285 service providers, in all their versions that were accessible online between October 3, 2012 and June 6, 2022.
This dataset is tailored for datascientists and other analysts. You can also explore all these versions interactively on https://github.com/OpenTermsArchive/contrib-versions.
It has been generated with Open Terms Archive.
Dataset format
This dataset represents each version of a document as a separate Markdown file, nested in a directory with the name of the service provider and in a directory with the name of the document type. The filesystem layout will look like below.
├ README.md
├┬ Service provider 1 (e.g. Facebook)
│├┬ Document type 1 (e.g. Terms of Service)
││├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-08-01T01-03-12Z.md)
┆┆┆
││└ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-10-03T08-12-25Z.md)
┆┆
│└┬ Document type X (e.g. Privacy Policy)
│ ├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-05-02T03-02-15Z.md)
┆ ┆
│ └ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-11-14T12-36-45Z.md)
┆
└┬ Service provider Y (e.g. Google)
├┬ Document type 1 (e.g. Developer Terms)
│├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2019-03-12T04-18-22Z.md)
┆┆
│└ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-12-04T22-47-05Z.md)
└┬ Document type Z (e.g. Privacy Policy)
┆
├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-05-02T03-02-15Z.md)
┆
└ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-11-14T12-36-45Z.md)
License
This dataset is made available under an Open Database (OdBL) License by Open Terms Archive Contributors.