@@ -89,9 +89,8 @@ the public domain of software.
8989Non-software licences
9090---------------------
9191
92- Open source software licences can also be used for works that are not software.
93- They are often also the best choice, especially if the works in question are
94- edited and versioned as source code.
92+ Open source software licences are often used for works that are not software.
93+ However, they are often not the best choice.
9594
9695Data, media, :abbr:`etc. (et cetera)`
9796~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -104,37 +103,49 @@ they are `not recommended for software
104103<https://creativecommons.org/faq/#can-i-apply-a-creative-commons-license-to-software>`_.
105104
106105.. tip::
107- The `DEAL consortium <https://deal-konsortium.de/en/>`_ recommends the CC BY
108- licence for Open Access publications, see also `Why CC BY is the Best Choice
109- for Open Access Publishing <https://deal-konsortium.de/en/why-ccby>`_.
110-
111- :abbr:`RADAR (Research Data Repository)`, a cross-disciplinary repository for
112- archiving and publishing research data, recommends only one of the `CC
113- licences
114- <https://radar.products.fiz-karlsruhe.de/en/radarfeatures/lizenzen-fuer-forschungsdaten#cc-licenses>`_.
115-
116- The `Open Knowledge Foundation <https://okfn.org/en/>`_ has also published a set
117- of `Open Data Commons <https://opendatacommons.org>`_ licences for
118- data/databases:
119-
120- `Open Data Commons Open Database License (ODbL) v1.0 <https://opendatacommons.org/licenses/odbl/1-0/>`_
121- Attribution and sharing under equal terms.
122- `Open Data Commons Attribution License (ODC-By) v1.0 <https://opendatacommons.org/licenses/by/1-0/>`_
123- Attribution.
124- `Open Data Commons Public Domain Dedication and License (PDDL) v1.0 <https://opendatacommons.org/licenses/pddl/1-0/>`_
125- The PDDL places the data in the public domain and waives all rights.
126-
127- `GovData <https://www.govdata.de>`_ has submitted the *Data Licence Germany* in two variants:
128-
129- * `Datenlizenz Deutschland – Namensnennung – Version 2.0
130- <https://www.govdata.de/dl-de/by-2-0>`_
131- * `Datenlizenz Deutschland – Zero – Version 2.0
132- <https://www.govdata.de/dl-de/zero-2-0>`_
133-
134- When using the `Community Data License Agreement – Permissive, Version 2.0 <https://cdla.dev/permissive-2-0/>`_ the copyright notices must be retained.
135-
136- Another possible licence for artistic works is the `Free Art License 1.3
137- <https://artlibre.org/licence/lal/en/>`_.
106+ * The `DEAL consortium <https://deal-konsortium.de/en/>`_ recommends the CC
107+ BY licence for Open Access publications of research results, see also `Why
108+ CC BY is the Best Choice for Open Access Publishing
109+ <https://deal-konsortium.de/en/why-ccby>`_.
110+
111+ * :abbr:`RADAR (Research Data Repository)`, a cross-disciplinary repository
112+ for archiving and publishing research data, recommends only one of the `CC
113+ licences
114+ <https://radar.products.fiz-karlsruhe.de/en/radarfeatures/lizenzen-fuer-forschungsdaten#cc-licenses>`_.
115+
116+ * The `Open Knowledge Foundation <https://okfn.org/en/>`_ has also published a
117+ set of `Open Data Commons <https://opendatacommons.org>`_ licences for
118+ data/databases:
119+
120+ `Open Data Commons Open Database License (ODbL) v1.0 <https://opendatacommons.org/licenses/odbl/1-0/>`_
121+ Attribution and sharing under equal terms.
122+ `Open Data Commons Attribution License (ODC-By) v1.0 <https://opendatacommons.org/licenses/by/1-0/>`_
123+ Attribution.
124+ `Open Data Commons Public Domain Dedication and License (PDDL) v1.0 <https://opendatacommons.org/licenses/pddl/1-0/>`_
125+ The PDDL places the data in the public domain and waives all rights.
126+
127+ * `GovData <https://www.govdata.de>`_ has submitted the *Data Licence Germany*
128+ in two variants:
129+
130+ * `Datenlizenz Deutschland – Namensnennung – Version 2.0
131+ <https://www.govdata.de/dl-de/by-2-0>`_
132+ * `Datenlizenz Deutschland – Zero – Version 2.0
133+ <https://www.govdata.de/dl-de/zero-2-0>`_
134+
135+ * The `Community Data License Agreement <https://cdla.dev>`_ can be used in four
136+ different ways:
137+
138+ * `Community Data License Agreement – Permissive, Version 2.0
139+ <https://cdla.dev/permissive-2-0/>`_
140+ * `Community Data License Agreement – Sharing, Version 1.0
141+ <https://cdla.dev/sharing-1-0/>`_
142+ * `Open Use of Data Agreement, Version 1.0
143+ <https://cdla.dev/open-use-of-data-agreement-v1-0/>`_
144+ * `Computational Use of Data Agreement, Version 1.0
145+ <https://cdla.dev/computational-use-of-data-agreement-v1-0/>`_
146+
147+ * Another possible licence for artistic works is the `Free Art License 1.3
148+ <https://artlibre.org/licence/lal/en/>`_.
138149
139150Machine learning models
140151~~~~~~~~~~~~~~~~~~~~~~~
@@ -164,19 +175,93 @@ been developed for a company or specific models:
164175* `BigScience OpenRAIL-M (Responsible AI License)
165176 <https://www.licenses.ai/blog/2022/8/26/bigscience-open-rail-m-license>`_
166177
167- RAIL-D
178+ There are other `Responsible AI Licenses (RAIL) <https://www.licenses.ai>`_
179+ with various restrictions on use:
180+
181+ OpenRAIL-D
168182 contains usage restrictions that only apply to the data.
169- RAIL-A
183+ OpenRAIL-A
170184 contains usage restrictions that only apply to the
171185 application/executability.
172- RAIL-M
186+ OpenRAIL-M
173187 contains usage restrictions that only apply to the model.
174- RAIL-S
188+
189+ .. seealso::
190+ `RAIL-M
191+ <https://www.licenses.ai/blog/2022/8/26/bigscience-open-rail-m-license>`_
192+
193+ OpenRAIL-S
175194 contains usage restrictions that only apply to the source code.
176195
196+ AI models that are licensed under an open source licence but whose training data
197+ and programs have **not** been published are not compliant with the `Debian
198+ Free Software Guidelines (DFSG)
199+ <https://de.wikipedia.org/wiki/Debian_Free_Software_Guidelines>`_, see also
200+ `Interpretation of DFSG on Artificial Intelligence (AI) Models
201+ <https://www.debian.org/vote/2025/vote_002>`_.
202+
203+ For the `Open Source Initiative (OSI) <https://opensource.org/>`__, the
204+ definition of open source AI goes far beyond the use of a model: it must also
205+ be possible to study how the model was created, and to modify the model and
206+ share it with others for any purpose. These four freedoms (use, study, modify
207+ and share) require
208+
209+ Open Data
210+ Sufficiently detailed information about the data used to train the system so
211+ that an essentially equivalent system can be built
212+ Open Code
213+ The complete source code used to train and operate the system under licences
214+ approved by the OSI
215+ Open Weights
216+ Model parameters, such as weights or other configuration settings under
217+ licences approved by the OSI
218+
219+ Accordingly, the OSI developed `OSAID 1.0
220+ <https://opensource.org/ai/open-source-ai-definition>`_, which the
221+ following models comply with:
222+
223+ * EleutherAI: `Pythia <https://github.com/EleutherAI/pythia>`_, `GPT-J
224+ <https://www.eleuther.ai/artifacts/gpt-j>`_
225+ * The Allen Institute for Artificial Intelligence: `OLMo 2
226+ <https://allenai.org/olmo>`_, `Molmo <https://allenai.org/blog/molmo>`_
227+ * LLM360: `K2
228+ <https://huggingface.co/collections/LLM360/k2-6622ae6911e3eb6219690039>`_,
229+ `Amber
230+ <https://huggingface.co/collections/LLM360/amber-65e7333ff73c7bbb014f2f2f>`_,
231+ `CrystalCoder <https://huggingface.co/LLM360/Crystal>`_
232+ * Google: `T5
233+ <https://github.com/google-research/text-to-text-transfer-transformer>`_
234+
235+ Presumably the following models would also fulfil the requirements if their
236+ licences or other legal terms were changed:
237+
238+ * BigScience: `Bloom <https://huggingface.co/bigscience/bloom>`_
239+ * BigCode: `StarCoder 2 <https://github.com/bigcode-project/starcoder2>`_
240+ * Technology Innovation Institute: `Falcon
241+ <https://huggingface.co/collections/tiiuae/falcon-h1-6819f2795bc406da60fab8df>`_
242+
243+ However, there are also some models that have been analysed and failed because
244+ they lack required components and/or legal agreements:
245+
246+ * Meta: Llama2
247+ * xAI: Grok
248+ * Microsoft: Phi-2
249+ * Mistral AI: Mixtral
250+
177251.. seealso::
178252 * `Licensing Machine Learning models
179- <https://book.the-turing-way.org/reproducible-research/licensing/licensing-ml>`_ by The Turing Way Community
253+ <https://book.the-turing-way.org/reproducible-research/licensing/licensing-ml>`_
254+ by The Turing Way Community
255+ * Alek Tarkowski, Open Future in partnership with the Open Source Initiative:
256+ `Data Governance in Open Source AI
257+ <https://opensource.org/wp-content/uploads/2025/02/2025-OSI-DataGovernanceOSAI-final-v5.pdf>`_
258+
259+ Databases
260+ ~~~~~~~~~
261+
262+ One of the few licences for databases is the `Open Data Commons Open Database
263+ License (ODbL) v1.0 <https://opendatacommons.org/licenses/odbl/1-0/>`_, which is
264+ used, for example, by `OpenStreetMap (OSM) <https://www.openstreetmap.org>`_.
180265
181266Documentation
182267~~~~~~~~~~~~~