Commit 3469361

Merge pull request #415 from aai-institute/doc/add-glossary-of-terms
Doc/add glossary of terms
2 parents 62295d4 + 8fc2a2b commit 3469361

5 files changed, 19 additions and 4 deletions

CHANGELOG.md

Lines changed: 2 additions & 0 deletions
@@ -11,6 +11,8 @@ the board, with a focus on documentation and usability.
 [PR #365](https://github.com/aai-institute/pyDVL/pull/365)
 - Enabled parallel computation for Leave-One-Out values
 [PR #406](https://github.com/aai-institute/pyDVL/pull/406)
+- Added more abbreviations to documentation
+[PR #415](https://github.com/aai-institute/pyDVL/pull/415)

 ### Changed
 - Replaced sphinx with mkdocs for documentation. Major overhaul of documentation

CONTRIBUTING.md

Lines changed: 10 additions & 0 deletions
@@ -250,6 +250,16 @@ def f(x: float) -> float:
 return 1/(x*x)
 ```

+### Abbreviations
+
+We keep the abbreviations used in the documentation inside the
+[docs_include/abbreviations.md](docs_includes%2Fabbreviations.md) file.
+
+The syntax for abbreviations is:
+
+```markdown
+*[ABBR]: Abbreviation
+```

 ## CI

docs/value/index.md

Lines changed: 2 additions & 3 deletions
@@ -76,7 +76,7 @@ there are additional desiderata, like having a value function that does not
 increase with repeated samples. Game-theoretic methods are all rooted in axioms
 that by construction ensure different desiderata, but despite their practical
 usefulness, none of them are either necessary or sufficient for all
-applications. For instance, *[SV]s try to equitably distribute all value
+applications. For instance, SV methods try to equitably distribute all value
 among all samples, failing to identify repeated ones as unnecessary, with e.g. a
 zero value.

@@ -332,8 +332,7 @@ nature of every (non-trivial) ML problem can have an effect:
 [@wang_data_2022] prove that by relaxing one of the Shapley axioms
 and considering the general class of semi-values, of which Shapley is an
 instance, one can prove that a choice of constant weights is the best one can
-do in a utility-agnostic setting. So-called *Data Banzhaf* is on our to-do
-list!
+do in a utility-agnostic setting. So-called *Data Banzhaf*.

 * **Data set size**: Computing exact Shapley values is NP-hard, and Monte Carlo
 approximations can converge slowly. Massive datasets are thus impractical, at
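
For readers of this hunk, the "constant weights" semi-value it mentions (Data Banzhaf) can be illustrated with a minimal sketch. This is not pyDVL code: `banzhaf_values` and both toy utilities are hypothetical names introduced here, and the exact enumeration below is exponential in the number of points, so it is only meant for tiny examples.

```python
from itertools import combinations
from typing import Callable, FrozenSet, List


def banzhaf_values(n: int, utility: Callable[[FrozenSet[int]], float]) -> List[float]:
    """Exact Banzhaf value of each of n points (hypothetical helper, not pyDVL).

    Every subset S of the other points gets the same weight 1 / 2**(n - 1),
    which is the constant-weight choice referred to in the text.
    """
    values = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        # Enumerate all subsets of the remaining points, of every size.
        for size in range(len(others) + 1):
            for subset in combinations(others, size):
                s = frozenset(subset)
                # Marginal contribution of point i to subset s.
                total += utility(s | {i}) - utility(s)
        values.append(total / 2 ** (n - 1))
    return values


def additive_utility(s: FrozenSet[int]) -> float:
    # Toy utility: every point contributes exactly 1.
    return float(len(s))


def pair_utility(s: FrozenSet[int]) -> float:
    # Toy utility: only the pair {0, 1} together is useful.
    return 1.0 if {0, 1} <= s else 0.0


additive_result = banzhaf_values(3, additive_utility)
pair_result = banzhaf_values(3, pair_utility)
```

With the additive utility every point receives the same value, while with the pair utility the third point, which never changes the utility, receives zero.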

docs/value/semi-values.md

Lines changed: 1 addition & 1 deletion
@@ -117,7 +117,7 @@ values = compute_generic_semivalues(
 u=utility,
 coefficient=beta_coefficient(alpha=1, beta=16),
 done=AbsoluteStandardError(threshold=1e-4),
-)
+)
 ```

 Allowing any coefficient can help when experimenting with models which are more

docs_includes/abbreviations.md

Lines changed: 4 additions & 0 deletions
@@ -9,3 +9,7 @@
 *[MSE]: Mean Squared Error
 *[SV]: Shapley Value
 *[TMCS]: Truncated Monte Carlo Shapley
+*[IF]: Influence Function
+*[iHVP]: inverse Hessian-vector product
+*[LiSSA]: Linear-time Stochastic Second-order Algorithm
+*[DUL]: Data Utility Learning
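
The `*[ABBR]: Abbreviation` entries above follow the syntax of Python-Markdown's abbreviation extension. As a rough illustration of that syntax only, the sketch below reads such lines into a dictionary. `parse_abbreviations` and `ABBR_LINE` are hypothetical names, not part of pyDVL or of any Markdown library, and the sample text copies three entries from this diff.

```python
import re
from typing import Dict

# One definition per line: "*[ABBR]: Expansion" (assumed to fit on a single line).
ABBR_LINE = re.compile(r"^\*\[(?P<abbr>[^\]]+)\]:\s*(?P<expansion>.+?)\s*$")


def parse_abbreviations(text: str) -> Dict[str, str]:
    """Map each abbreviation to its expansion, skipping lines that do not match."""
    result: Dict[str, str] = {}
    for line in text.splitlines():
        match = ABBR_LINE.match(line.strip())
        if match:
            result[match.group("abbr")] = match.group("expansion")
    return result


sample = """\
*[SV]: Shapley Value
*[IF]: Influence Function
*[iHVP]: inverse Hessian-vector product
"""

glossary = parse_abbreviations(sample)
```

A check like this could be used to spot duplicate or malformed entries before building the documentation; the rendering itself is left to the Markdown toolchain.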
