aai-institute
diff --git a/‎CHANGELOG.md‎
Lines changed: 3 additions & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 3 additions & 0 deletions
diff --git a/‎docs/30-data-valuation.rst‎
Lines changed: 36 additions & 7 deletions b/‎docs/30-data-valuation.rst‎
Lines changed: 36 additions & 7 deletions
diff --git a/‎src/pydvl/utils/__init__.py‎
Lines changed: 1 addition & 0 deletions b/‎src/pydvl/utils/__init__.py‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎src/pydvl/utils/numeric.py‎
Lines changed: 1 addition & 17 deletions b/‎src/pydvl/utils/numeric.py‎
Lines changed: 1 addition & 17 deletions
diff --git a/‎src/pydvl/utils/score.py‎
Lines changed: 126 additions & 0 deletions b/‎src/pydvl/utils/score.py‎
Lines changed: 126 additions & 0 deletions
diff --git a/‎src/pydvl/utils/types.py‎
Lines changed: 6 additions & 56 deletions b/‎src/pydvl/utils/types.py‎
Lines changed: 6 additions & 56 deletions
@@ -2,6 +2,9 @@
 
 ## Unreleased
 
+- Added `Scorer` class for a cleaner interface. Fix minor bugs around
+  Group-Testing Shapley and switched to cvxpy for the constraint solver.
+  [PR #264](https://github.com/appliedAI-Initiative/pyDVL/pull/264)
 - Generalised stopping criteria for valuation algorithms. Improved classes
   `ValuationResult` and `Status` with more operations. Some minor issues fixed.
   [PR #252](https://github.com/appliedAI-Initiative/pyDVL/pull/250)
 
@@ -118,6 +118,34 @@ is implemented, it is important not to reuse `Utility` objects for different
 datasets. You can read more about :ref:`caching setup` in the installation guide
 and the documentation of the :mod:`pydvl.utils.caching` module.
 
+Using custom scorers
+^^^^^^^^^^^^^^^^^^^^
+
+The `scoring` argument of :class:`~pydvl.utils.utility.Utility` can be used to
+specify a custom :class:`~pydvl.utils.utility.Scorer` object. This is a simple
+wrapper for a callable that takes a model, and test data and returns a score.
+
+More importantly, the object provides information about the range of the score,
+which is used by some methods by estimate the number of samples necessary, and
+about what default value to use when the model fails to train.
+
+.. note::
+   The most important property of a `Scorer` is its default value. Because many
+   models will fail to fit on small subsets of the data, it is important to
+   provide a sensible default value for the score.
+
+It is possible to skip the construction of the :class:`~pydvl.utils.utility.Scorer`
+when constructing the `Utility` object. The two following calls are equivalent:
+
+.. code-block:: python
+
+   utility = Utility(
+       model, dataset, "explained_variance", score_range=(-np.inf, 1), default_score=0.0
+   )
+   utility = Utility(
+       model, dataset, Scorer("explained_variance", range=(-np.inf, 1), default=0.0)
+   )
+
 Learning the utility
 ^^^^^^^^^^^^^^^^^^^^
 
@@ -369,14 +397,15 @@ $$
    but we don't advocate its use because of the speed and memory cost. Despite
    our best efforts, the number of samples required in practice for convergence
    can be several orders of magnitude worse than with e.g. Truncated Monte Carlo.
+   Additionally, the CSP can sometimes turn out to be infeasible.
 
 Usage follows the same pattern as every other Shapley method, but with the
-addition of an ``eps`` parameter required for the solution of the CSP. It should
-be the same value used to compute the minimum number of samples required. This
-can be done with :func:`~pydvl.value.shapley.gt.num_samples_eps_delta`, but note
-that the number returned will be huge! In practice, fewer samples can be enough,
-but the actual number will strongly depend on the utility, in particular its
-variance.
+addition of an ``epsilon`` parameter required for the solution of the CSP. It
+should be the same value used to compute the minimum number of samples required.
+This can be done with :func:`~pydvl.value.shapley.gt.num_samples_eps_delta`, but
+note that the number returned will be huge! In practice, fewer samples can be
+enough, but the actual number will strongly depend on the utility, in particular
+its variance.
 
 .. code-block:: python
 
@@ -550,7 +579,7 @@ nature of every (non-trivial) ML problem can have an effect:
 
   pyDVL offers a dedicated :func:`function composition
   <pydvl.utils.types.compose_score>` for scorer functions which can be used to
-  squash a score. The following is defined in module :mod:`~pydvl.utils.numeric`:
+  squash a score. The following is defined in module :mod:`~pydvl.utils.scorer`:
 
   .. code-block:: python
 
 
@@ -4,6 +4,7 @@
 from .numeric import *
 from .parallel import *
 from .progress import *
+from .score import *
 from .status import *
 from .types import *
 from .utility import *
@@ -4,13 +4,10 @@
 """
 
 from itertools import chain, combinations
-from typing import Collection, Generator, Iterator, Optional, Tuple, TypeVar, overload
+from typing import Collection, Generator, Iterator, Optional, Tuple, TypeVar
 
 import numpy as np
 from numpy.typing import NDArray
-from scipy.special import expit
-
-from pydvl.utils.types import compose_score
 
 FloatOrArray = TypeVar("FloatOrArray", float, NDArray[np.float_])
 IntOrArray = TypeVar("IntOrArray", int, NDArray[np.int_])
@@ -26,8 +23,6 @@
     "random_powerset",
     "random_subset_of_size",
     "top_k_value_accuracy",
-    "squashed_r2",
-    "squashed_variance",
 ]
 
 T = TypeVar("T", bound=np.generic)
@@ -277,14 +272,3 @@ def top_k_value_accuracy(
     top_k_pred_values = np.argsort(y_pred)[-k:]
     top_k_accuracy = len(np.intersect1d(top_k_exact_values, top_k_pred_values)) / k
     return top_k_accuracy
-
-
-def sigmoid(x: float) -> float:
-    result: float = expit(x).item()
-    return result
-
-
-squashed_r2 = compose_score("r2", sigmoid, "squashed r2")
-squashed_variance = compose_score(
-    "explained_variance", sigmoid, "squashed explained variance"
-)
@@ -0,0 +1,126 @@
+"""
+This module provides a :class:`Scorer` class that wraps scoring functions with
+additional information.
+
+Scorers can be constructed in the same way as in scikit-learn: either from 
+known strings or from a callable. Greater values must be better. If they are not,
+a negated version can be used, see scikit-learn's `make_scorer()
+<https://scikit-learn.org/stable/modules/generated/sklearn.metrics.make_scorer.html>`_.
+
+:class:`Scorer` provides additional information about the scoring function, like
+its range and default values.
+"""
+from typing import Callable, Optional, Protocol, Tuple, Union
+
+import numpy as np
+from numpy.typing import NDArray
+from scipy.special import expit
+from sklearn.metrics import get_scorer
+
+from pydvl.utils.types import SupervisedModel
+
+__all__ = ["Scorer", "compose_score", "squashed_r2", "squashed_variance"]
+
+
+class ScorerCallable(Protocol):
+    """Signature for a scorer"""
+
+    def __call__(self, model: SupervisedModel, X: NDArray, y: NDArray) -> float:
+        ...
+
+
+class Scorer:
+    """A scoring callable that takes a model, data, and labels and returns a
+    scalar.
+
+    :param scoring: Either a string or callable that can be passed to
+        `get_scorer
+        <https://scikit-learn.org/stable/modules/generated/sklearn.metrics.get_scorer.html>`_.
+    :param default: score to be used when a model cannot be fit, e.g. when too
+        little data is passed, or errors arise.
+    :param range: numerical range of the score function. Some Monte Carlo
+        methods can use this to estimate the number of samples required for a
+        certain quality of approximation. If not provided, it can be read from
+        the ``scoring`` object if it provides it, for instance if it was
+        constructed with :func:`~pydvl.utils.types.compose_score`.
+    :param name: The name of the scorer. If not provided, the name of the
+        function passed will be used.
+
+    .. versionadded:: 0.5.0
+
+    """
+
+    _name: str
+    range: NDArray[np.float_]
+
+    def __init__(
+        self,
+        scoring: Union[str, ScorerCallable],
+        default: float = np.nan,
+        range: Tuple = (-np.inf, np.inf),
+        name: Optional[str] = None,
+    ):
+        self._scorer = get_scorer(scoring)
+        self.default = default
+        # TODO: auto-fill from known scorers ?
+        self.range = np.array(range)
+        self._name = getattr(self._scorer, "__name__", name or "scorer")
+
+    def __call__(self, model: SupervisedModel, X: NDArray, y: NDArray) -> float:
+        return self._scorer(model, X, y)  # type: ignore
+
+    def __str__(self):
+        return self._name
+
+    def __repr__(self):
+        capitalized_name = "".join(s.capitalize() for s in self._name.split(" "))
+        return f"{capitalized_name} (scorer={self._scorer})"
+
+
+def compose_score(
+    scorer: Scorer,
+    transformation: Callable[[float], float],
+    range: Tuple[float, float],
+    name: str,
+) -> Scorer:
+    """Composes a scoring function with an arbitrary scalar transformation.
+
+    Useful to squash unbounded scores into ranges manageable by data valuation
+    methods.
+
+    .. code-block:: python
+       :caption: Example usage
+
+       sigmoid = lambda x: 1/(1+np.exp(-x))
+       compose_score(Scorer("r2"), sigmoid, range=(0,1), name="squashed r2")
+
+    :param scorer: The object to be composed.
+    :param transformation: A scalar transformation
+    :param range: The range of the transformation. This will be used e.g. by
+        :class:`~pydvl.utils.utility.Utility` for the range of the composed.
+    :param name: A string representation for the composition, for `str()`.
+    :return: The composite :class:`Scorer`.
+    """
+
+    class NewScorer(Scorer):
+        def __call__(self, model: SupervisedModel, X: NDArray, y: NDArray) -> float:
+            score = self._scorer(model=model, X=X, y=y)
+            return transformation(score)
+
+    return NewScorer(scorer, range=range, name=name)
+
+
+def _sigmoid(x: float) -> float:
+    result: float = expit(x).item()
+    return result
+
+
+squashed_r2 = compose_score(Scorer("r2"), _sigmoid, (0, 1), "squashed r2")
+""" A scorer that squashes the R² score into the range [0, 1] using a sigmoid."""
+
+
+squashed_variance = compose_score(
+    Scorer("explained_variance"), _sigmoid, (0, 1), "squashed explained variance"
+)
+""" A scorer that squashes the explained variance score into the range [0, 1] using
+    a sigmoid."""
@@ -2,12 +2,11 @@
 transformations. Some of it probably belongs elsewhere.
 """
 import inspect
-from typing import Callable, Optional, Protocol, Type, Union
+from typing import Callable, Protocol, Type
 
-from numpy import ndarray
-from sklearn.metrics import get_scorer
+from numpy.typing import NDArray
 
-__all__ = ["SupervisedModel", "Scorer", "compose_score"]
+__all__ = ["SupervisedModel"]
 
 
 class SupervisedModel(Protocol):
@@ -18,19 +17,16 @@ class SupervisedModel(Protocol):
     `score()`.
     """
 
-    def fit(self, x: ndarray, y: ndarray):
+    def fit(self, x: NDArray, y: NDArray):
         pass
 
-    def predict(self, x: ndarray) -> ndarray:
+    def predict(self, x: NDArray) -> NDArray:
         pass
 
-    def score(self, x: ndarray, y: ndarray) -> float:
+    def score(self, x: NDArray, y: NDArray) -> float:
         pass
 
 
-Scorer = Callable[[SupervisedModel, ndarray, ndarray], float]
-
-
 def unpackable(cls: Type) -> Type:
     """A class decorator that allows unpacking of all attributes of an object
     with the double asterisk operator.
@@ -103,49 +99,3 @@ def wrapper(*args, **kwargs):
         return fun(*args, **kwargs)
 
     return wrapper
-
-
-# FIXME: This probably should be somewhere else
-def compose_score(
-    score: Union[str, Scorer],
-    transformation: Callable[[float], float],
-    name: str = None,
-):
-    """Composes a scoring function with an arbitrary scalar transformation.
-
-    Useful to squash unbounded scores into ranges manageable by data valuation
-    methods.
-
-    .. code-block:: python
-       :caption: Example usage
-
-       sigmoid = lambda x: 1/(1+np.exp(-x))
-       compose_score("r2", sigmoid, "squashed r2")
-
-    :param score: Either a callable or a string naming any of sklearn's scorers
-    :param transformation: A scalar transformation
-    :param name: A string representation for the composition, for `str()`.
-
-    :return: The function composition.
-    """
-    scoring_function: Scorer = get_scorer(score) if isinstance(score, str) else score
-
-    class NewScorer(object):
-        def __init__(self, scorer: Scorer, name: Optional[str] = None):
-            self._scorer = scorer
-            self._name = name or "Composite " + getattr(
-                self._scorer, "__name__", "scorer"
-            )
-
-        def __call__(self, *args, **kwargs):
-            score = self._scorer(*args, **kwargs)
-            return transformation(score)
-
-        def __str__(self):
-            return self._name
-
-        def __repr__(self):
-            capitalized_name = "".join(s.capitalize() for s in self._name.split(" "))
-            return f"{capitalized_name} (scorer={self._scorer})"
-
-    return NewScorer(scoring_function, name=name)