Merge branch 'notebooks'

ruivieira · ruivieira · commit 28e18978b9af · 2021-07-05T14:32:24.000+01:00
diff --git a/README.md b/README.md
@@ -62,3 +62,7 @@ You can see the `sumSkipModel` in the [LIME tests](./tests/test_limeexplainer.py
 ## Examples
 
 You can look at the [tests](./tests) for working examples.
+
+There are also Jupyter notebooks available:
+
+- [Counterfactual explanations](notebooks/Counterfactuals.ipynb)
diff --git a/notebooks/Counterfactuals.ipynb b/notebooks/Counterfactuals.ipynb
@@ -0,0 +1,296 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "8f0f2186",
+   "metadata": {},
+   "source": [
+    "# Counterfactual explanations"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "569777b3",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import trustyai\n",
+    "\n",
+    "trustyai.init(\n",
+    "    path=[\n",
+    "        \"../dep/org/kie/kogito/explainability-core/1.8.0.Final/*\",\n",
+    "        \"../dep/org/slf4j/slf4j-api/1.7.30/slf4j-api-1.7.30.jar\",\n",
+    "        \"../dep/org/apache/commons/commons-lang3/3.12.0/commons-lang3-3.12.0.jar\",\n",
+    "        \"../dep/org/optaplanner/optaplanner-core/8.8.0.Final/optaplanner-core-8.8.0.Final.jar\",\n",
+    "        \"../dep/org/apache/commons/commons-math3/3.6.1/commons-math3-3.6.1.jar\",\n",
+    "        \"../dep/org/kie/kie-api/7.55.0.Final/kie-api-7.55.0.Final.jar\",\n",
+    "        \"../dep/io/micrometer/micrometer-core/1.6.6/micrometer-core-1.6.6.jar\",\n",
+    "    ]\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "512462ee",
+   "metadata": {},
+   "source": [
+    "## Simple example\n",
+    "\n",
+    "We start by defining our black-box model, typically represented by\n",
+    "\n",
+    "$$\n",
+    "f(\\mathbf{x}) = \\mathbf{y}\n",
+    "$$\n",
+    "\n",
+    "Where $\\mathbf{x}=\\{x_1, x_2, \\dots,x_m\\}$ and $\\mathbf{y}=\\{y_1, y_2, \\dots,y_n\\}$.\n",
+    "\n",
+    "Our example toy model, in this case, takes an all-numerical input $\\mathbf{x}$ and return a $\\mathbf{y}$ of either `true` or `false` if the sum of the $\\mathbf{x}$ components is within a threshold $\\epsilon$ of a point $\\mathbf{C}$, that is:\n",
+    "\n",
+    "$$\n",
+    "f(\\mathbf{x}, \\epsilon, \\mathbf{C})=\\begin{cases}\n",
+    "\\text{true},\\qquad \\text{if}\\ \\mathbf{C}-\\epsilon<\\sum_{i=1}^m x_i <\\mathbf{C}+\\epsilon \\\\\n",
+    "\\text{false},\\qquad \\text{otherwise}\n",
+    "\\end{cases}\n",
+    "$$\n",
+    "\n",
+    "This model is provided in the `TestUtils` module. We instantiate with a $\\mathbf{C}=500$ and $\\epsilon=1.0$."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "id": "e4f89877",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from trustyai.utils import TestUtils\n",
+    "\n",
+    "center = 500.0\n",
+    "epsilon = 10.0\n",
+    "\n",
+    "model = TestUtils.getSumThresholdModel(center, epsilon)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "f0bb1cc2",
+   "metadata": {},
+   "source": [
+    "Next we need to define a **goal**.\n",
+    "If our model is $f(\\mathbf{x'})=\\mathbf{y'}$ we are then defining our $\\mathbf{y'}$ and the counterfactual result will be the $\\mathbf{x'}$ which satisfies $f(\\mathbf{x'})=\\mathbf{y'}$.\n",
+    "\n",
+    "We will define our goal as `true`, that is, the sum is withing the vicinity of a (to be defined) point $\\mathbf{C}$. The goal is a list of `Output` which take the following parameters\n",
+    "\n",
+    "- The feature name\n",
+    "- The feature type\n",
+    "- The feature value (wrapped in `Value`)\n",
+    "- A confidence threshold, which we will leave at zero (no threshold)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "5bcb0105",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from trustyai.model import Output, Type, Value\n",
+    "\n",
+    "goal = [Output(\"inside\", Type.BOOLEAN, Value(True), 0.0)]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "6aa524ae",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import random\n",
+    "from trustyai.model import FeatureFactory\n",
+    "\n",
+    "features = [FeatureFactory.newNumericalFeature(f\"f-num{i+1}\", random.random()*10.0) for i in range(4)]\n",
+    "\n",
+    "for f in features:\n",
+    "    print(f\"Feature {f.getName()} has value {f.getValue()}\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "513d2e5a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "constraints = [False] * 4"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "30dcc15b",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from trustyai.model.domain import NumericalFeatureDomain\n",
+    "\n",
+    "feature_boundaries = [NumericalFeatureDomain.create(0.0, 1000.0)] * 4"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "5047e075",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from trustyai.model import DataDomain\n",
+    "\n",
+    "data_domain = DataDomain(feature_boundaries)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "e1b0da83",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "center = 500.0\n",
+    "epsilon = 10.0"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "510b3b16",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from trustyai.utils import TestUtils\n",
+    "\n",
+    "model = TestUtils.getSumThresholdModel(center, epsilon)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "bcd25df0",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from org.optaplanner.core.config.solver.termination import TerminationConfig\n",
+    "from org.kie.kogito.explainability.local.counterfactual import CounterfactualConfigurationFactory\n",
+    "from java.lang import Long\n",
+    "\n",
+    "termination_config = TerminationConfig().withScoreCalculationCountLimit(Long.valueOf(10_000))\n",
+    "\n",
+    "solver_config = (\n",
+    "        CounterfactualConfigurationFactory.builder()\n",
+    "        .withTerminationConfig(termination_config)\n",
+    "        .build()\n",
+    "    )"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "c2b76274",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from org.kie.kogito.explainability.local.counterfactual import CounterfactualExplainer\n",
+    "\n",
+    "explainer = CounterfactualExplainer.builder().withSolverConfig(solver_config).build()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "4cff79cd",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from trustyai.model import PredictionFeatureDomain, PredictionInput, PredictionOutput\n",
+    "\n",
+    "inputs = PredictionInput(features)\n",
+    "outputs = PredictionOutput(goal)\n",
+    "domain = PredictionFeatureDomain(data_domain.getFeatureDomains())"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "98057ebd",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import uuid\n",
+    "from trustyai.model import CounterfactualPrediction\n",
+    "\n",
+    "prediction = CounterfactualPrediction(inputs, outputs, domain, constraints, None, uuid.uuid4())"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "910a250f",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "explanation_async = explainer.explainAsync(prediction, model)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "38774822",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "explanation = explanation_async.get()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "7cb95b8c",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "for entity in explanation.getEntities():\n",
+    "    print(entity)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "7a8587d1",
+   "metadata": {},
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "python-trustyai",
+   "language": "python",
+   "name": "python-trustyai"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.5"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
diff --git a/requirements-dev.txt b/requirements-dev.txt
@@ -2,4 +2,6 @@ black
 pandas
 scikit-learn
 pylint
-pytest
+pytest
+setuptools
+wheel
diff --git a/setup.cfg b/setup.cfg
@@ -0,0 +1,13 @@
+[build-system]
+requires = ["setuptools", "wheel"]
+build-backend = "setuptools.build_meta"
+
+[metadata]
+name = trustyai
+version = 0.0.1
+
+[options]
+packages = find:
+include_package_data = False
+requires = ['Jpype1']
+python_requires = >=3.8
diff --git a/setup.py b/setup.py
@@ -0,0 +1,3 @@
+import setuptools
+
+setuptools.setup(packages=['trustyai', 'trustyai.local.counterfactual', 'trustyai.local.lime', 'trustyai.model', 'trustyai.utils'])

Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,3 @@`
	`1`	`+import setuptools`
	`2`	`+`
	`3`	`+setuptools.setup(packages=['trustyai', 'trustyai.local.counterfactual', 'trustyai.local.lime', 'trustyai.model', 'trustyai.utils'])`