Add explanation of BayesianOptimizerController to tutorial

peanutfun · peanutfun · commit b8f1cac6bede · 2024-02-23T15:04:49.000+01:00
diff --git a/doc/tutorial/climada_util_calibrate.ipynb b/doc/tutorial/climada_util_calibrate.ipynb
@@ -2071,6 +2071,29 @@
    "source": [
     "## Execute the Calibration\n",
     "\n",
+    "We created a class `BayesianOptimizerController` to control and guide the calibration process.\n",
+    "It is intended to walk through several optimization iterations and stop the process if the buest guess cannot be improved.\n",
+    "The optimization works as follows:\n",
+    "\n",
+    "1. The optimizer randomly samples the parameter space `BayesianOptimizerController.init_points` times.\n",
+    "2. The optimizer uses a Gaussian regression process to \"smartly\" sample the parameter space at most `BayesianOptimizerController.n_iter` times.\n",
+    "   * The process uses an \"Upper Confidence Bound\" sampling method whose parameter `BayesianOptimizerController.kappa` indicates how close the sampled points are to the buest guess.\n",
+    "     Higher `kappa` means more exploration of the parameter space, lower `kappa` means more exploitation.\n",
+    "   * After each sample, the parameter `kappa` is reduced by the factor `BayesianOptimizerController.kappa_decay`.\n",
+    "     By default, this parameter is set such that `kappa` equals `BayesianOptimizerController.kappa_min` at the last step.\n",
+    "     This way, the sampling becomes more exploitative the more steps are taken.\n",
+    "3. The controller tracks the improvements of the buest guess for parameters.\n",
+    "   If `BayesianOptimizerController.min_improvement_count` consecutive improvements are lower than `BayesianOptimizerController.min_improvement`, the smart sampling is stopped.\n",
+    "   In this case, the `BayesianOptimizerController.iterations` count is increased and the process repeated from step 1.\n",
+    "4. If an entire iteration did not show any improvement, the optimization is stopped.\n",
+    "   It is also stopped when the `BayesianOptimizerController.max_iterations` count is reached.\n",
+    "\n",
+    "Users can control the \"density\", and thus the accuracy of the sampling by adjusting the controller parameters.\n",
+    "Increasing `init_points`, `n_iter`, `min_improvement_count`, and `max_iterations`, and decreasing `min_improvement` generally increases density and accuracy, but leads to longer runtimes.\n",
+    "\n",
+    "We suggest using the `from_input` classmethod for a convenient choice of sampling density based on the parameter space.\n",
+    "The two parameters `init_points` and `n_iter` are set to $b^N$, where $N$ is the number of estimated parameters and $b$ is the `sampling_base` parameter, which defaults to 4.\n",
+    "\n",
     "Now we can finally execute our calibration task!\n",
     "We will plug all input parameters in an instance of `Input`, and then create the optimizer instance with it.\n",
     "The `Optimizer.run` method returns an `Output` object, whose `params` attribute holds the optimal parameters determined by the calibration.\n",
@@ -2122,11 +2145,185 @@
     "\n",
     "    # Create and run the optimizer\n",
     "    opt = BayesianOptimizer(input)\n",
-    "    controller = BayesianOptimizerController.from_input(inp=input)\n",
+    "    controller = BayesianOptimizerController.from_input(input)\n",
     "    bayes_output = opt.run(controller)\n",
     "    bayes_output.params  # The optimal parameters"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Evaluate Output"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "The Bayesian Optimizer returns the entire paramter space it sampled.\n",
+    "We can find out a lot about the relation of the fitted parameters by investigating how the cost function value depends on them.\n",
+    "We can retrieve the parameter space as `pandas.DataFrame` via `BayesianOptimizerOutput.p_space_to_dataframe`.\n",
+    "This dataframe has MultiIndex columns.\n",
+    "One group are the `Parameters`, the other holds information on the `Calibration` for each parameter set.\n",
+    "Notice that the optimal parameter set is not necessarily the last entry in the parameter space!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead tr th {\n",
+       "        text-align: left;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead tr:last-of-type th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr>\n",
+       "      <th></th>\n",
+       "      <th colspan=\"2\" halign=\"left\">Parameters</th>\n",
+       "      <th>Calibration</th>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th></th>\n",
+       "      <th>scale</th>\n",
+       "      <th>v_half</th>\n",
+       "      <th>Cost Function</th>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>Iteration</th>\n",
+       "      <th></th>\n",
+       "      <th></th>\n",
+       "      <th></th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>0</th>\n",
+       "      <td>0.422852</td>\n",
+       "      <td>115.264302</td>\n",
+       "      <td>2.726950</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>1</th>\n",
+       "      <td>0.010113</td>\n",
+       "      <td>63.349706</td>\n",
+       "      <td>4.133135</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>2</th>\n",
+       "      <td>0.155288</td>\n",
+       "      <td>37.268453</td>\n",
+       "      <td>0.800611</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>3</th>\n",
+       "      <td>0.194398</td>\n",
+       "      <td>68.718642</td>\n",
+       "      <td>1.683610</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4</th>\n",
+       "      <td>0.402800</td>\n",
+       "      <td>92.721038</td>\n",
+       "      <td>2.046407</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>...</th>\n",
+       "      <td>...</td>\n",
+       "      <td>...</td>\n",
+       "      <td>...</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>246</th>\n",
+       "      <td>0.956750</td>\n",
+       "      <td>45.733468</td>\n",
+       "      <td>0.804684</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>247</th>\n",
+       "      <td>0.788425</td>\n",
+       "      <td>48.043636</td>\n",
+       "      <td>0.768636</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>248</th>\n",
+       "      <td>0.826704</td>\n",
+       "      <td>49.932348</td>\n",
+       "      <td>0.764991</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>249</th>\n",
+       "      <td>0.880736</td>\n",
+       "      <td>49.290532</td>\n",
+       "      <td>0.767437</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>250</th>\n",
+       "      <td>0.744523</td>\n",
+       "      <td>51.596642</td>\n",
+       "      <td>0.770761</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "<p>251 rows × 3 columns</p>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "          Parameters               Calibration\n",
+       "               scale      v_half Cost Function\n",
+       "Iteration                                     \n",
+       "0           0.422852  115.264302      2.726950\n",
+       "1           0.010113   63.349706      4.133135\n",
+       "2           0.155288   37.268453      0.800611\n",
+       "3           0.194398   68.718642      1.683610\n",
+       "4           0.402800   92.721038      2.046407\n",
+       "...              ...         ...           ...\n",
+       "246         0.956750   45.733468      0.804684\n",
+       "247         0.788425   48.043636      0.768636\n",
+       "248         0.826704   49.932348      0.764991\n",
+       "249         0.880736   49.290532      0.767437\n",
+       "250         0.744523   51.596642      0.770761\n",
+       "\n",
+       "[251 rows x 3 columns]"
+      ]
+     },
+     "execution_count": 14,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "p_space_df = bayes_output.p_space_to_dataframe()\n",
+    "p_space_df"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "In contrast, the controller only tracks the consecutive improvements of the best guess."
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 13,
@@ -2337,173 +2534,6 @@
     "controller.improvements()"
    ]
   },
-  {
-   "cell_type": "markdown",
-   "metadata": {},
-   "source": [
-    "## Evaluate Output"
-   ]
-  },
-  {
-   "attachments": {},
-   "cell_type": "markdown",
-   "metadata": {},
-   "source": [
-    "The Bayesian Optimizer returns the entire paramter space it sampled.\n",
-    "We can find out a lot about the relation of the fitted parameters by investigating how the cost function value depends on them.\n",
-    "We can retrieve the parameter space as `pandas.DataFrame` via `BayesianOptimizerOutput.p_space_to_dataframe`.\n",
-    "This dataframe has MultiIndex columns.\n",
-    "One group are the `Parameters`, the other holds information on the `Calibration` for each parameter set.\n",
-    "Notice that the optimal parameter set is not necessarily the last entry in the parameter space!"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 14,
-   "metadata": {},
-   "outputs": [
-    {
-     "data": {
-      "text/html": [
-       "<div>\n",
-       "<style scoped>\n",
-       "    .dataframe tbody tr th:only-of-type {\n",
-       "        vertical-align: middle;\n",
-       "    }\n",
-       "\n",
-       "    .dataframe tbody tr th {\n",
-       "        vertical-align: top;\n",
-       "    }\n",
-       "\n",
-       "    .dataframe thead tr th {\n",
-       "        text-align: left;\n",
-       "    }\n",
-       "\n",
-       "    .dataframe thead tr:last-of-type th {\n",
-       "        text-align: right;\n",
-       "    }\n",
-       "</style>\n",
-       "<table border=\"1\" class=\"dataframe\">\n",
-       "  <thead>\n",
-       "    <tr>\n",
-       "      <th></th>\n",
-       "      <th colspan=\"2\" halign=\"left\">Parameters</th>\n",
-       "      <th>Calibration</th>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th></th>\n",
-       "      <th>scale</th>\n",
-       "      <th>v_half</th>\n",
-       "      <th>Cost Function</th>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>Iteration</th>\n",
-       "      <th></th>\n",
-       "      <th></th>\n",
-       "      <th></th>\n",
-       "    </tr>\n",
-       "  </thead>\n",
-       "  <tbody>\n",
-       "    <tr>\n",
-       "      <th>0</th>\n",
-       "      <td>0.422852</td>\n",
-       "      <td>115.264302</td>\n",
-       "      <td>2.726950</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>1</th>\n",
-       "      <td>0.010113</td>\n",
-       "      <td>63.349706</td>\n",
-       "      <td>4.133135</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>2</th>\n",
-       "      <td>0.155288</td>\n",
-       "      <td>37.268453</td>\n",
-       "      <td>0.800611</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>3</th>\n",
-       "      <td>0.194398</td>\n",
-       "      <td>68.718642</td>\n",
-       "      <td>1.683610</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>4</th>\n",
-       "      <td>0.402800</td>\n",
-       "      <td>92.721038</td>\n",
-       "      <td>2.046407</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>...</th>\n",
-       "      <td>...</td>\n",
-       "      <td>...</td>\n",
-       "      <td>...</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>246</th>\n",
-       "      <td>0.956750</td>\n",
-       "      <td>45.733468</td>\n",
-       "      <td>0.804684</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>247</th>\n",
-       "      <td>0.788425</td>\n",
-       "      <td>48.043636</td>\n",
-       "      <td>0.768636</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>248</th>\n",
-       "      <td>0.826704</td>\n",
-       "      <td>49.932348</td>\n",
-       "      <td>0.764991</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>249</th>\n",
-       "      <td>0.880736</td>\n",
-       "      <td>49.290532</td>\n",
-       "      <td>0.767437</td>\n",
-       "    </tr>\n",
-       "    <tr>\n",
-       "      <th>250</th>\n",
-       "      <td>0.744523</td>\n",
-       "      <td>51.596642</td>\n",
-       "      <td>0.770761</td>\n",
-       "    </tr>\n",
-       "  </tbody>\n",
-       "</table>\n",
-       "<p>251 rows × 3 columns</p>\n",
-       "</div>"
-      ],
-      "text/plain": [
-       "          Parameters               Calibration\n",
-       "               scale      v_half Cost Function\n",
-       "Iteration                                     \n",
-       "0           0.422852  115.264302      2.726950\n",
-       "1           0.010113   63.349706      4.133135\n",
-       "2           0.155288   37.268453      0.800611\n",
-       "3           0.194398   68.718642      1.683610\n",
-       "4           0.402800   92.721038      2.046407\n",
-       "...              ...         ...           ...\n",
-       "246         0.956750   45.733468      0.804684\n",
-       "247         0.788425   48.043636      0.768636\n",
-       "248         0.826704   49.932348      0.764991\n",
-       "249         0.880736   49.290532      0.767437\n",
-       "250         0.744523   51.596642      0.770761\n",
-       "\n",
-       "[251 rows x 3 columns]"
-      ]
-     },
-     "execution_count": 14,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "p_space_df = bayes_output.p_space_to_dataframe()\n",
-    "p_space_df"
-   ]
-  },
   {
    "cell_type": "markdown",
    "metadata": {},