Azaya89
diff --git a/‎doc/ref/plotting_options/interactivity.ipynb‎
Lines changed: 2 additions & 3 deletions b/‎doc/ref/plotting_options/interactivity.ipynb‎
Lines changed: 2 additions & 3 deletions
diff --git a/‎doc/ref/plotting_options/resampling.ipynb‎
Lines changed: 114 additions & 44 deletions b/‎doc/ref/plotting_options/resampling.ipynb‎
Lines changed: 114 additions & 44 deletions
diff --git a/‎hvplot/converter.py‎
Lines changed: 38 additions & 4 deletions b/‎hvplot/converter.py‎
Lines changed: 38 additions & 4 deletions
@@ -33,7 +33,7 @@
     "Enables or disables hover tooltips on the plot, also accepts `'hline'` and `'vline'` to change the hit-testing mode.\n",
     "\n",
     "::: {note}\n",
-    "This option is True by default for most plots, but is automatically set to False when `datashade=True` since [Datashader](https://datashader.org/) returns an image that doesn’t support interactivity. If you’re using `datashade=True` and still want interactivity, consider alternatives like using `rasterize=True` or combining datashade with [dynspread](https://datashader.org/api.html#datashader.transfer_functions.dynspread) and overlays that retain interactivity.\n",
+    "This option is `True` by default for most plots, but is automatically set to `False` when [`datashade=True`](option-datashade) and [`selector`](option-selector) is not set, since no relevant data can be displayed as HoloViews returns to the front-end an RGB element that doesn’t include the aggregated data. If you’re using `datashade=True` and still want interactivity, consider alternatives like using [`rasterize=True`](option-rasterize), combining `datashade` with [`dynspread`](option-dynspread), or enabling [`resample_when`](option-resample_when).\n",
     ":::\n",
     "\n",
     "::: {note}\n",
@@ -99,8 +99,7 @@
     "\n",
     "Specifies additional columns from the dataset to be shown in the hover tooltip.\n",
     "- Accepts a list of column names, a single column name as a string, or 'all' to include all available columns.\n",
-    "- When set to 'all', it includes index columns only if `use_index=True`.\n",
-    "- Ignored for `datashade=True` plots, as those do not support interactivity.\n",
+    "- When set to 'all', it includes index columns only if [`use_index=True`](option-use_index).\n",
     "\n",
     "::: {note} \n",
     "`hover_cols` complements the default dimensions shown in the tooltip but does not override them.\n",
 
@@ -19,7 +19,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "The `hvsampledata.synthetic_clusters` dataset is in many examples below."
+    "The `hvsampledata.synthetic_clusters` dataset is used in many examples below. This dataset, returned as a DataFrame object, consists of five sub-datasets combined. Each of the sub-dataset has a random x, y-coordinate based on a normal distribution centered at a specific (x, y) location, with standard deviations derived from a power law, resulting in very dense to very scattered clusters. Each point also carries a `val` (`0` to `4`) and `cat` (`d1` to `d5`) column to identify its dataset and category. The total dataset contains 1,000,000 points, evenly split across the five distributions."
    ]
   },
   {
@@ -61,7 +61,7 @@
     "- Selection of data from a dimension of the supplied dataset, or the index of the corresponding row in the dataset, including: `'first'`, `'last'`, `'min'`, `'max'`.\n",
     "\n",
     "`aggregator` accepts either:\n",
-    "- A [Datashader reduction object](https://datashader.org/api.html#reductions), such as `ds.count()` or `ds.mean('val')`.\n",
+    "- A [Datashader reduction instance](https://datashader.org/api.html#reductions), such as `ds.count()` or `ds.mean('val')`.\n",
     "- A string (e.g. `'mean'`, `'count'`, `'min'`, `'max'`, etc.), in which case the aggregated dimension can be defined by setting the [`color`](option-color) option (if not, the first non-coordinate variable found is used).\n",
     "\n",
     "The `'count_cat'` or `'by'` aggregators can be used for categorical cata. `ds.by(<column>, <reduction>)` allows to define the per-category reduction function (default is `count`). Alternatively, setting the [`by`](option-by) option to a categorical column is equivalent to setting `aggregator=ds.by(<cat_column>)`.\n",
@@ -133,33 +133,6 @@
     "The next examples show how to leverage `ds.summary()` and `ds.where()`. Hover over the plots to see how what information is made available in the tooltip."
    ]
   },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "ds.summary(min_s=ds.min('s'), min_val=ds.min('val'))"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "ds.where(ds.min('s'), 'val')"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "ds.summary(min_s=ds.min('s'), min_val=ds.min('val'))"
-   ]
-  },
   {
    "cell_type": "code",
    "execution_count": null,
@@ -199,7 +172,7 @@
     "This approach can turn even the largest datasets into an image that captures patterns such as density or value distribution, making it ideal for high-volume scatter plots. When `datashade=True`, hvPlot returns a [`DynamicMap`](inv:holoviews#reference/containers/bokeh/DynamicMap) containing an [`RGB`](inv:holoviews#reference/elements/bokeh/RGB) instead of individual glyphs.\n",
     "\n",
     ":::{tip}\n",
-    "Since `datashade=True` produces an RGB image, the underlying data (e.g. the aggregated values per pixel) is not directly available to the plot. Enabling the `'hover'` [tool](options-hover) (disabled by default when `datashade=True`) would only show the RGB value per pixel, and no meaningful colorbar can be attached to the plot. To let the frontend apply colormapping instead of the backend, and as a consequence expose the underlying data, we recommend setting [`rasterize=True`](option-rasterize) instead of `datashade=True`.\n",
+    "Since `datashade=True` produces an RGB image, the underlying data (e.g. the aggregated values per pixel) is not directly available to the plot. Enabling the `'hover'` [tool](options-hover) (disabled by default when `datashade=True` unless [`selector`](option-selector) is set) would only show the RGB value per pixel, and no meaningful colorbar can be attached to the plot. To let the frontend apply colormapping instead of the backend, and as a consequence expose the underlying data, we recommend setting [`rasterize=True`](option-rasterize) instead of `datashade=True`.\n",
     ":::\n",
     "\n",
     "The [`cnorm`](option-cnorm) option defaults to `'eq_hist'` when `datashade=True`."
@@ -216,9 +189,9 @@
     "\n",
     "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
     "\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    x='x', y='y', datashade=True, data_aspect=1, frame_height=250,\n",
-    "    title='Datashaded scatter plot with\\n\"count\" aggregator and\\n\"eq_hist\" cnorm'\n",
+    "    title='Datashaded points plot with\\n\"count\" aggregator and\\n\"eq_hist\" cnorm'\n",
     ")"
    ]
   },
@@ -305,11 +278,11 @@
     "    x='x', y='y', frame_height=250, data_aspect=1,\n",
     "    xlim=(-5.5, -5), ylim=(2.5, 3),\n",
     ")\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    rasterize=True, dynspread=False,\n",
     "    title=\"Datashade without dynspread\", **plot_opts,\n",
     ") +\\\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    rasterize=True, dynspread=True,\n",
     "    title=\"Datashade with dynspread\", **plot_opts,\n",
     ")"
@@ -339,11 +312,11 @@
     "    x='x', y='y', frame_height=250, data_aspect=1,\n",
     "    xlim=(-5.5, -5), ylim=(2.5, 3),\n",
     ")\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    rasterize=True, dynspread=True,\n",
     "    title=\"Dynspread with max_px=3 (default)\", **plot_opts,\n",
     ") +\\\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    rasterize=True, dynspread=True, max_px=8,\n",
     "    title=\"Dynspread with max_px=8\", **plot_opts\n",
     ")"
@@ -383,7 +356,7 @@
     "\n",
     "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
     "\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    x='x', y='y', datashade=True, pixel_ratio=0.1, frame_height=250,\n",
     "    data_aspect=1, title=\"Datashade with low pixel ratio\"\n",
     ")"
@@ -430,9 +403,9 @@
     "\n",
     "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
     "\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    x='x', y='y', rasterize=True, data_aspect=1, frame_height=250, cnorm='log',\n",
-    "    title='Rasterized scatter with count aggregator\\nand log cnorm'\n",
+    "    title='Rasterized points with count aggregator\\nand log cnorm'\n",
     ")"
    ]
   },
@@ -464,7 +437,7 @@
     "\n",
     "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
     "\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    x='x', y='y', rasterize=True, resample_when=1_000,\n",
     "    data_aspect=1, frame_height=250, cnorm='log',\n",
     "    title=\"Rasterize only when >1000 points in view\"\n",
@@ -478,6 +451,103 @@
     "When running the code above, you will notice that after zooming in enough, the original data points appear. This gives a hybrid experience: raw points at low density, rasterized aggregates when zoomed out."
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "(option-selector)=\n",
+    "## `selector`\n",
+    "\n",
+    ":::{versionadded} 0.12.0\n",
+    "Requires `holoviews>=1.21`.\n",
+    "Requires `bokeh>=3.7`.\n",
+    ":::\n",
+    "\n",
+    "When a Datashader operation is applied, with [`datashade=True`](option-datashade) or [`rasterize=True`](option-rasterize), the `selector` option allows to augment the tooltip with information computed (*selected*) from variables other than the aggregated one, effectively showing a sample of the dataset in the tooltip.\n",
+    "\n",
+    "Datashader operations allow to easily identify *macro level patterns* in large datasets by aggregating the data appropriately. However, they do not by default expose information about *individual data points*. Let's take for example a simple scatter plots set with `rasterize=True`; hovering over the image will only display the aggregated value per pixel (`'count'` by default), with no way to know more about each point (unless [`resample_when`](option-resample_when) is enabled and the user zooms in enough). Setting `selector` in this case would augment the tooltip with sample information from other variables, selected from *one unique row* of the dataset. Find out more about `selector` in HoloViews' [Interactive Hover for Big Data guide](https://dev.holoviews.org/user_guide/Interactive_Hover_for_Big_Data.html).\n",
+    "\n",
+    "Like the [`aggregator`](option-aggregator) option, a `selector` refers to a [Datashader `Reduction` object](https://datashader.org/api.html#reductions). However, unlike `aggregator` that accepts reductions that can combine data in a pixel (e.g. `'mean'` or `'count'`), `selector` only accepts reductions that *select* values, including: `'first'`, `'last'`, `'min'`, and `'max'`. Valid options include:\n",
+    "- A string object for reductions that do not require a variable name, including `'first'` and `'last'`.\n",
+    "- A 2-tuple with a reduction name and a variable name, for reductions that require a variable name, including `'min'` and `'max'` (e.g. `('min', 'column')`).\n",
+    "- A reduction instance, including `ds.first()`, `ds.last()`, `ds.min()`, and `ds.max()`.\n",
+    "\n",
+    "::: {note}\n",
+    "The hover tooltip always requires a live kernel when `selector` is set as the values displayed need to be sent by the Python server. Without a live kernel, like on this webpage, all the values are displayed as `'undefined'`.\n",
+    ":::\n",
+    "\n",
+    "When you hover over the first plot below, you will see a value for `s`, `val`, and `cat` in the bottom part of the tooltip. All these values originate from the same row in the DataFrame, that row being the first one found in the subdataset contained within this pixel. In the second plot, the values displayed are derived from the row where `val` is minimum within the hovered pixel."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import hvplot.pandas  # noqa\n",
+    "import hvsampledata\n",
+    "\n",
+    "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
+    "\n",
+    "plot_opts = dict(x='x', y='y', rasterize=True, data_aspect=1, frame_height=250, cnorm='log')\n",
+    "(\n",
+    "    df.hvplot.points(selector='first', title='selector=\"first\"', **plot_opts) +\n",
+    "    df.hvplot.points(selector=('min', 'val'), title='selector=(\"min\", \"val\")', **plot_opts)\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "`datashade=True` plots get their hover tool enabled by default when `selector` is set."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import datashader as ds\n",
+    "import hvplot.pandas  # noqa\n",
+    "import hvsampledata\n",
+    "\n",
+    "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
+    "\n",
+    "df.hvplot.points(\n",
+    "    x='x', y='y', data_aspect=1, frame_height=250, cnorm='log',\n",
+    "    datashade=True, selector=ds.min('val'), title='datashade=True',\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "`selector` can also be set when datashading categorical data."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import hvplot.pandas  # noqa\n",
+    "import hvsampledata\n",
+    "import datashader as ds\n",
+    "\n",
+    "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
+    "\n",
+    "df.hvplot.points(\n",
+    "    x='x', y='y', data_aspect=1, frame_height=250, colorbar=False,\n",
+    "    rasterize=True, aggregator=ds.by('cat'), selector='first',\n",
+    "    title=\"Categorical rasterizing with\\n'count' aggregator'\",\n",
+    ")"
+   ]
+  },
   {
    "cell_type": "markdown",
    "metadata": {},
@@ -503,9 +573,9 @@
     "    x='x', y='y', datashade=True, dynspread=True,\n",
     "    data_aspect=1, frame_width=200, xlim=(-2, 0), ylim=(7, 9),\n",
     ")\n",
-    "df.hvplot.scatter(threshold=0.0, title=\"Dynspread threshold=0.0\", **plot_opts) +\\\n",
-    "df.hvplot.scatter(threshold=0.5, title=\"Dynspread threshold=0.5\", **plot_opts) +\\\n",
-    "df.hvplot.scatter(threshold=1.0, title=\"Dynspread threshold=1.0\", **plot_opts)"
+    "df.hvplot.points(threshold=0.0, title=\"Dynspread threshold=0.0\", **plot_opts) +\\\n",
+    "df.hvplot.points(threshold=0.5, title=\"Dynspread threshold=0.5\", **plot_opts) +\\\n",
+    "df.hvplot.points(threshold=1.0, title=\"Dynspread threshold=1.0\", **plot_opts)"
    ]
   },
   {
@@ -529,7 +599,7 @@
     "\n",
     "df = hvsampledata.synthetic_clusters(\"pandas\")\n",
     "\n",
-    "df.hvplot.scatter(\n",
+    "df.hvplot.points(\n",
     "    x='x', y='y', rasterize=True, x_sampling=0.1, y_sampling=0.1,\n",
     "    data_aspect=1, cnorm='log', xlim=(0, 1), ylim=(0, 1), frame_height=250,\n",
     "    title='Zoomed in rasterized plot\\nwith custom x/y-sampling'\n",
 
@@ -52,6 +52,7 @@
 
 from .backend_transforms import _transfer_opts_cur_backend
 from .util import (
+    _HV_GE_1_21_0,
     filter_opts,
     is_tabular,
     is_series,
@@ -407,7 +408,7 @@ class HoloViewsConverter:
 
     Resampling Options
     ------------------
-    aggregator : str datashader.Reduction or None, default=None
+    aggregator : str, datashader.Reduction, or None, default=None
         Aggregator to use when applying rasterize or datashade operation
         (valid options include 'mean', 'count', 'min', 'max' and more, and
         datashader reduction objects)
@@ -466,6 +467,18 @@ class HoloViewsConverter:
         Applies a resampling operation (datashade, rasterize or downsample) if
         the number of individual data points present in the current viewport
         is above this threshold. The raw plot is displayed otherwise.
+    selector : datashader.Reduction | str | tuple | None, default=None
+        Datashader reduction to apply during a ``rasterize`` or ``datashade``
+        operation, used to select additional information for inclusion in the
+        hover tooltip. Supported options include:
+
+        - string: only ``'first'`` and ``'last'``
+        - tuple of two strings: ``(<reduction>, <column>)``, e.g. ``('min', 'value')``.
+        - Datashader object: ``ds.first``, ``ds.last``, ``ds.min``, and ``ds.max``.
+
+        .. versionadded:: 0.12.0
+           Requires ``holoviews>=1.21``.
+           Requires ``bokeh>=3.7``.
     threshold : float, default=0.5
         When using ``dynspread``, this value defines the minimum density of overlapping points
         required before the spreading operation is applied.
@@ -610,6 +623,7 @@ class HoloViewsConverter:
         'dynspread',
         'max_px',
         'precompute',
+        'selector',
         'threshold',
     ]
 
@@ -794,6 +808,7 @@ def __init__(
         debug=False,
         framewise=True,
         aggregator=None,
+        selector=None,
         projection=None,
         global_extent=None,
         geo=False,
@@ -911,12 +926,20 @@ def __init__(
                 'At least one resampling operation (rasterize, datashader, '
                 'downsample) must be enabled when resample_when is set.'
             )
+        if selector is not None:
+            if not _HV_GE_1_21_0:
+                msg = 'selector requires holoviews>=1.21.'
+                raise ImportError(msg)
+            if not (datashade or rasterize):
+                msg = 'rasterize or datashade must be enabled when selector is set.'
+                raise ValueError(msg)
         self.resample_when = resample_when
         self.datashade = datashade
         self.rasterize = rasterize
         self.downsample = downsample
         self.dynspread = dynspread
         self.aggregator = aggregator
+        self.selector = selector
         self.precompute = precompute
         self.x_sampling = x_sampling
         self.y_sampling = y_sampling
@@ -1043,7 +1066,7 @@ def __init__(
         if kind == 'errorbars':
             hover = False
         elif hover is None:
-            hover = not self.datashade
+            hover = True if self.selector else not self.datashade
         if hover and not any(
             t for t in tools if isinstance(t, HoverTool) or t in ['hover', 'vline', 'hline']
         ):
@@ -1962,13 +1985,24 @@ def method_wrapper(ds, x, y):
             layers = _transfer_opts_cur_backend(layers)
             return layers
 
-        import_datashader()
+        ds = import_datashader()
         from holoviews.operation.datashader import datashade, rasterize, dynspread
 
         categorical, agg = self._process_categorical_datashader()
         if agg:
             opts['aggregator'] = agg
-
+        if self.selector:
+            selector = self.selector
+            try:
+                if isinstance(selector, str):
+                    selector = getattr(ds, selector)()
+                elif isinstance(selector, tuple):
+                    selector = getattr(ds, selector[0])(selector[1])
+            except AttributeError as e:
+                sel = selector[0] if isinstance(selector, tuple) else selector
+                msg = f'Invalid selector value {sel!r}.'
+                raise ValueError(msg) from e
+            opts['selector'] = selector
         if self.precompute:
             opts['precompute'] = self.precompute
         if self.x_sampling: