Skip to content

Commit 3497577

Browse files
committed
add opt-out section
1 parent c89497b commit 3497577

File tree

1 file changed

+5
-1
lines changed

1 file changed

+5
-1
lines changed

develop-docs/application-architecture/dynamic-sampling/extrapolation.mdx

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ There are two modes that can be used to view data in Sentry: default mode and sa
4545

4646
Depending on the context and the use case, one mode may be more useful than the other.
4747

48-
There is currently no way for Sentry to automatically switch from the default mode into sample mode based on query attributes, therefore the transition needs to be triggered by the user. However, Sentry can nudge the user, based on observed characteristics of a query, to switch from one mode to another. One example for this is when an ID column is detected: extrapolated aggregates for high-cardinality and low-volume ID columns are usually not very useful, because they may refer to a highly exaggerated volume of data that is not extrapolated correctly due to the high-cardinality nature of the column in question.
48+
Generally, default makes sense for all queries that aggregate on a dataset of sufficient volume. As absolute sample size decreases below a certain limit, default mode becomes less and less useful. There may be scenarios where the user will want to switch between modes, for example to examine the aggregate numbers first, and dive into single samples for investigation, therefore the sample mode settings should be a transient view option that resets to default mode when the user opens the page the next time.
4949

5050
## Aggregates
5151

@@ -83,6 +83,10 @@ In new product surfaces, the question of whether or not to use extrapolated vs n
8383
4. Does the user care more about a truthful estimate of the aggregate data or about the actual events that happened?
8484
1. Some scenarios, like visualizing metrics over time, are based on aggregates, whereas a case of debugging a specific user’s problem hinges on actually seeing the specific events. The best mode depends on the intended usage of the product.
8585

86+
87+
### Opting Out of Extrapolation
88+
Users may want to opt out of extrapolation for different reasons. It is always possible to set the sample rate to 100% and therefore send all data to Sentry, implicitly opting out of extrapolation and behaving in the same way as sample mode.
89+
8690
### Confidence
8791

8892
When users filter on data that has a very low count but also a low sample rate, yielding a highly extrapolated but low-sample dataset, developers and users should be careful with the conclusions they draw from the data. The storage platform provides confidence intervals along with the extrapolated estimates for the different aggregation types to indicate when there is elevated uncertainty in the data. These types of datasets are inherently noisy and may contain misleading information. When this is discovered, the user should either be very careful with the conclusions they draw from the aggregate data, or switch to non-default mode for investigation of the individual samples.

0 commit comments

Comments
 (0)