You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: man/classIntervals.Rd
+28-1Lines changed: 28 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -69,6 +69,13 @@ classIntervals2shingle(x)
69
69
The "headtails" style uses the algorithm proposed by Bin Jiang (2013), in order to find groupings or hierarchy for data with a heavy-tailed distribution. This classification scheme partitions all of the data values around the mean into two parts and continues the process iteratively for the values (above the mean) in the head until the head part values are no longer heavy-tailed distributed. Thus, the number of classes and the class intervals are both naturally determined. By default the algorithm uses \code{thr = 0.4}, meaning that when the head represents more than 40\% of the observations the distribution is not considered heavy-tailed. The threshold argument \code{thr} may be modified through \code{\dots} (see Examples).
70
70
71
71
The "maximum" style uses the Maximum Breaks method of classification finding the k - 1 largest differences in \code{var}. The mean of the values that generated the largest splits is used as the interval boundary.
72
+
73
+
The "box" style generate 7 breaks (therefore 6 categories) based on a box-and-whisker plot. First and last categories
74
+
includes the data values considered as outliers, and the four remaining categories are defined by the percentiles 25,
75
+
50 and 75 of the data distribution. By default, the identification of outliers is based on the interquantile range
76
+
(IQR), so values lower than percentile 25 - 1.5 * IQR or higher than percentile 75 + 1.5 * IQR are considered as outliers.
77
+
The multiplier applied to the IQR \code{iqr_mult = 1.5} may be modified through \code{\dots}.
0 commit comments