DOC-5424 stash progress on compaction section

andy-stark-redis · andy-stark-redis · commit 96e9ef41972b · 2025-07-04T12:51:35.000+01:00
diff --git a/content/develop/data-types/timeseries/_index.md b/content/develop/data-types/timeseries/_index.md
@@ -19,92 +19,196 @@ weight: 150
 [![Discord](https://img.shields.io/discord/697882427875393627?style=flat-square)](https://discord.gg/KExRgMb)
 [![Github](https://img.shields.io/static/v1?label=&message=repository&color=5961FF&logo=github)](https://github.com/RedisTimeSeries/RedisTimeSeries/)
 
-The Redis time series structure lets you store and query timestamped data points.
-
-Redis time series is available in Redis Open Source, Redis Software, and Redis Cloud.
+The Redis time series data type lets you store real-valued data points
+along with the time they were collected. You can combine the values from a selection
+of time series and query them by time or value range. You can also compute
+aggregate functions of the data over periods of time and create new time series
+from the results. When you create a time series, you can specify a maximum
+retention period for the data, relative to the last reported timestamp, to
+prevent the time series from growing indefinitely.
+
+Time series support very fast reads and writes, making them ideal for
+applications such as:
+
+- Instrument data logging
+- System performance metrics
+- Financial market data
+- Internet of Things (IoT) sensor data
+- Smart metering
+- Quality of service (QoS) monitoring
+
+Redis time series are available in Redis Open Source, Redis Software, and Redis Cloud.
 See
 [Install Redis Open Source]({{< relref "/operate/oss_and_stack/install/install-stack" >}}) or
 [Install Redis Enterprise]({{< relref "/operate/rs/installing-upgrading/install" >}})
 for full installation instructions.
 
-## Features
-* High volume inserts, low latency reads
-* Query by start time and end-time
-* Aggregated queries (min, max, avg, sum, range, count, first, last, STD.P, STD.S, Var.P, Var.S, twa) for any time bucket
-* Configurable maximum retention period
-* Compaction for automatically updated aggregated timeseries
-* Secondary indexing for time series entries. Each time series has labels (field value pairs) which will allows to query by labels
-
-## Creating a timeseries
-A new timeseries can be created with the [`TS.CREATE`]({{< relref "commands/ts.create/" >}}) command; for example, to create a timeseries named `sensor1` run the following:
-
-```
-TS.CREATE sensor1
+## Creating a time series
+
+You can create a new empty time series with the [`TS.CREATE`]({{< relref "commands/ts.create/" >}}) command, specifying a key name. If you use [`TS.ADD`]({{< relref "commands/ts.add/" >}}) to add data to a time series key that does not exist, it is automatically created.
+
+```bash
+> TS.CREATE thermometer:1
+OK
+> TYPE thermometer:1
+TSDB-TYPE
+> TS.INFO thermometer:1
+ 1) totalSamples
+ 2) (integer) 0
+    .
+    .
+```
+
+The timestamp for each data point is a 64-bit integer value. This is designed
+to support Unix timestamps, measured in milliseconds since the
+[Unix epoch](https://en.wikipedia.org/wiki/Unix_time). However, you can interpret
+the timestamps in any way you like (for example, as the number of days since a given start date).
+When you create a time series, you can specify a maximum retention period for the
+data, relative to the last reported timestamp. A retention period of `0` means
+the data does not expire.
+
+```bash
+# Create a new time series with a first value of 10.8 (Celsius),
+# recorded on day 1, with a retention period of 100 days.
+> TS.ADD thermometer:2 1 10.8 RETENTION 100
+(integer) 1
+> TS.INFO thermometer:2
+    .
+    .
+ 9) retentionTime
+10) (integer) 100
+    .
+    .
+```
+
+You can also add one or more *labels* to a time series when you create it. Labels
+are key-value pairs where the value can be a string or a number. You can use
+both the keys and values to select subsets of all the available time series
+for queries and aggregations.
+
+```bash
+> TS.ADD thermometer:3 1 10.4 LABELS location UK type Mercury
+(integer) 1
+> TS.INFO thermometer:3
+ 1) totalSamples
+ 2) (integer) 1
+ 3) memoryUsage
+ 4) (integer) 5000
+    .
+    .
+19) labels
+20) 1) 1) "location"
+       2) "UK"
+    2) 1) "type"
+       2) "Mercury"
+    .
+    .
 ```
 
-You can prevent your timeseries growing indefinitely by setting a maximum age for samples compared to the last event time (in milliseconds) with the `RETENTION` option. The default value for retention is `0`, which means the series will not be trimmed.
-
-```
-TS.CREATE sensor1 RETENTION 2678400000
-```
-This will create a timeseries called `sensor1` and trim it to values of up to one month.
-
 
 ## Adding data points
-For adding new data points to a timeseries we use the [`TS.ADD`]({{< relref "commands/ts.add/" >}}) command:
-
-```
-TS.ADD key timestamp value
-```
 
-The `timestamp` argument is the UNIX timestamp of the sample in milliseconds and `value` is the numeric data value of the sample.
+You can add individual data points with [`TS.ADD`]({{< relref "commands/ts.add/" >}}),
+but you can also use [`TS.MADD`]({{< relref "commands/ts.madd/" >}}) to add multiple data
+points to one or more time series in a single command. (Note that unlike `TS.ADD`, `TS.MADD`
+doesn't create any new time series if you specify keys that don't exist.) The return value
+is an array containing the number of samples in each time series after the operation.
+If you use the `*` character as the timestamp, Redis will record the current
+Unix time, as reported by the server's clock.
 
-Example:
+```bash
+> TS.MADD thermometer:1 1 9.2 thermometer:1 2 9.9 thermometer:2 2 10.3
+1) (integer) 1
+2) (integer) 2
+3) (integer) 2
 ```
-TS.ADD sensor1 1626434637914 26
-```
-
-To **add a datapoint with the current timestamp** you can use a `*` instead of a specific timestamp:
-
-```
-TS.ADD sensor1 * 26
-```
-
-You can **append data points to multiple timeseries** at the same time with the [`TS.MADD`]({{< relref "commands/ts.madd/" >}}) command:
-```
-TS.MADD key timestamp value [key timestamp value ...]
-```
-
 
 ## Deleting data points
-Data points between two timestamps (inclusive) can be deleted with the [`TS.DEL`]({{< relref "commands/ts.del/" >}}) command:
-```
-TS.DEL key fromTimestamp toTimestamp
-```
-Example:
-```
-TS.DEL sensor1 1000 2000
-```
 
-To delete a single timestamp, use it as both the "from" and "to" timestamp:
-```
-TS.DEL sensor1 1000 1000
+Use [`TS.DEL`]({{< relref "commands/ts.del/" >}}) to delete data points
+that fall within a given timestamp range. The range is inclusive, meaning that
+samples whose timestamp equals the start or end of the range are deleted.
+If you want to delete a single timestamp, use it as both the start and end of the range.
+
+```bash
+> TS.INFO thermometer:1
+ 1) totalSamples
+ 2) (integer) 2
+ 3) memoryUsage
+ 4) (integer) 4856
+ 5) firstTimestamp
+ 6) (integer) 1
+ 7) lastTimestamp
+ 8) (integer) 2
+    .
+    .
+> TS.ADD thermometer:1 3 9.7
+(integer) 3
+127.0.0.1:6379> TS.INFO thermometer:1
+ 1) totalSamples
+ 2) (integer) 3
+ 3) memoryUsage
+ 4) (integer) 4856
+ 5) firstTimestamp
+ 6) (integer) 1
+ 7) lastTimestamp
+ 8) (integer) 3
+    .
+    .
+> TS.DEL thermometer:1 1 2
+(integer) 2
+> TS.INFO thermometer:1
+ 1) totalSamples
+ 2) (integer) 1
+ 3) memoryUsage
+ 4) (integer) 4856
+ 5) firstTimestamp
+ 6) (integer) 3
+ 7) lastTimestamp
+ 8) (integer) 3
+    .
+    .
+> TS.DEL thermometer:1 3 3
+(integer) 1
+> TS.INFO thermometer:1
+ 1) totalSamples
+ 2) (integer) 0
+    .
+    .
 ```
 
 **Note:** When a sample is deleted, the data in all downsampled timeseries will be recalculated for the specific bucket. If part of the bucket has already been removed though, because it's outside of the retention period, we won't be able to recalculate the full bucket, so in those cases we will refuse the delete operation.
 
-
-## Labels
-Labels are key-value metadata we attach to data points, allowing us to group and filter. They can be either string or numeric values and are added to a timeseries on creation:
-
-```
-TS.CREATE sensor1 LABELS region east
-```
-
-
-
 ## Compaction
-Another useful feature of Redis Time Series is compacting data by creating a rule for compaction ([`TS.CREATERULE`]({{< relref "commands/ts.createrule/" >}})). For example, if you have collected more than one billion data points in a day, you could aggregate the data by every minute in order to downsample it, thereby reducing the dataset size to 24 * 60 = 1,440 data points. You can choose one of the many available aggregation types in order to aggregate multiple data points from a certain minute into a single one. The currently supported aggregation types are: `avg, sum, min, max, range, count, first, last, std.p, std.s, var.p, var.s and twa`.
+
+A time series can become large if samples are added very frequently. Instead
+of dealing with individual samples, it is sometimes useful to split the full
+time range of the series into equal-sized "buckets" and represent each
+bucket by an aggregate value, such as the average or maximum value.
+Reducing the number of data points in this way is known as *compaction*.
+
+For example, if you expect to collect more than one billion data points in a day, you could aggregate the data using buckets of one minute. Since each bucket is represented by a single value, this compacts the dataset size to 1,440 data points (24 hours x 60 minutes = 1,440 minutes).
+
+Use [`TS.CREATERULE`]({{< relref "commands/ts.createrule/" >}}) to create a
+
+new
+compacted time series from an existing one, leaving the original series unchanged.
+Specify a duration for each bucket and an aggregation function to apply to each bucket.
+The available aggregation functions are:
+
+- `avg`: Arithmetic mean of all values
+- `sum`: Sum of all values
+- `min`: Minimum value
+- `max`: Maximum value
+- `range`: Difference between the highest and the lowest value
+- `count`: Number of values
+- `first`: Value with lowest timestamp in the bucket
+- `last`:  Value with highest timestamp in the bucket
+- `std.p`: Population standard deviation of the values
+- `std.s`: Sample standard deviation of the values
+- `var.p`: Population variance of the values
+- `var.s`: Sample variance of the values
+- `twa`: Time-weighted average over the bucket's timeframe (since RedisTimeSeries v1.8)
 
 It's important to point out that there is no data rewriting on the original timeseries; the compaction happens in a new series, while the original one stays the same. In order to prevent the original timeseries from growing indefinitely, you can use the retention option, which will trim it down to a certain period of time.