|
83 | 83 | * [CHANGE] Removed `CortexQuerierCapacityFull` alert. #342
|
84 | 84 | * [CHANGE] Changes blocks storage alerts to group metrics by the configured `cluster_labels` (supporting the deprecated `alert_aggregation_labels`). #351
|
85 | 85 | * [CHANGE] Increased `CortexIngesterReachingSeriesLimit` critical alert threshold from 80% to 85%. #363
|
86 |
| -* [CHANGE] Decreased `-server.grpc-max-concurrent-streams` from 100k to 10k. #369 |
87 |
| -* [CHANGE] Decreased blocks storage ingesters graceful termination period from 80m to 20m. #369 |
88 |
| -* [CHANGE] Changed default `job_names` for query-frontend, query-scheduler and querier to match custom deployments too. #376 |
89 |
| -* [CHANGE] Increase the rules per group and rule groups limits on different tiers. #396 |
90 |
| -* [CHANGE] Removed `max_samples_per_query` limit, since it only works with chunks and only when using `-distributor.shard-by-all-labels=false`. #397 |
91 |
| -* [CHANGE] Removed chunks storage query sharding config support. The following config options have been removed: #398 |
92 |
| - * `_config` > `queryFrontend` > `shard_factor` |
93 |
| - * `_config` > `queryFrontend` > `sharded_queries_enabled` |
94 |
| - * `_config` > `queryFrontend` > `query_split_factor` |
95 |
| -* [CHANGE] Split `cortex_api` recording rule group into three groups. This is a workaround for large clusters where this group can become slow to evaluate. #401 |
96 |
| -* [CHANGE] Increased `CortexIngesterReachingSeriesLimit` warning threshold from 70% to 80% and critical threshold from 85% to 90%. #404 |
97 |
| -* [CHANGE] Rename ruler_s3_bucket_name and ruler_gcs_bucket_name to ruler_storage_bucket_name: #415 |
98 |
| -* [CHANGE] Fine-tuned rolling update policy for distributor, querier, query-frontend, query-scheduler. #420 |
99 |
| -* [CHANGE] Increased memcached metadata/chunks/index-queries max connections from 4k to 16k. #420 |
100 |
| -* [CHANGE] Disabled step alignment in query-frontend to be compliant with PromQL. #420 |
101 |
| -* [CHANGE] Do not limit compactor CPU and request a number of cores equal to the configured concurrency. #420 |
102 |
| -* [ENHANCEMENT] Add overrides config to compactor. This allows setting retention configs per user. #386 |
103 | 86 | * [ENHANCEMENT] cortex-mixin: Make `cluster_namespace_deployment:kube_pod_container_resource_requests_{cpu_cores,memory_bytes}:sum` backwards compatible with `kube-state-metrics` v2.0.0. #317
|
104 | 87 | * [ENHANCEMENT] Cortex-mixin: Include `cortex-gw-internal` naming variation in default `gateway` job names. #328
|
105 | 88 | * [ENHANCEMENT] Ruler dashboard: added object storage metrics. #354
|
|
117 | 100 | * "Tenant Configuration Sync" row - information about the configuration sync procedure.
|
118 | 101 | * "Sharding Initial State Sync" row - information about the initial state sync procedure when sharding is enabled.
|
119 | 102 | * "Sharding Runtime State Sync" row - information about various state operations which occur when sharding is enabled (replication, fetch, marge, persist).
|
120 |
| -* [ENHANCEMENT] Added 256MB memory ballast to querier. #369 |
121 |
| -* [ENHANCEMENT] Update gsutil command for `not healthy index found` playbook #370 |
122 |
| -* [ENHANCEMENT] Update `etcd-operator` to latest version (see https://github.com/grafana/jsonnet-libs/pull/480). #263 |
123 |
| -* [ENHANCEMENT] Added Alertmanager alerts and playbooks covering configuration syncs and sharding operation: #377 #378 |
124 |
| - * `CortexAlertmanagerSyncConfigsFailing` |
125 |
| - * `CortexAlertmanagerRingCheckFailing` |
126 |
| - * `CortexAlertmanagerPartialStateMergeFailing` |
127 |
| - * `CortexAlertmanagerReplicationFailing` |
128 |
| - * `CortexAlertmanagerPersistStateFailing` |
129 |
| - * `CortexAlertmanagerInitialSyncFailed` |
130 |
| -* [ENHANCEMENT] Add support for Azure storage in Alertmanager configuration. #381 |
131 |
| -* [ENHANCEMENT] Add support for running Alertmanager in sharding mode. #394 |
132 |
| -* [ENHANCEMENT] Allow to customize PromQL engine settings via `queryEngineConfig`. #399 |
133 |
| -* [ENHANCEMENT] Add recording rules to improve responsiveness of Alertmanager dashboard. #387 |
134 |
| -* [ENHANCEMENT] Add `CortexRolloutStuck` alert. #405 |
135 |
| -* [ENHANCEMENT] Added `CortexKVStoreFailure` alert. #406 |
136 |
| -* [ENHANCEMENT] Use configured `ruler` jobname for ruler dashboard panels. #409 |
137 |
| -* [ENHANCEMENT] Add ability to override `datasource` for generated dashboards. #407 |
138 |
| -* [ENHANCEMENT] Use alertmanager jobname for alertmanager dashboard panels #411 |
139 |
| -* [ENHANCEMENT] Added `CortexDistributorReachingInflightPushRequestLimit` alert. #408 |
140 |
| -* [ENHANCEMENT] Define Azure object storage ruler args. #416 |
141 |
| -* [ENHANCEMENT] Added the following config options to allow to schedule multiple replicas of the same service on the same node: #418 |
142 |
| - * `cortex_distributor_allow_multiple_replicas_on_same_node` |
143 |
| - * `cortex_ruler_allow_multiple_replicas_on_same_node` |
144 |
| - * `cortex_querier_allow_multiple_replicas_on_same_node` |
145 |
| - * `cortex_query_frontend_allow_multiple_replicas_on_same_node` |
146 | 103 | * [BUGFIX] Fixed `CortexIngesterHasNotShippedBlocks` alert false positive in case an ingester instance had ingested samples in the past, then no traffic was received for a long period and then it started receiving samples again. #308
|
147 | 104 | * [BUGFIX] Alertmanager: fixed `--alertmanager.cluster.peers` CLI flag passed to alertmanager when HA is enabled. #329
|
148 | 105 | * [BUGFIX] Fixed `CortexInconsistentRuntimeConfig` metric. #335
|
149 | 106 | * [BUGFIX] Fixed scaling dashboard to correctly work when a Cortex service deployment spans across multiple zones (a zone is expected to have the `zone-[a-z]` suffix). #365
|
150 | 107 | * [BUGFIX] Fixed rollout progress dashboard to correctly work when a Cortex service deployment spans across multiple zones (a zone is expected to have the `zone-[a-z]` suffix). #366
|
151 |
| -* [BUGFIX] Fixed rollout progress dashboard to include query-scheduler too. #376 |
152 |
| -* [BUGFIX] Fixed `-distributor.extend-writes` setting on ruler when `unregister_ingesters_on_shutdown` is disabled. #369 |
153 |
| -* [BUGFIX] Upstream recording rule `node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate` renamed. #379 |
154 |
| -* [BUGFIX] Treat `compactor_blocks_retention_period` type as string rather than int.#395 |
155 |
| -* [BUGFIX] Fixed writes/reads/alertmanager resources dashboards to use `$._config.job_names.gateway`. #403 |
156 |
| -* [BUGFIX] Span the annotation.message in alerts as YAML multiline strings. #412 |
157 |
| -* [BUGFIX] Pass `-ruler-storage.s3.endpoint` to ruler when using S3. #421 |
158 | 108 |
|
159 | 109 | ## 1.9.0 / 2021-05-18
|
160 | 110 |
|
|
0 commit comments