Commit e30be46

SRVCOM-1728: Updating autoscaling docs abstracts for Jupiter
1 parent b0326aa commit e30be46

7 files changed: 13 additions and 24 deletions

modules/serverless-concurrency-limits.adoc renamed to modules/serverless-about-concurrency.adoc

Lines changed: 4 additions & 4 deletions
@@ -1,12 +1,12 @@
 // Module included in the following assemblies:
 //
-// * serverless/develop/serverless-autoscaling-developer.adoc
+// * /serverless/develop/serverless-autoscaling-developer.adoc

 :_content-type: CONCEPT
-[id="serverless-concurrency-limits_{context}"]
-= Concurrency limits and targets
+[id="serverless-about-concurrency_{context}"]
+= Concurrency

-Concurrency can be configured as either a _soft limit_ or a _hard limit_:
+Concurrency determines the number of simultaneous requests that can be processed by each replica of an application at any given time. Concurrency can be configured as a _soft limit_ or a _hard limit_:

 * A soft limit is a targeted requests limit, rather than a strictly enforced bound. For example, if there is a sudden burst of traffic, the soft limit target can be exceeded.

modules/serverless-autoscaling-developer-minscale.adoc

Lines changed: 1 addition & 3 deletions
@@ -6,16 +6,14 @@
 [id="serverless-autoscaling-developer-minscale_{context}"]
 = Minimum scale bounds

-The minimum number of replicas that can serve an application is determined by the `minScale` annotation.
+The minimum number of replicas that can serve an application is determined by the `minScale` annotation. If scale to zero is not enabled, the `minScale` value defaults to `1`.

 The `minScale` value defaults to `0` replicas if the following conditions are met:

 * The `minScale` annotation is not set
 * Scaling to zero is enabled
 * The class `KPA` is used

-If scale to zero is not enabled, the `minScale` value defaults to `1`.
-
 .Example service spec with `minScale` annotation
 [source,yaml]
 ----

modules/serverless-autoscaling-maxscale-kn.adoc

Lines changed: 1 addition & 1 deletion
@@ -6,7 +6,7 @@
 [id="serverless-autoscaling-maxscale-kn_{context}"]
 = Setting the maxScale annotation by using the Knative CLI

-You can use the `kn service` command with the `--max-scale` flag to create or modify the `--max-scale` value for a service.
+Using the `kn` CLI to set the `maxScale` annotation provides a more streamlined and intuitive user interface over modifying YAML files directly. You can use the `kn service` command with the `--max-scale` flag to create or modify the `--max-scale` value for a service.

 .Prerequisites

modules/serverless-autoscaling-minscale-kn.adoc

Lines changed: 2 additions & 2 deletions
@@ -3,10 +3,10 @@
 // * serverless/develop/serverless-autoscaling-developer.adoc

 :_content-type: PROCEDURE
-[id="serverless-autoscaling-minscale_{context}"]
+[id="serverless-autoscaling-minscale-kn_{context}"]
 = Setting the minScale annotation by using the Knative CLI

-You can use the `kn service` command with the `--min-scale` flag to create or modify the `--min-scale` value for a service.
+Using the `kn` CLI to set the `minScale` annotation provides a more streamlined and intuitive user interface over modifying YAML files directly. You can use the `kn service` command with the `--min-scale` flag to create or modify the `--min-scale` value for a service.

 .Prerequisites
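
Note on the two CLI modules above: as a rough illustration of the `kn service` flags they name, the following Python sketch assembles an argument list without running it. The helper name `kn_scale_args`, the service name, and the choice of the `update` subcommand are assumptions made for illustration. Only the `--min-scale` and `--max-scale` flags come from the module text, and newer `kn` releases may spell these flags differently.

```python
from typing import List, Optional


def kn_scale_args(service: str,
                  min_scale: Optional[int] = None,
                  max_scale: Optional[int] = None) -> List[str]:
    """Build (but do not run) a `kn service update` argument list.

    Hypothetical helper: only the two flag names come from the docs.
    """
    for name, value in (("min_scale", min_scale), ("max_scale", max_scale)):
        if value is not None and value < 0:
            raise ValueError(f"{name} must not be negative")

    args = ["kn", "service", "update", service]
    if min_scale is not None:
        args += ["--min-scale", str(min_scale)]
    if max_scale is not None:
        args += ["--max-scale", str(max_scale)]
    return args


# "example-service" is a placeholder name, not taken from the commit.
print(kn_scale_args("example-service", min_scale=2, max_scale=10))
```

The resulting list could be passed to `subprocess.run` on a machine where `kn` is installed and a cluster is configured. The sketch stops short of that so it stays runnable anywhere.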

modules/serverless-concurrency-limits-configure-hard.adoc

Lines changed: 1 addition & 4 deletions
@@ -6,10 +6,7 @@
 [id="serverless-concurrency-limits-configure-hard_{context}"]
 = Configuring a hard concurrency limit

-You can specify a hard concurrency limit for your Knative service by modifying the `containerConcurrency` spec or by using the `kn service` command with the correct flags.
-
-// However, a default value can be set for the Revision's containerConcurrency field in config-defaults.yaml.
-// add note about this for admins to see? Need more details about config-defaults though
+A hard concurrency limit is a strictly enforced upper bound requests limit. If concurrency reaches the hard limit, surplus requests are buffered and must wait until there is enough free capacity to execute the requests. You can specify a hard concurrency limit for your Knative service by modifying the `containerConcurrency` spec, or by using the `kn service` command with the correct flags.

 .Procedure

modules/serverless-concurrency-limits-configure-soft.adoc

Lines changed: 1 addition & 1 deletion
@@ -6,7 +6,7 @@
 [id="serverless-concurrency-limits-configure-soft_{context}"]
 = Configuring a soft concurrency target

-You can specify a soft concurrency target for your Knative service by setting the `autoscaling.knative.dev/target` annotation in the spec, or by using the `kn service` command with the correct flags.
+A soft limit is a targeted requests limit, rather than a strictly enforced bound. For example, if there is a sudden burst of traffic, the soft limit target can be exceeded. You can specify a soft concurrency target for your Knative service by setting the `autoscaling.knative.dev/target` annotation in the spec, or by using the `kn service` command with the correct flags.

 .Procedure
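
Note on the soft and hard concurrency abstracts above: the difference between the two limits can be sketched with a small, simplified model. This is not how Knative is implemented. The `Replica` class and its method names are hypothetical, and the model only mirrors the wording of the abstracts: a soft target can be exceeded during a burst, while a hard limit buffers surplus requests until capacity frees up.

```python
from collections import deque
from typing import Deque, Optional


class Replica:
    """Toy model of one replica; names are illustrative only."""

    def __init__(self, soft_target: int, hard_limit: Optional[int] = None) -> None:
        self.soft_target = soft_target    # targeted limit, not enforced
        self.hard_limit = hard_limit      # enforced upper bound, if set
        self.active = 0                   # requests currently executing
        self.buffered: Deque[str] = deque()

    def receive(self, request_id: str) -> bool:
        """Return True if the request starts now, False if it is buffered."""
        if self.hard_limit is not None and self.active >= self.hard_limit:
            self.buffered.append(request_id)
            return False
        self.active += 1
        return True

    def over_soft_target(self) -> bool:
        """True when a burst has pushed the replica past its soft target."""
        return self.active > self.soft_target

    def finish_one(self) -> None:
        """Complete one request and admit the oldest buffered one, if any."""
        if self.active == 0:
            return
        self.active -= 1
        if self.buffered:
            self.buffered.popleft()
            self.active += 1


# Soft target only: a burst of three requests exceeds the target of two.
soft = Replica(soft_target=2)
for rid in ("a", "b", "c"):
    soft.receive(rid)
print(soft.active, soft.over_soft_target())   # 3 True

# Hard limit of two: the third request waits in the buffer.
hard = Replica(soft_target=2, hard_limit=2)
for rid in ("a", "b", "c"):
    hard.receive(rid)
print(hard.active, list(hard.buffered))       # 2 ['c']
```

In the model, calling `hard.finish_one()` frees a slot and immediately admits the buffered request, which corresponds to the "must wait until there is enough free capacity" wording in the hard-limit abstract.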

serverless/develop/serverless-autoscaling-developer.adoc

Lines changed: 3 additions & 9 deletions
@@ -26,9 +26,7 @@ Any limits or targets that you set for a service are measured against a single i
 [id="serverless-autoscaling-developer-scale-bounds"]
 == Scale bounds

-Scale bounds determine the minimum and maximum numbers of replicas that can serve an application at any given time.
-
-You can set scale bounds for an application to help prevent cold starts or control computing costs.
+Scale bounds determine the minimum and maximum numbers of replicas that can serve an application at any given time. You can set scale bounds for an application to help prevent cold starts or control computing costs.

 // minscale docs
 include::modules/serverless-autoscaling-developer-minscale.adoc[leveloffset=+2]
@@ -38,12 +36,8 @@ include::modules/serverless-autoscaling-minscale-kn.adoc[leveloffset=+3]
 include::modules/serverless-autoscaling-developer-maxscale.adoc[leveloffset=+2]
 include::modules/serverless-autoscaling-maxscale-kn.adoc[leveloffset=+3]

-[id="serverless-autoscaling-developer-concurrency"]
-== Concurrency
-
-Concurrency determines the number of simultaneous requests that can be processed by each replica of an application at any given time.
-
-include::modules/serverless-concurrency-limits.adoc[leveloffset=+2]
+// concurrency
+include::modules/serverless-about-concurrency.adoc[leveloffset=+1]
 include::modules/serverless-concurrency-limits-configure-soft.adoc[leveloffset=+2]
 include::modules/serverless-concurrency-limits-configure-hard.adoc[leveloffset=+2]
 include::modules/serverless-target-utilization.adoc[leveloffset=+2]
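
Note on the `minScale` defaulting rules in modules/serverless-autoscaling-developer-minscale.adoc: the reworded abstract can be read as a small decision function, sketched below in Python. The function name `effective_min_scale` and its parameters are hypothetical. The module text only covers two cases: all three conditions met, and scale to zero disabled. Returning `1` for a non-KPA autoscaler class is an assumption of this sketch.

```python
from typing import Optional

# Assumed identifier for the KPA class; the module text only says "KPA".
KPA = "kpa.autoscaling.knative.dev"


def effective_min_scale(annotation: Optional[int],
                        scale_to_zero_enabled: bool,
                        autoscaler_class: str = KPA) -> int:
    """Return the minimum replica count implied by the documented rules."""
    # An explicit minScale annotation always wins.
    if annotation is not None:
        return annotation
    # Default of 0: annotation unset, scale to zero enabled, KPA class used.
    if scale_to_zero_enabled and autoscaler_class == KPA:
        return 0
    # Documented for "scale to zero not enabled"; assumed for other classes.
    return 1


print(effective_min_scale(None, scale_to_zero_enabled=True))    # 0
print(effective_min_scale(None, scale_to_zero_enabled=False))   # 1
print(effective_min_scale(3, scale_to_zero_enabled=True))       # 3
```

Reading the rules this way shows why moving the "defaults to `1`" sentence into the abstract is harmless: it is the fallback branch, and the bulleted conditions describe the only path to `0`.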
