more

shainaraskas · shainaraskas · commit 29b07882dd2d · 2024-11-29T14:32:53.000-05:00
diff --git a/docs/reference/ccr/index.asciidoc b/docs/reference/ccr/index.asciidoc
@@ -1,6 +1,7 @@
 [role="xpack"]
 [[xpack-ccr]]
 == {ccr-cap}
+
 With {ccr}, you can replicate indices across clusters to:
 
 * Continue handling search requests in the event of a datacenter outage
diff --git a/docs/reference/data-store-architecture.asciidoc b/docs/reference/data-store-architecture.asciidoc
@@ -8,8 +8,9 @@ from any node.
 
 The topics in this section provides information about the architecture of {es} and how it stores and retrieves data: 
 
-<<nodes-shards,Nodes and shards>>: Learn about the basic building blocks of an {es} cluster, including nodes, shards, primaries, and replicas.
-<<docs-replication,Reading and writing documents>>: Learn how {es} replicates read and write operations across shards and shard copies.
+* <<nodes-shards,Nodes and shards>>: Learn about the basic building blocks of an {es} cluster, including nodes, shards, primaries, and replicas.
+* <<docs-replication,Reading and writing documents>>: Learn how {es} replicates read and write operations across shards and shard copies.
+* <<shard-allocation-relocation-recovery,Shard allocation, relocation, and recovery>>: Learn how {es} allocates and balances shards across nodes.
 --
 
 include::nodes-shards.asciidoc[]
diff --git a/docs/reference/high-availability-overview.asciidoc b/docs/reference/high-availability-overview.asciidoc
@@ -0,0 +1,20 @@
+Your data is important to you. Keeping it safe and available is important to Elastic. Sometimes your cluster may experience hardware failure or a power loss. To help you plan for this, {es} offers a number of features to achieve high availability despite failures. Depending on your deployment type, you might need to provision servers in different zones or configure external repositories to meet your organization's availability needs.
+
+* *<<high-availability-cluster-design,Design for resilience>>* 
++
+Distributed systems like Elasticsearch are designed to keep working even if some of their components have failed. An Elasticsearch cluster can continue operating normally if some of its nodes are unavailable or disconnected, as long as there are enough well-connected nodes to take over the unavailable node's responsibilities.
++
+If you're designing a smaller cluster, you might focus on making your cluster resilient to single-node failures. Designers of larger clusters must also consider cases where multiple nodes fail at the same time.
+// need to improve connections to ECE, EC hosted, ECK pod/zone docs in the child topics
+* *<<xpack-ccr,Cross-cluster replication>>*
++
+To effectively distribute read and write operations across nodes, the nodes in a cluster need good, reliable connections to each other. To provide better connections, you typically co-locate the nodes in the same data center or nearby data centers.
++
+Co-locating nodes in a single location exposes you to the risk of a single outage taking your entire cluster offline. To maintain high availability, you can prepare a second cluster that can take over in case of disaster by implementing {ccr} (CCR).
++
+CCR provides a way to automatically synchronize indices from a leader cluster to a follower cluster. This cluster could be in a different data center or even a different content from the leader cluster. If the primary cluster fails, the secondary cluster can take over.
++
+TIP: You can also use CCR to create secondary clusters to serve read requests in geo-proximity to your users.
+* *<<snapshot-restore,Snapshots>>* 
++
+Take snapshots of your cluster that can be restored in case of failure.
diff --git a/docs/reference/high-availability.asciidoc b/docs/reference/high-availability.asciidoc
@@ -3,26 +3,7 @@
 
 [partintro]
 --
-Your data is important to you. Keeping it safe and available is important
-to {es}. Sometimes your cluster may experience hardware failure or a power
-loss. To help you plan for this, {es} offers a number of features
-to achieve high availability despite failures.
-
-* With proper planning, a cluster can be
-  <<high-availability-cluster-design,designed for resilience>> to many of the
-  things that commonly go wrong, from the loss of a single node or network
-  connection right up to a zone-wide outage such as power loss.
-
-* You can use <<xpack-ccr,{ccr}>> to replicate data to a remote _follower_
-  cluster which may be in a different data centre or even on a different
-  continent from the leader cluster. The follower cluster acts as a hot
-  standby, ready for you to fail over in the event of a disaster so severe that
-  the leader cluster fails. The follower cluster can also act as a geo-replica
-  to serve searches from nearby clients.
-
-* The last line of defence against data loss is to take
-  <<snapshots-take-snapshot,regular snapshots>> of your cluster so that you can
-  restore a completely fresh copy of it elsewhere if needed.
+include::{es-ref-dir}/high-availability-overview.asciidoc[]
 --
 
 include::high-availability/cluster-design.asciidoc[]
diff --git a/docs/reference/production.asciidoc b/docs/reference/production.asciidoc
@@ -53,14 +53,26 @@ Refer to the documentation for each deployment method for detailed information a
 | ??
 | ??
 
-| <<elasticsearch-deployment-options,Manual on-premise>>
+| *<<elasticsearch-deployment-options,Manual on-premise>>*
 | Self-hosted
 | ??
 | ??
 |===
 
+
+[discrete]
+== Cluster or deployment design
+
+{es} is built to be always available and to scale with your needs. It does this using a distributed architecture. By distributing your cluster, you can keep Elastic online and responsive to requests.
+
+[discrete]
+=== Where to start
+
+Many {es} options come with different performance considerations and trade-offs. The best way to determine the
+optimal configuration for your use case is through https://www.elastic.co/elasticon/conf/2016/sf/quantitative-cluster-sizing[testing with your own data and queries]. When you understand the shape and size of your data, as well as your use case, you can make informed decisions about how to configure your cluster.
+
 [discrete]
-== Your data retention strategy
+=== Your data retention strategy
 
 include::{es-ref-dir}/lifecycle-options.asciidoc[]
 
@@ -69,63 +81,62 @@ You should determine how long you need to retain your data and how you will mana
 something about when to use which one?
 
 [discrete]
-== Cluster or deployment design
+=== Nodes and shards
 
-Many teams rely on {es} to run their key services. To keep these services running, you can design your {es} deployment
-to keep {es} available, even in case of large-scale outages. To keep it running fast, you also can design your
-deployment to be responsive to production workloads.
+When you move to production, you need to introduce multiple nodes and shards to your cluster. Nodes and shards are what make Elasticsearch distributed and scalable.
 
-{es} is built to be always available and to scale with your needs. It does this using a distributed architecture.
-By distributing your cluster, you can keep Elastic online and responsive to requests.
+The number of these nodes and shards depends on your data, your use case, and your budget. See <<how-to,Optimizations>> for more information.
 
-Nodes and shards design
-Size your shards
-Tuning
-Reference architectures
+The way that you manage your nodes and shards depends on your deployment method:
 
-{es} offers many options that allow you to configure your cluster to meet your organization’s goals, requirements,
-and restrictions. You can review the following guides to learn how to tune your cluster to meet your needs:
+* If you're using a *manual on-premise deployment*, then you need to size and manage your nodes and shards manually.
 
-* <<high-availability-cluster-design,Designing for resilience>>
-* <<tune-for-indexing-speed,Tune for indexing speed>>
-* <<tune-for-search-speed,Tune for search speed>>
-* <<tune-for-disk-usage,Tune for disk usage>>
-* <<use-elasticsearch-for-time-series-data,Tune for time series data>>
+* If you're using *Elastic Cloud Hosted* or *Elastic Cloud Enterprise*, then you can choose from different deployment types to apply sensible defaults for your use case, or set the size of your data on a per-zone, per-tier basis. These products can also autoscale resources in response to workload changes.
+** *Elastic Cloud Hosted resources*: 
+*** {cloud}/ec-create-deployment.html[Create a hosted deployment]
+*** {cloud}/ec-autoscaling.html[Deployment autoscaling]
+** *Elastic Cloud Enterprise resources*:
+*** {ece-ref}/ece-stack-getting-started.html[Working with deployments]
+*** {ece-ref}/ece-autoscaling.html[Deployment autoscaling]
 
-Many {es} options come with different performance considerations and trade-offs. The best way to determine the
-optimal configuration for your use case is through https://www.elastic.co/elasticon/conf/2016/sf/quantitative-cluster-sizing[testing with your own data and queries].
+* If you're using *Elastic Cloud on Kubernetes*, then you can define {eck-ref}/k8s-autoscaling.html[autoscaling policies] and use the {eck-ref}/k8s-stateless-autoscaling.html[Kubernetes horizontal pod autoscaler] to scale different elements in your cluster based on your workload.
+
+Learn more about <<nodes-shards,nodes and shards>>. 
 
 [discrete]
-== Security
+=== High availability and disaster recovery
+
+include::{es-ref-dir}/high-availability-overview.asciidoc[]
 
-<<secure-cluster,Learn about securing an Elasticsearch cluster>>
+// each of these topics needs to be reviewed to mark elements related/unrelated to each deployment type
 
 [discrete]
-== Disaster recovery
+=== Optimize your cluster for your use case
 
-In case of failure, {es} offers tools for cross-cluster replication and cluster snapshots that can
-help you fall back or recover quickly. You can also use cross-cluster replication to serve requests based on the
-geographic location of your users and your resources.
+{es} offers many options that allow you to configure your cluster to meet your organization's goals, requirements, and restrictions. Review these guidelines to learn how to tune your cluster to meet your needs. These guidelines cover elements from hardware provision to query optimization.
 
+* <<tune-for-indexing-speed,Tune for indexing speed>>
+* <<tune-for-search-speed,Tune for search speed>>
+* <<tune-for-disk-usage,Tune for disk usage>>
+* <<use-elasticsearch-for-time-series-data,Tune for time series data>>  
+// do we need this last topic anymore? Is this the best version we have? It's not referenced anywhere. it also isn't updated to use data stream lifecycle
+
+// each of these topics needs to be reviewed to mark elements related/unrelated to each deployment type
 
-To effectively distribute read and write operations across nodes, the nodes in a cluster need good, reliable connections
-to each other. To provide better connections, you typically co-locate the nodes in the same data center or nearby data centers.
+[discrete]
+== Security
 
-Co-locating nodes in a single location exposes you to the risk of a single outage taking your entire cluster offline. To
-maintain high availability, you can prepare a second cluster that can take over in case of disaster by implementing
-cross-cluster replication (CCR).
+The {stack} is composed of many moving parts. There are the {es} nodes that form the cluster, plus {ls} instances, {kib} instances, {beats} agents, and clients all communicating with the cluster. In the case of *Elastic Cloud Hosted*, *Elastic Cloud Enterprise*, or *Elastic Cloud Serverless* deployments, you also need to consider the security of the Elastic Cloud instance.
 
-CCR provides a way to automatically synchronize indices from your primary cluster to a secondary remote cluster that
-can serve as a hot backup. If the primary cluster fails, the secondary cluster can take over.
+Review the following topics
 
-You can also use CCR to create secondary clusters to serve read requests in geo-proximity to your users.
+Enabling security protects {es} clusters by:
 
-Learn more about <<xpack-ccr,cross-cluster replication>> and about <<high-availability-cluster-design,designing for resilience>>.
+* <<preventing-unauthorized-access, Preventing unauthorized access>> with password protection, role-based access control, and IP filtering.
+* <<preserving-data-integrity, Preserving the integrity of your data>> with SSL/TLS encryption.
+* <<maintaining-audit-trail, Maintaining an audit trail>> so you know who's doing what to your cluster and the data it stores.
 
-[TIP]
-====
-You can also take <<snapshot-restore,snapshots>> of your cluster that can be restored in case of failure.
-====
+<<secure-cluster,Learn about securing an Elasticsearch cluster>>. 
 
 [discrete]
 == Monitoring
diff --git a/docs/reference/security/index.asciidoc b/docs/reference/security/index.asciidoc
@@ -4,7 +4,7 @@
 [partintro]
 --
 
-The {stack} is comprised of many moving parts. There are the {es}
+The {stack} is composed of many moving parts. There are the {es}
 nodes that form the cluster, plus {ls} instances, {kib} instances, {beats}
 agents, and clients all communicating with the cluster. To keep your cluster
 safe, adhere to the <<es-security-principles,{es} security principles>>.