Add a page on how to remove secondary from the cluster on AWS

NataliaIvakina · NataliaIvakina · commit 64ecd3c9191d · 2025-04-02T11:49:20.000+02:00
diff --git a/modules/ROOT/content-nav.adoc b/modules/ROOT/content-nav.adoc
@@ -15,6 +15,7 @@
 ** xref:cloud-deployments/neo4j-aws.adoc[]
 ** xref:cloud-deployments/neo4j-gcp.adoc[]
 ** xref:cloud-deployments/neo4j-azure.adoc[]
+** xref:cloud-deployments/neo4j-cluster-cloud.adoc[]
 
 * xref:docker/index.adoc[]
 ** xref:docker/introduction.adoc[]
diff --git a/modules/ROOT/pages/cloud-deployments/neo4j-cluster-cloud.adoc b/modules/ROOT/pages/cloud-deployments/neo4j-cluster-cloud.adoc
@@ -0,0 +1,109 @@
+:description: Tha page describes how to manage the Neo4j cluster on AWS.
+:page-role: enterprise-edition
+
+[[neo4j-cluster-cloud-deployments]]
+= Neo4j cluster on self-managed cloud deployments
+
+Before diving into the topic, it is important to understand basics about Neo4j's clustering.
+
+Neo4j cluster consists of a homogenous pool of servers that collectively run a number of databases.
+The servers can operate in two different database-hosting modes: _primary_ and _secondary_.
+A server can simultaneously act as a primary host for one or more databases and as a secondary host for other databases.
+
+For more details on operational and application aspects of Neo4j's clustering, refer to the xref::clustering/index.adoc[Clustering in Neo4j].
+
+For information on how to manage databases and servers in a cluster, see respectively xref::clustering/databases.adoc[] and xref::clustering/servers.adoc[].
+
+
+== Neo4j cluster on AWS
+
+Neo4j does not provide Amazon Machine Images (AMIs) with a pre-installed version of the product.
+The Neo4j AWS Marketplace listings (and listings on GitHub) use CloudFormation templates that deploy and configure Neo4j dynamically with a shell script.
+
+
+// === Neo4j cluster and auto-scaling groups on AWS
+
+
+=== Removing a (secondary constrained) server from the cluster
+
+Imagine you have a cluster consisting of three primary constrained servers and two secondary constrained servers.
+This means that three servers host primary databases and the other two host secondary databases.
+
+When performing rolling updates on Amazon Machine Images (AMIs) for secondary servers, it is important to follow a structured approach.
+Rotating AMIs is a common practice in such environments.
+
+However, simply removing secondary servers from the target Network Load Balancer (NLB) one by one does not prevent read requests from being routed to them.
+This occurs because the NLB and Neo4j server-side routing operate independently and do not share awareness of server availability.
+
+To correctly remove a secondary server from the cluster and reintroduce it after the update:
+
+. Remove the server from the NLB to stop traffic routing.
+. Shut down the server before proceeding with the AMI update.
+
+
+Here are the steps:
+
+. Remove the secondary from the AWS NLB.
+ This prevents external clients from sending requests to the secondary.
+
+. Since Neo4j's cluster routing (server-side routing) does not use the NLB, you need to ensure that queries are not routed to the secondary server.
+To do this, you have to cleanly shut down the secondary.
+
+.. Run the following query to check servers are hosting all their assigned databases.
+The query should return no results:
++
+[source, cypher, role=noplay]
+----
+SHOW SERVERS YIELD name, hosting, requestedHosting, serverId WHERE requestedHosting <> hosting
+----
+
+.. Use the following query to check all databases are in their expected state.
+The query should return no results:
++
+[source, cypher, role=noplay]
+----
+SHOW DATABASES YIELD name, address, currentStatus, requestedStatus, statusMessage WHERE currentStatus <> requestedStatus RETURN name, address, currentStatus, requestedStatus, statusMessage
+----
+
+.. To stop the Neo4j service, run the following command:
++
+[source, shell, role=copy]
+----
+sudo systemctl stop neo4j
+----
++
+To configure the timeout period for waiting on active transactions to either complete or be terminated during shutdown, you can modify the environment variable `NEO4J_SHUTDOWN_TIMEOUT` using `systemctl edit neo4j.service`
+or the setting xref::configuration/configuration-settings.adoc#config_db.shutdown_transaction_end_timeout[`db.shutdown_transaction_end_timeout`] in _neo4j.conf_ file.
++
+By default, `NEO4J_SHUTDOWN_TIMEOUT` is set to 120 seconds and `db.shutdown_transaction_end_timeout` -- to 10 seconds.
++
+If the shutdown process exceeds these limits, it is considered failed.
+You may need to increase the values if the system serves long-running transactions.
+
+.. Verify that the shutdown process has finished successfully by checking the _neo4j.log_ for relevant log messages confirming the shutdown.
+
+
+. When everything is updated or fixed, start the secondaries one by one again.
+.. Run `systemctl start neo4j`.
+.. Once the server has been restarted, confirm it is running successfully.
++
+Run the following command and check the server has state `Enabled` and health `Available`.
++
+[source, cypher, role=noplay]
+----
+SHOW SERVERS WHERE name = [server-id];
+----
+
+.. Confirm that the server has started all the databases that it should.
++
+This command shows any databases that are not in their expected state:
++
+[source, cypher, role=noplay]
+----
+SHOW DATABASES YIELD name, address, currentStatus, requestedStatus, serverID WHERE currentStatus <> requestedStatus AND serverID = [server-id] RETURN name, address, currentStatus, requestedStatus
+----
+
+. Reattach the secondary to the NLB.
+Once the secondary server is stable and caught up, add it back to the AWS NLB target group.
+
+