Commit b06be4b

Merge pull request #218698 from sreekzz/patch-124
Removed Storm Contents Phase 2
2 parents 341630f + f058e1c

10 files changed: 15 additions, 60 deletions

articles/hdinsight/hadoop/apache-hadoop-etl-at-scale.md
Lines changed: 2 additions & 2 deletions

@@ -4,7 +4,7 @@ description: Learn how extract, transform, and load is used in HDInsight with Ap
 ms.service: hdinsight
 ms.topic: how-to
 ms.custom: hdinsightactive,seoapr2020
-ms.date: 04/01/2022
+ms.date: 11/17/2022
 ---

 # Extract, transform, and load (ETL) at scale
@@ -66,7 +66,7 @@ Azure Data Lake Storage is a managed, hyperscale repository for analytics data.

 Data is usually ingested into Data Lake Storage through Azure Data Factory. You can also use Data Lake Storage SDKs, the AdlCopy service, Apache DistCp, or Apache Sqoop. The service you choose depends on where the data is. If it's in an existing Hadoop cluster, you might use Apache DistCp, the AdlCopy service, or Azure Data Factory. For data in Azure Blob storage, you might use Azure Data Lake Storage .NET SDK, Azure PowerShell, or Azure Data Factory.

-Data Lake Storage is optimized for event ingestion through Azure Event Hubs or Apache Storm.
+Data Lake Storage is optimized for event ingestion through Azure Event Hubs.

 ### Considerations for both storage options
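The context paragraph in the hunk above maps data locations to suggested ingestion tools. As an illustrative sketch only (the function and category names are invented, not from the article), that guidance could be expressed as a small lookup:

```python
# Hypothetical helper: map a data-source location to the ingestion tools
# the article suggests for it. Keys and names are made up for illustration.

INGESTION_TOOLS = {
    "hadoop-cluster": [
        "Apache DistCp", "AdlCopy service", "Azure Data Factory",
    ],
    "azure-blob-storage": [
        "Azure Data Lake Storage .NET SDK", "Azure PowerShell", "Azure Data Factory",
    ],
}

def suggest_ingestion_tools(source: str) -> list[str]:
    """Return the ingestion tools suggested for a given data location."""
    try:
        return INGESTION_TOOLS[source]
    except KeyError:
        raise ValueError(f"unknown data source {source!r}") from None

print(suggest_ingestion_tools("hadoop-cluster"))
# ['Apache DistCp', 'AdlCopy service', 'Azure Data Factory']
```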

articles/hdinsight/hdinsight-hadoop-create-linux-clusters-curl-rest.md
Lines changed: 1 addition & 2 deletions

@@ -4,7 +4,7 @@ description: Learn how to create HDInsight clusters by submitting Azure Resource
 ms.service: hdinsight
 ms.topic: how-to
 ms.custom: hdinsightactive, devx-track-azurecli
-ms.date: 08/05/2022
+ms.date: 11/17/2022
 ---

 # Create Apache Hadoop clusters using the Azure REST API
@@ -35,7 +35,6 @@ The following JSON document is a merger of the template and parameters files fro
         "type": "string",
         "allowedValues": ["hadoop",
             "hbase",
-            "storm",
             "spark"],
         "metadata": {
             "description": "The type of the HDInsight cluster to create."
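The hunk above narrows the template's `allowedValues` for the cluster-type parameter. A caller assembling the REST payload could guard against values the template would now reject before submitting the deployment; this is a hypothetical pre-check sketch, not code from the article:

```python
# Hypothetical pre-check: validate a clusterType parameter against the
# template's allowedValues (per the diff above, "storm" is no longer allowed).

ALLOWED_CLUSTER_TYPES = ["hadoop", "hbase", "spark"]

def validate_cluster_type(cluster_type: str) -> str:
    """Return the normalized cluster type, or raise if the template rejects it."""
    normalized = cluster_type.lower()
    if normalized not in ALLOWED_CLUSTER_TYPES:
        raise ValueError(
            f"clusterType {cluster_type!r} is not one of {ALLOWED_CLUSTER_TYPES}"
        )
    return normalized

print(validate_cluster_type("Hadoop"))  # hadoop
```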

articles/hdinsight/hdinsight-hadoop-customize-cluster-bootstrap.md
Lines changed: 1 addition & 2 deletions

@@ -4,7 +4,7 @@ description: Learn how to customize HDInsight cluster configuration programmatic
 ms.service: hdinsight
 ms.topic: how-to
 ms.custom: hdinsightactive, devx-track-azurepowershell
-ms.date: 05/31/2022
+ms.date: 11/17/2022
 ---

 # Customize HDInsight clusters using Bootstrap
@@ -30,7 +30,6 @@ For example, using these programmatic methods, you can configure options in thes
 * mapred-site
 * oozie-site.xml
 * oozie-env.xml
-* storm-site.xml
 * tez-site.xml
 * webhcat-site.xml
 * yarn-site.xml

articles/hdinsight/hdinsight-hadoop-linux-information.md
Lines changed: 3 additions & 3 deletions

@@ -4,7 +4,7 @@ description: Get implementation tips for using Linux-based HDInsight (Hadoop) cl
 ms.service: hdinsight
 ms.custom: hdinsightactive,seoapr2020
 ms.topic: conceptual
-ms.date: 04/29/2020
+ms.date: 11/17/2022
 ---

 # Information about using HDInsight on Linux
@@ -111,7 +111,7 @@ In HDInsight, the data storage resources (Azure Blob Storage and Azure Data Lake

 ### <a name="URI-and-scheme"></a>URI and scheme

-Some commands may require you to specify the scheme as part of the URI when accessing a file. For example, the Storm-HDFS component requires you to specify the scheme. When using non-default storage (storage added as "additional" storage to the cluster), you must always use the scheme as part of the URI.
+Some commands may require you to specify the scheme as part of the URI when accessing a file. When using non-default storage (storage added as "additional" storage to the cluster), you must always use the scheme as part of the URI.

 When using [**Azure Storage**](./hdinsight-hadoop-use-blob-storage.md), use one of the following URI schemes:

@@ -243,4 +243,4 @@ To use a different version of a component, upload the version you need and use i

 * [Manage HDInsight clusters by using the Apache Ambari REST API](./hdinsight-hadoop-manage-ambari-rest-api.md)
 * [Use Apache Hive with HDInsight](hadoop/hdinsight-use-hive.md)
-* [Use MapReduce jobs with HDInsight](hadoop/hdinsight-use-mapreduce.md)
+* [Use MapReduce jobs with HDInsight](hadoop/hdinsight-use-mapreduce.md)
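The changed paragraph in this file keeps the rule that non-default ("additional") storage always needs an explicit scheme in the URI. A minimal sketch of composing such a fully qualified URI for Azure Storage follows; the account, container, and path names are invented for illustration:

```python
# Illustrative sketch: build a fully qualified wasbs:// URI so the scheme is
# always explicit, as required for non-default cluster storage.

def wasbs_uri(container: str, account: str, path: str) -> str:
    """Build a wasbs:// URI for a blob-backed HDInsight storage account."""
    return f"wasbs://{container}@{account}.blob.core.windows.net/{path.lstrip('/')}"

print(wasbs_uri("mycontainer", "mystorage", "/example/data/sample.log"))
# wasbs://mycontainer@mystorage.blob.core.windows.net/example/data/sample.log
```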

articles/hdinsight/hdinsight-hadoop-use-data-lake-storage-gen1.md
Lines changed: 1 addition & 2 deletions

@@ -4,7 +4,7 @@ description: Learn how to query data from Azure Data Lake Storage Gen1 and to st
 ms.service: hdinsight
 ms.topic: how-to
 ms.custom: hdinsightactive,hdiseo17may2017,seoapr2020, devx-track-azurepowershell
-ms.date: 09/15/2022
+ms.date: 11/17/2022
 ---

 # Use Data Lake Storage Gen1 with Azure HDInsight clusters
@@ -40,7 +40,6 @@ Currently, only some of the HDInsight cluster types/versions support using Data
 | HDInsight version 3.4 | No | Yes | |
 | HDInsight version 3.3 | No | No | |
 | HDInsight version 3.2 | No | Yes | |
-| Storm | | |You can use Data Lake Storage Gen1 to write data from a Storm topology. You can also use Data Lake Storage Gen1 for reference data that can then be read by a Storm topology.|

 > [!WARNING]
 > HDInsight HBase is not supported with Azure Data Lake Storage Gen1

articles/hdinsight/hdinsight-key-scenarios-to-monitor.md
Lines changed: 1 addition & 2 deletions

@@ -4,7 +4,7 @@ description: How to monitor health and performance of Apache Hadoop clusters in
 ms.service: hdinsight
 ms.topic: how-to
 ms.custom: hdinsightactive
-ms.date: 04/28/2022
+ms.date: 11/17/2022
 ---

 # Monitor cluster performance in Azure HDInsight
@@ -72,7 +72,6 @@ If your cluster's backing store is Azure Data Lake Storage (ADLS), your throttli

 * [Performance tuning guidance for Apache Hive on HDInsight and Azure Data Lake Storage](../data-lake-store/data-lake-store-performance-tuning-hive.md)
 * [Performance tuning guidance for MapReduce on HDInsight and Azure Data Lake Storage](../data-lake-store/data-lake-store-performance-tuning-mapreduce.md)
-* [Performance tuning guidance for Apache Storm on HDInsight and Azure Data Lake Storage](../data-lake-store/data-lake-store-performance-tuning-storm.md)

 ## Troubleshoot sluggish node performance

articles/hdinsight/hdinsight-scaling-best-practices.md
Lines changed: 1 addition & 32 deletions

@@ -6,7 +6,7 @@ author: yeturis
 ms.service: hdinsight
 ms.topic: how-to
 ms.custom: seoapr2020
-ms.date: 07/21/2022
+ms.date: 11/17/2022
 ---

 # Manually scale Azure HDInsight clusters
@@ -66,37 +66,6 @@ The impact of changing the number of data nodes varies for each type of cluster

     For more information on using the HBase shell, see [Get started with an Apache HBase example in HDInsight](hbase/apache-hbase-tutorial-get-started-linux.md).

-* Apache Storm
-
-    You can seamlessly add or remove data nodes while Storm is running. However, after a successful completion of the scaling operation, you'll need to rebalance the topology. Rebalancing allows the topology to readjust [parallelism settings](https://storm.apache.org/documentation/Understanding-the-parallelism-of-a-Storm-topology.html) based on the new number of nodes in the cluster. To rebalance running topologies, use one of the following options:
-
-    * Storm web UI
-
-        Use the following steps to rebalance a topology using the Storm UI.
-
-        1. Open `https://CLUSTERNAME.azurehdinsight.net/stormui` in your web browser, where `CLUSTERNAME` is the name of your Storm cluster. If prompted, enter the HDInsight cluster administrator (admin) name and password you specified when creating the cluster.
-
-        1. Select the topology you wish to rebalance, then select the **Rebalance** button. Enter the delay before the rebalance operation is done.
-
-            :::image type="content" source="./media/hdinsight-scaling-best-practices/hdinsight-portal-scale-cluster-storm-rebalance.png" alt-text="HDInsight Storm scale rebalance":::
-
-    * Command-line interface (CLI) tool
-
-        Connect to the server and use the following command to rebalance a topology:
-
-        ```bash
-        storm rebalance TOPOLOGYNAME
-        ```
-
-        You can also specify parameters to override the parallelism hints originally provided by the topology. For example, the code below reconfigures the `mytopology` topology to 5 worker processes, 3 executors for the blue-spout component, and 10 executors for the yellow-bolt component.
-
-        ```bash
-        ## Reconfigure the topology "mytopology" to use 5 worker processes,
-        ## the spout "blue-spout" to use 3 executors, and
-        ## the bolt "yellow-bolt" to use 10 executors
-        $ storm rebalance mytopology -n 5 -e blue-spout=3 -e yellow-bolt=10
-        ```
-
 * Kafka

     You should rebalance partition replicas after scaling operations. For more information, see the [High availability of data with Apache Kafka on HDInsight](./kafka/apache-kafka-high-availability.md) document.

articles/hdinsight/hdinsight-virtual-network-architecture.md
Lines changed: 1 addition & 4 deletions

@@ -3,7 +3,7 @@ title: Azure HDInsight virtual network architecture
 description: Learn the resources available when you create an HDInsight cluster in an Azure Virtual Network.
 ms.service: hdinsight
 ms.topic: conceptual
-ms.date: 04/01/2022
+ms.date: 11/17/2022
 ---

 # Azure HDInsight virtual network architecture
@@ -16,12 +16,9 @@ Azure HDInsight clusters have different types of virtual machines, or nodes. Eac

 | Type | Description |
 | --- | --- |
-| Head node | For all cluster types except Apache Storm, the head nodes host the processes that manage execution of the distributed application. The head node is also the node that you can SSH into and execute applications that are then coordinated to run across the cluster resources. The number of head nodes is fixed at two for all cluster types. |
 | ZooKeeper node | Zookeeper coordinates tasks between the nodes that are doing data processing. It also does leader election of the head node, and keeps track of which head node is running a specific master service. The number of ZooKeeper nodes is fixed at three. |
 | Worker node | Represents the nodes that support data processing functionality. Worker nodes can be added or removed from the cluster to scale computing capability and manage costs. |
 | Region node | For the HBase cluster type, the region node (also referred to as a Data Node) runs the Region Server. Region Servers serve and manage a portion of the data managed by HBase. Region nodes can be added or removed from the cluster to scale computing capability and manage costs.|
-| Nimbus node | For the Storm cluster type, the Nimbus node provides functionality similar to the Head node. The Nimbus node assigns tasks to other nodes in a cluster through Zookeeper, which coordinates the running of Storm topologies. |
-| Supervisor node | For the Storm cluster type, the supervisor node executes the instructions provided by the Nimbus node to do the processing. |

 ## Resource naming conventions
articles/hdinsight/log-analytics-migration.md
Lines changed: 1 addition & 8 deletions

@@ -5,7 +5,7 @@ ms.service: hdinsight
 ms.topic: how-to
 ms.author: sairamyeturi
 author: yeturis
-ms.date: 09/02/2022
+ms.date: 11/17/2022
 ---

 # Log Analytics migration guide for Azure HDInsight clusters
@@ -267,13 +267,6 @@ The following charts show the table mappings from the classic Azure Monitoring I
 | HDInsightHBaseMetrics | <ul><li>**Description**: This table contains JMX metrics from HBase. It contains all the same JMX metrics from the tables listed in the Old Schema column. In contrast from the old tables, each row contains one metric.</li><li>**Old table**: metrics\_regionserver\_CL, metrics\_regionserver\_wal\_CL, metrics\_regionserver\_ipc\_CL, metrics\_regionserver\_os\_CL, metrics\_regionserver\_replication\_CL, metrics\_restserver\_CL, metrics\_restserver\_jvm\_CL, metrics\_hmaster\_assignmentmanager\_CL, metrics\_hmaster\_ipc\_CL, metrics\_hmaser\_os\_CL, metrics\_hmaster\_balancer\_CL, metrics\_hmaster\_jvm\_CL, metrics\_hmaster\_CL,metrics\_hmaster\_fs\_CL</li></ul>|
 | HDInsightHBaseLogs | <ul><li>**Description**: This table contains logs from HBase and its related components: Phoenix and HDFS.</li><li>**Old table**: log\_regionserver\_CL, log\_restserver\_CL, log\_phoenixserver\_CL, log\_hmaster\_CL, log\_hdfsnamenode\_CL, log\_garbage\_collector\_CL</li></ul>|

-## Storm workload
-
-| New Table | Details |
-| --- | --- |
-| HDInsightStormMetrics | <ul><li>**Description**: This table contains the same JMX metrics as the tables in the Old Tables section. Its rows contain one metric per record.</li><li>**Old table**: metrics\_stormnimbus\_CL, metrics\_stormsupervisor\_CL</li></ul>|
-| HDInsightStormTopologyMetrics | <ul><li>**Description**: This table contains topology level metrics from Storm. It's the same shape as the table listed in Old Tables section.</li><li>**Old table**: metrics\_stormrest\_CL</li></ul>|
-| HDInsightStormLogs | <ul><li>**Description**: This table contains all logs generated from Storm.</li><li>**Old table**: log\_supervisor\_CL, log\_nimbus\_CL</li></ul>|

 ## Oozie workload

articles/hdinsight/manage-clusters-runbooks.md
Lines changed: 3 additions & 3 deletions

@@ -4,7 +4,7 @@ description: Learn how to create and delete Azure HDInsight clusters with script
 ms.service: hdinsight
 ms.custom: hdinsightactive
 ms.topic: tutorial
-ms.date: 12/27/2019
+ms.date: 11/17/2022
 ---

 # Tutorial: Create Azure HDInsight clusters with Azure Automation
@@ -108,7 +108,7 @@ If you don’t have an Azure subscription, create a [free account](https://azure
 #Automation credential for user to SSH into cluster
 $sshCreds = Get-AutomationPSCredential –Name 'ssh-password'

-$clusterType = "Hadoop" #Use any supported cluster type (Hadoop, HBase, Storm, etc.)
+$clusterType = "Hadoop" #Use any supported cluster type (Hadoop, HBase, etc.)
 $clusterOS = "Linux"
 $clusterWorkerNodes = 3
 $clusterNodeSize = "Standard_D3_v2"
@@ -162,4 +162,4 @@ When no longer needed, delete the Azure Automation Account that was created to a
 ## Next steps

 > [!div class="nextstepaction"]
-> [Manage Apache Hadoop clusters in HDInsight by using Azure PowerShell](hdinsight-administer-use-powershell.md)
+> [Manage Apache Hadoop clusters in HDInsight by using Azure PowerShell](hdinsight-administer-use-powershell.md)
