Skip to content

Commit 9b98b8a

Browse files
committed
CNV-15574: Troubleshooting guide assembly
1 parent f56f48b commit 9b98b8a

8 files changed

+218
-116
lines changed

_topic_maps/_topic_map.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3454,7 +3454,7 @@ Topics:
34543454
File: virt-exposing-custom-metrics-for-vms
34553455
- Name: OpenShift Virtualization critical alerts
34563456
File: virt-virtualization-alerts
3457-
- Name: Collecting OpenShift Virtualization data for Red Hat Support
3457+
- Name: Collecting data for Red Hat Support
34583458
File: virt-collecting-virt-data
34593459
Distros: openshift-enterprise
34603460
- Name: Backup and restore

modules/virt-about-collecting-virt-data.adoc

Lines changed: 0 additions & 21 deletions
This file was deleted.
Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
// Module included in the following assemblies:
2+
//
3+
// * virt/logging_events_monitoring/virt-collecting-virt-data.adoc
4+
5+
:_content-type: PROCEDURE
6+
[id="virt-collecting-data-about-vms_{context}"]
7+
= Collecting data about virtual machines
8+
9+
Collecting data about malfunctioning virtual machines (VMs) minimizes the time required to analyze and determine the root cause.
10+
11+
.Prerequisites
12+
13+
* Windows VMs:
14+
** Record the Windows patch update details for Red Hat Support.
15+
** Install the latest version of the VirtIO drivers. The VirtIO drivers include the QEMU guest agent.
16+
** If Remote Desktop Protocol (RDP) is enabled, try to connect to the VMs with RDP to determine whether there is a problem with the connection software.
17+
18+
.Procedure
19+
20+
. Collect detailed `must-gather` data about the malfunctioning VMs.
21+
. Collect screenshots of VMs that have crashed before you restart them.
22+
. Record factors that the malfunctioning VMs have in common. For example, the VMs have the same host or network.
23+
// . Collect memory dumps from VMs _before_ remediation attempts.
24+
// Uncomment this line for CNV-20256.
Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,22 @@
1+
// Module included in the following assemblies:
2+
//
3+
// * virt/logging_events_monitoring/virt-collecting-virt-data.adoc
4+
5+
:_content-type: PROCEDURE
6+
[id="virt-collecting-data-about-your-environment_{context}"]
7+
= Collecting data about your environment
8+
9+
Collecting data about your environment minimizes the time required to analyze and determine the root cause.
10+
11+
.Prerequisites
12+
13+
* Set the retention time for Prometheus metrics data to a minimum of seven days.
14+
* Configure the Alertmanager to capture relevant alerts and to send them to a dedicated mailbox so that they can be viewed and persisted outside the cluster.
15+
* Record the exact number of affected nodes and virtual machines.
16+
17+
.Procedure
18+
19+
. Collect `must-gather` data for the cluster by using the default `must-gather` image.
20+
. Collect `must-gather` data for {rh-storage-first}, if necessary.
21+
. Collect `must-gather` data for {VirtProductName} by using the {VirtProductName} `must-gather` image.
22+
. Collect Prometheus metrics for the cluster.

modules/virt-must-gather-options.adoc

Lines changed: 106 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,106 @@
1+
// Module included in the following assemblies:
2+
//
3+
// * virt/logging_events_monitoring/virt-collecting-virt-data.adoc
4+
5+
[id="virt-must-gather-options_{context}"]
6+
= must-gather tool options
7+
8+
You can specify a combination of scripts and environment variables for the following options:
9+
10+
* Collecting detailed virtual machine (VM) information from a namespace
11+
* Collecting detailed information about specified VMs
12+
* Collecting image and image stream information
13+
* Limiting the maximum number of parallel processes used by the `must-gather` tool
14+
15+
[id="parameters"]
16+
== Parameters
17+
18+
.Environment variables
19+
20+
You can specify environment variables for a compatible script.
21+
22+
`NS=<namespace_name>`:: Collect virtual machine information, including `virt-launcher` pod details, from the namespace that you specify. The `VirtualMachine` and `VirtualMachineInstance` CR data is collected for all namespaces.
23+
24+
`VM=<vm_name>`:: Collect details about a particular virtual machine. To use this option, you must also specify a namespace by using the `NS` environment variable.
25+
26+
`PROS=<number_of_processes>`:: Modify the maximum number of parallel processes that the `must-gather` tool uses. The default value is `5`.
27+
+
28+
[IMPORTANT]
29+
====
30+
Using too many parallel processes can cause performance issues. Increasing the maximum number of parallel processes is not recommended.
31+
====
32+
33+
.Scripts
34+
35+
Each script is only compatible with certain environment variable combinations.
36+
37+
`gather_vms_details`:: Collect VM log files, VM definitions, and namespaces (and their child objects) that belong to {VirtProductName} resources. If you use this parameter without specifying a namespace or VM, the `must-gather` tool collects this data for all VMs in the cluster. This script is compatible with all environment variables, but you must specify a namespace if you use the `VM` variable.
38+
39+
`gather`:: Use the default `must-gather` script, which collects cluster data from all namespaces and includes only basic VM information. This script is only compatible with the `PROS` variable.
40+
41+
`gather_images`:: Collect image and image stream custom resource information. This script is only compatible with the `PROS` variable.
42+
43+
[id="usage-and-examples_{context}"]
44+
== Usage and examples
45+
46+
Environment variables are optional. You can run a script by itself or with one or more compatible environment variables.
47+
48+
[cols="1a,1a", options="header"]
49+
.Compatible parameters
50+
|===
51+
|Script |Compatible environment variable
52+
|`gather_vms_details`
53+
|* For a namespace: `NS=<namespace_name>`
54+
55+
* For a VM: `VM=<vm_name> NS=<namespace_name>`
56+
57+
* `PROS=<number_of_processes>`
58+
59+
|`gather` |* `PROS=<number_of_processes>`
60+
|`gather_images` |* `PROS=<number_of_processes>`
61+
|===
62+
63+
To customize the data that `must-gather` collects, you append a double dash (`--`) to the command, followed by a space and one or more compatible parameters.
64+
65+
.Syntax
66+
67+
[source,terminal,subs="attributes+"]
68+
----
69+
$ oc adm must-gather \
70+
--image=registry.redhat.io/container-native-virtualization/cnv-must-gather-rhel8:v{HCOVersion} \
71+
-- <environment_variable_1> <environment_variable_2> <script_name>
72+
----
73+
74+
.Detailed VM information
75+
76+
The following command collects detailed VM information for the `my-vm` VM in the `mynamespace` namespace:
77+
78+
[source,terminal,subs="attributes+"]
79+
----
80+
$ oc adm must-gather \
81+
--image=registry.redhat.io/container-native-virtualization/cnv-must-gather-rhel8:v{HCOVersion} \
82+
-- NS=mynamespace VM=my-vm gather_vms_details <1>
83+
----
84+
<1> The `NS` environment variable is mandatory if you use the `VM` environment variable.
85+
86+
.Default data collection limited to three parallel processes
87+
88+
The following command collects default `must-gather` information by using a maximum of three parallel processes:
89+
90+
[source,terminal,subs="attributes+"]
91+
----
92+
$ oc adm must-gather \
93+
--image=registry.redhat.io/container-native-virtualization/cnv-must-gather-rhel8:v{HCOVersion} \
94+
-- PROS=3 gather
95+
----
96+
97+
.Image and image stream information
98+
99+
The following command collects image and image stream information from the cluster:
100+
101+
[source,terminal,subs="attributes+"]
102+
----
103+
$ oc adm must-gather \
104+
--image=registry.redhat.io/container-native-virtualization/cnv-must-gather-rhel8:v{HCOVersion} \
105+
-- gather_images
106+
----

modules/virt-must-gather-usage-targeted-vm-data.adoc

Lines changed: 0 additions & 82 deletions
This file was deleted.
Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
// Module included in the following assemblies:
2+
//
3+
// * virt/logging_events_monitoring/virt-collecting-virt-data.adoc
4+
5+
//This file contains UI elements and/or package names that need to be updated.
6+
7+
:_content-type: PROCEDURE
8+
[id="virt-using-virt-must-gather_{context}"]
9+
= Using the must-gather tool for {VirtProductName}
10+
11+
You can collect data about {VirtProductName} resources by running the `must-gather` command with the {VirtProductName} image.
12+
13+
The default data collection includes information about the following resources:
14+
15+
* {VirtProductName} Operator namespaces, including child objects
16+
* {VirtProductName} custom resource definitions
17+
* Namespaces that contain virtual machines
18+
* Basic virtual machine definitions
19+
20+
.Procedure
21+
22+
* Run the following command to collect data about {VirtProductName}:
23+
+
24+
[source,terminal]
25+
----
26+
$ oc adm must-gather --image-stream=openshift/must-gather \
27+
--image=registry.redhat.io/container-native-virtualization/cnv-must-gather-rhel8:v{HCOVersion}
28+
----
Lines changed: 37 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,25 +1,50 @@
11
:_content-type: ASSEMBLY
22
include::_attributes/common-attributes.adoc[]
33
[id="virt-collecting-virt-data"]
4-
= Collecting {VirtProductName} data for Red Hat Support
4+
= Collecting data for Red Hat Support
55
:context: virt-collecting-virt-data
66

77
toc::[]
88

9-
When opening a support case, it is helpful to provide debugging
10-
information about your cluster to Red Hat Support.
9+
When you submit a xref:../../support/getting-support.adoc#support-submitting-a-case_getting-support[support case] to Red Hat Support, it is helpful to provide debugging information for {product-title} and {VirtProductName} by using the following tools:
1110

12-
The `must-gather` tool enables you to collect diagnostic information about your
13-
{product-title} cluster, including virtual machines and other data related to
14-
{VirtProductName}.
11+
must-gather tool::
12+
The `must-gather` tool collects diagnostic information, including resource definitions and service logs.
1513

16-
For prompt support, supply diagnostic information for both {product-title}
17-
and {VirtProductName}.
14+
Prometheus::
15+
Prometheus is a time-series database and a rule evaluation engine for metrics. Prometheus sends alerts to Alertmanager for processing.
1816

19-
include::modules/about-must-gather.adoc[leveloffset=+1]
17+
Alertmanager::
18+
The Alertmanager service handles alerts received from Prometheus. The Alertmanager is also responsible for sending the alerts to external notification systems.
2019

21-
include::modules/virt-about-collecting-virt-data.adoc[leveloffset=+1]
20+
include::modules/virt-collecting-data-about-your-environment.adoc[leveloffset=+1]
2221

23-
include::modules/gathering-data-specific-features.adoc[leveloffset=+1]
22+
[id="additional-resources_collecting-data-about-your-environment"]
23+
[role="_additional-resources"]
24+
=== Additional resources
25+
* Configuring the xref:../../monitoring/configuring-the-monitoring-stack.adoc#modifying-retention-time-for-prometheus-metrics-data_configuring-the-monitoring-stack[retention time] for Prometheus metrics data
26+
* Configuring the Alertmanager to send xref:../../monitoring/managing-alerts.adoc#sending-notifications-to-external-systems_managing-alerts[alert notifications] to external systems
27+
* Collecting `must-gather` data for xref:../../support/gathering-cluster-data.adoc#support_gathering_data_gathering-cluster-data[{product-title}]
28+
* Collecting `must-gather` data for link:https://access.redhat.com/documentation/en-us/red_hat_openshift_data_foundation/4.10/html-single/troubleshooting_openshift_data_foundation/index#downloading-log-files-and-diagnostic-information_rhodf[{rh-storage-first}].
29+
* Collecting `must-gather` data for xref:../../virt/logging_events_monitoring/virt-collecting-virt-data.adoc#virt-using-virt-must-gather_virt-collecting-virt-data[{VirtProductName}]
30+
* Collecting Prometheus metrics for xref:../../monitoring/managing-metrics.adoc#querying-metrics-for-all-projects-as-an-administrator_managing-metrics[all projects] as a cluster administrator
2431

25-
include::modules/virt-must-gather-usage-targeted-vm-data.adoc[leveloffset=+1]
32+
include::modules/virt-collecting-data-about-vms.adoc[leveloffset=+1]
33+
34+
[id="additional-resources_collecting-data-about-vms"]
35+
[role="_additional-resources"]
36+
=== Additional resources
37+
* Installing xref:../../virt/virtual_machines/virt-installing-qemu-guest-agent.adoc#virt-installing-virtio-drivers-existing-windows_virt-installing-qemu-guest-agent[VirtIO drivers] on Windows VMs
38+
* Downloading and installing link:https://access.redhat.com/solutions/6957701[VirtIO drivers] on Windows VMs without host access
39+
* Connecting to Windows VMs with RDP using the xref:../../virt/virtual_machines/virt-accessing-vm-consoles.adoc#virt-vm-rdp-console-web_virt-accessing-vm-consoles[web console] or the xref:../../virt/virtual_machines/virt-accessing-vm-consoles.adoc#virt-accessing-rdp-console_virt-accessing-vm-consoles[command line]
40+
* Collecting `must-gather` data about xref:../../virt/logging_events_monitoring/virt-collecting-virt-data.adoc#virt-must-gather-options_virt-collecting-virt-data[virtual machines]
41+
// * Collecting virtual machine memory dumps. [link TBD. CNV-20256]
42+
43+
include::modules/virt-using-virt-must-gather.adoc[leveloffset=+1]
44+
45+
include::modules/virt-must-gather-options.adoc[leveloffset=+2]
46+
47+
[id="additional-resources_must-gather-virt"]
48+
[role="_additional-resources"]
49+
=== Additional resources
50+
* xref:../../support/gathering-cluster-data.adoc#about-must-gather_gathering-cluster-data[About the `must-gather` tool]

0 commit comments

Comments
 (0)