You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/README.adoc
+36-28Lines changed: 36 additions & 28 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,20 +4,20 @@
4
4
5
5
The troubleshooting panel displays a graph of related resources and observability signals.
6
6
Clicking on a node in the graph opens a console page showing the details of each resource or signal.
7
-
Nodes in the graph represent a resource or signal type, edges represent relationships.
7
+
Nodes in the graph represent a type of resource or signal, while edges represent relationships.
8
8
9
9
The panel provides a map of related information, so you can navigate more quickly to relevant data.
10
10
It may also help to find related information that you were not aware of.
11
11
12
-
Lets consider the example of troubleshooting an Alert on the OCP console.
12
+
Consider an example of troubleshooting an Alert on the OCP console.
13
13
14
14
== Opening the panel
15
15
16
16
First open the alert of interest in the console.
17
-
Now open the troubleshooting panel [FIXME global button]
17
+
Now open the troubleshooting panel [FIXME screenshot of global button]
18
18
19
-
Opening panel shows a _neighbourhood_ of the resource currently displayed in the console.
20
-
A neighbourhood graph starts at the current resource, and includes related objects up to
19
+
Opening the panel shows a _neighbourhood_ of the resource currently displayed in the console.
20
+
A neighbourhood is a graph that starts at the current resource, and includes related objects up to
21
21
3 steps away from the starting point.
22
22
23
23
NOTE: Not all resource types are currently supported, more will be added in future.
@@ -27,41 +27,49 @@ For an unsupported resource, the panel will be empty.
27
27
image::images/panel-graph.png[]
28
28
29
29
30
-
<1> Alert(1) node represents the starting point alert in the console. TYPE OF ALERT
31
-
<2> Pod(1) indicates that there is a single Pod resource associated with this alert. Clicking on this node will open a console search showing the related pod directly.
32
-
<3> Event(2) there are two kuberenetes events associated with the Pod, you can see them by clicking this node.
33
-
<3> Logs(74) The pod has emitted logs, click to jump to them.
34
-
<4> Metrics(105) There are many metrics associated with every Pod.
35
-
<6> Network(6) shows that there are network events, which means the pod has communicated over the network. The remaining nodes are the Service, Deployment and DaemonSet that the pod has communicated with.
36
-
<7> Focus: Clicking on nodes in the graph will change what is shown in the main console page, but the graph will not change.
37
-
You can also navigate using links on the console page while the panel is open.
38
-
To update the graph, click "Focus". This will draw a new graph using the resource shown in the main page as the starting point.
39
-
<8> Show Query enables some experimental features, see below.
40
-
41
-
NOTE: Clicking on a node may sometimes show fewer results than are indicated on the graph. This is a known issue that will be addressed in future.
30
+
<1> Alert(1): This node represents the starting point, a `KubeContainerWaiting` alert that was displayed in the console.
31
+
<2> Pod(1): This node indicates there is a single Pod resource associated with this alert. Clicking on this node will show the pod details in the console.
32
+
<3> Event(2): There are two kuberenetes events associated with the Pod, and you can see them by clicking this node.
33
+
<3> Logs(74): The pod has emitted 74 lines of logs. Click to show them.
34
+
<4> Metrics(105): There are always many metrics associated with every Pod.
35
+
<6> Network(6): There are network events associated with the pod, which means it has communicated with other resources in the cluster.
36
+
The remaining Service, Deployment and DaemonSet nodes are the resources that the pod has communicated with.
37
+
<7> Focus: Clicking nodes changes what is shown in the main console. You can navigate using links on the console page while the panel is open.
38
+
The graph will not change until you click "Focus".
39
+
This will draw a new graph starting from the resource shown in the main console.
40
+
<8> Show Query: enables experimental features detailed below.
42
41
42
+
NOTE: Clicking on a node may sometimes show fewer results than are indicated on the graph.
43
+
This is a known issue that will be addressed in future.
43
44
44
45
== Experimental features
45
46
46
47
[.border]
47
48
image::images/query-details.png[]
48
49
49
50
<1> Hide Query hides the experimental features.
50
-
<2> The "Korrel8r query" that identifies the starting point for the graph. This query language is experimental and will change in future.
51
-
It is updated by the "Focus" button to correspond to the resources in the main console window. (TODO upstream reference or no?)
51
+
<2> The query that identifies the starting point for the graph. The format of this query is experimental and may change in future.
52
+
footnote:[This query language is part of https://korrel8r.github.io/korrel8r[Korrel8r], the correlation engine used to create the graphs]
53
+
The "Focus" button updates the query to match the resources in the main console window.
52
54
<3> Neighbourhood depth: increase or decrease to see a smaller or larger neighbourhood.
53
55
Note: setting a large value in a large cluster may cause the query to fail if the number of results is too big.
54
56
<4> Goal class: Selecting this option will do a _goal directed search_ instead of a neighbourhood search.
55
57
A goal directed search will show all paths from the starting point to the goal _class_ , which indicates a type of resource or signal.
56
-
The format of the goal class is experimental and may change. Currently valid goals are
57
-
- `k8s:RESOURCE[VERSION.[GROUP]]` - identifies a Kind of kuberenetes resource. For example `k8s:Pod` or `k8s:Deployment.apps.v1`.
58
-
- `alert:alert` - any alert.
59
-
- `metric:metric` - any metric.
60
-
- `netflow:network` - any netobserv network event.
61
-
- `log:LOG_TYPE` - LOG_TYPE must be `application`, `infrastructure` or `audit`
58
+
59
+
The format of the goal class is experimental and may change. The valid goal classes are:
60
+
61
+
[horizontal]
62
+
`k8s:__resource[.version.[group]]__` :: Kind of Kuberenetes resource. For example `k8s:Pod` or `k8s:Deployment.apps.v1`.
63
+
`alert:alert`:: Any alert.
64
+
`metric:metric`:: Any metric.
65
+
`netflow:network`:: Any network observability event.
66
+
`log:__log_type__`:: Stored logs, __log_type__ must be `application`, `infrastructure` or `audit`
62
67
63
68
== Optional signal stores
64
69
65
-
The panel will show the resources and signals that are installed for your cluster.
66
-
Kuberenetes resources, alerts and metrics are always available in OCP.
67
-
Other signals are optional: logs or network flow events will only appear if Openshift Logging or Network Observabilty are installed.
70
+
The troubleshooting panel relies on the observability signal stores installed in your cluster.
71
+
Kuberenetes resources, alerts and metrics are available by default in an OCP cluster.
72
+
Other types of signal require optional components to be installed:
73
+
74
+
- Logs: "Red Hat Openshift Logging" (collection) and "Loki Operator provided by Red Hat" (store)
75
+
- Network Events: "Network Observability provided by Red Hat" (collection) and "Loki Operator provided by Red Hat" (store)
0 commit comments