MediaPipe generic metrics documentation (#2691)

dkalinowski · atobiszei · web-flow · commit 2cfa1b7309a3 · 2024-09-18T09:20:11.000+02:00
CVS-151870

---------

Co-authored-by: Adrian Tobiszewski &lt;adrian.tobiszewski@intel.com&gt;
diff --git a/docs/metrics.md b/docs/metrics.md
@@ -28,10 +28,14 @@ However, you can enable also additional metrics by listing all the metrics you w
 Default metrics
 | Type      | Name | Labels | Description |
 | :---    |    :----   |    :----   |    :----       |
-| gauge      | ovms_streams | name,version | Number of OpenVINO execution streams |
-| gauge      | ovms_current_requests | name,version | Number of requests being currently processed by the model server |
+| gauge      | ovms_streams | name,version | Number of OpenVINO execution streams. |
+| gauge      | ovms_current_requests | name,version | Number of requests being currently processed by the model server. |
+| gauge      | ovms_current_graphs | name | Number of MediaPipe graphs in process. |
 | counter      | ovms_requests_success | api,interface,method,name,version | Number of successful requests to a model or a DAG. |
 | counter      | ovms_requests_fail | api,interface,method,name,version | Number of failed requests to a model or a DAG. |
+| counter      | ovms_requests_accepted | api,interface,method,name | Number of accepted requests which ended up inserting packet(s) into a MediaPipe graph. |
+| counter      | ovms_requests_rejected | api,interface,method,name | Number of rejected which failed at MediaPipe packet creation step. |
+| counter      | ovms_responses | api,interface,method,name | Number of responses generated by the MediaPipe graph. |
 | histogram      | ovms_request_time_us | interface,name,version | Processing time of requests to a model or a DAG. |
 | histogram      | ovms_inference_time_us | name,version | Inference execution time in the OpenVINO backend. |
 | histogram      | ovms_wait_for_infer_req_time_us | name,version | Request waiting time in the scheduling queue. Indicates how long the request has to wait before required resources are assigned to it. |
@@ -47,11 +51,11 @@ Optional metrics
 Labels description
 | Name      | Values |  Description |
 | :---    |    :----   |    :----   |
-| api      | KServe, TensorFlowServing  | Name of the serving API. |
+| api      | KServe, TensorFlowServing, V3  | Name of the serving API. |
 | interface      | REST, gRPC | Name of the serving interface. |
-| method      | ModelMetadata, ModelReady, ModelInfer, Predict, GetModelStatus, GetModelMetadata | Interface methods. |
-| version      | 1, 2, ..., n | Model version. Note that GetModelStatus and ModelReady do not have the version label. |
-| name      | As defined in model server config | Model name or DAG name. |
+| method      | ModelMetadata, ModelReady, ModelInfer, Predict, GetModelStatus, GetModelMetadata, Unary, Stream | Interface methods. |
+| version      | 1, 2, ..., n | Model version. Note that GetModelStatus and ModelReady and all MediaPipe servables do not have the version label. |
+| name      | As defined in model server config | Model name, DAG name or MediaPipe graph name. |
 
 
 ## Enable metrics
@@ -175,10 +179,14 @@ echo '{
              "metrics_list": 
                  [ "ovms_requests_success",
                  "ovms_requests_fail",
+                 "ovms_requests_accepted",
+                 "ovms_requests_rejected",
+                 "ovms_responses",
                  "ovms_inference_time_us",
                  "ovms_wait_for_infer_req_time_us",
                  "ovms_request_time_us",
                  "ovms_current_requests",
+                 "ovms_current_graphs",
                  "ovms_infer_req_active",
                  "ovms_streams",
                  "ovms_infer_req_queue_size"]
@@ -224,7 +232,17 @@ It means that each request to the DAG pipeline will update also the metrics for
 
 ## Metrics implementation for MediaPipe Graphs
 
-For [MediaPipe Graphs](./mediapipe.md) metrics endpoint is not supported.
+For [MediaPipe Graphs](./mediapipe.md) execution there are 4 generic metrics which apply to all graphs:
+
+| Type      | Name  | Description |
+| :---    |    :----   |    :----   |
+| counter      | ovms_requests_accepted | Counts number of requests which ended up pushing MediaPipe packet down the graph stream. For example image frame in vision use cases, LLM prompt in text generation use cases. |
+| counter      | ovms_requests_rejected | Counts errors in MediaPipe packet creation phase. For example bad image format in vision use cases. Please note that for V3 API, the LLM request is validated at graph node level meaning that packet creation always succeeds. Please refer to specific graph definition and implementation. |
+| counter      | ovms_responses | Useful to track number of packets generated by MediaPipe graph. Keep in mind that single request may trigger production of multiple (or zero) packets, therefore tracking number of responses is complementary to tracking accepted requests. For example tracking streaming partial responses of LLM text generation graphs. |
+| gauge      | ovms_current_graphs | Number of graphs currently in-process. For unary communication it is equal to number of currently processing requests (each request initializes separate MediaPipe graph). For streaming communication it is equal to number of active client connections. Each connection is able to reuse the graph and decide when to delete it when the connection is closed. |
+
+Exposing custom metrics in calculator implementations (MediaPipe graph nodes) is not supported yet.
+
 
 ## Visualize with Grafana