Commit 5a1c3e7 ("smoother docs"), 1 parent: 358108a

File tree: 20 files changed, +441 -352 lines
Lines changed: 45 additions & 31 deletions
Original file line number | Diff line number | Diff line change
@@ -1,18 +1,26 @@
1-
# BenchSpy - Your first test
1+
# BenchSpy - Your First Test
22

3-
Let's start with a simplest case, which doesn't require you to have any of the observability stack, but only `WASP` and the application you are testing.
4-
`BenchSpy` comes with some built-in `QueryExecutors` each of which additionaly has predefined metrics that you can use. One of these executors is the
5-
`DirectQueryExecutor` that fetches metrics directly from `WASP` generators.
3+
Let's start with the simplest case, which doesn't require any part of the observability stack, only `WASP` and the application you are testing.
4+
`BenchSpy` comes with built-in `QueryExecutors`, each of which also has predefined metrics that you can use. One of these executors is the `DirectQueryExecutor`, which fetches metrics directly from `WASP` generators,
5+
which means you can run it without Loki.
66

7-
Our first test will follow the following logic:
8-
* Run a simple load test
9-
* Generate the performance report and store it
10-
* Run the load again
11-
* Generate a new report and compare it to the previous one
7+
> [!NOTE]
8+
> Not sure whether to use `Loki` or `Direct` query executors? [Read this!](./loki_dillema.md)
9+
10+
## Test Overview
11+
12+
Our first test will follow this logic:
13+
- Run a simple load test.
14+
- Generate a performance report and store it.
15+
- Run the load test again.
16+
- Generate a new report and compare it to the previous one.
17+
18+
We'll use very simplified assertions for this example and expect the performance to remain unchanged.
1219

13-
We will use some very simplified assertions, used only for the sake of example, and expect the performance to remain unchanged.
20+
### Step 1: Define and Run a Generator
21+
22+
Let's start by defining and running a generator that uses a mocked service:
1423

15-
Let's start by defining and running a generator that will use a mocked service:
1624
```go
1725
gen, err := wasp.NewGenerator(&wasp.Config{
1826
T: t,
@@ -28,39 +36,43 @@ require.NoError(t, err)
2836
gen.Run(true)
2937
```
3038

31-
Now that we have load data, let's generate a baseline performance report and store it in the local storage:
32-
```go
33-
fetchCtx, cancelFn := context.WithTimeout(context.Background(), 60*time.Second)
34-
defer cancelFn()
39+
### Step 2: Generate a Baseline Performance Report
3540

41+
With load data available, let's generate a baseline performance report and store it in local storage:
42+
43+
```go
3644
baseLineReport, err := benchspy.NewStandardReport(
37-
// random hash, this should be commit or hash of the Application Under Test (AUT)
45+
// random hash, this should be the commit or hash of the Application Under Test (AUT)
3846
"e7fc5826a572c09f8b93df3b9f674113372ce924",
3947
// use built-in queries for an executor that fetches data directly from the WASP generator
4048
benchspy.WithStandardQueries(benchspy.StandardQueryExecutor_Direct),
4149
// WASP generators
4250
benchspy.WithGenerators(gen),
4351
)
44-
require.NoError(t, err, "failed to create original report")
52+
require.NoError(t, err, "failed to create baseline report")
53+
54+
fetchCtx, cancelFn := context.WithTimeout(context.Background(), 60*time.Second)
55+
defer cancelFn()
4556

4657
fetchErr := baseLineReport.FetchData(fetchCtx)
47-
require.NoError(t, fetchErr, "failed to fetch data for original report")
58+
require.NoError(t, fetchErr, "failed to fetch data for baseline report")
4859

4960
path, storeErr := baseLineReport.Store()
50-
require.NoError(t, storeErr, "failed to store current report", path)
61+
require.NoError(t, storeErr, "failed to store baseline report", path)
5162
```
5263

5364
> [!NOTE]
54-
> There's quite a lot to unpack here and you are enouraged to read more about build-in `QueryExecutors` and
55-
> standard metrics each comes with [here](./built_in_query_executors.md) and about the `StandardReport` [here](./standard_report.md).
65+
> There's a lot to unpack here, and you're encouraged to read more about the built-in `QueryExecutors` and the standard metrics they provide as well as about the `StandardReport` [here](./reports/standard_report.md).
5666
>
57-
> For now, it's enough for you to know that standard metrics that `StandardQueryExecutor_Generator` comes with are following:
58-
> * median latency
59-
> * p95 latency (95th percentile)
60-
> * error rate
67+
> For now, it's enough to know that the standard metrics provided by `StandardQueryExecutor_Direct` include:
68+
> - Median latency
69+
> - P95 latency (95th percentile)
70+
> - Error rate
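To make these three metrics concrete, here is a minimal, self-contained Go sketch of how a median, a 95th percentile, and an error rate can be computed from raw request data. This illustrates the idea only; it is not BenchSpy's actual implementation:

```go
package main

import (
	"fmt"
	"math"
	"sort"
)

// percentile returns the p-th percentile (0.0-1.0) of values using the
// nearest-rank method on a sorted copy. Illustrative only.
func percentile(values []float64, p float64) float64 {
	sorted := append([]float64(nil), values...)
	sort.Float64s(sorted)
	idx := int(math.Ceil(p*float64(len(sorted)))) - 1
	if idx < 0 {
		idx = 0
	}
	return sorted[idx]
}

// errorRate is the fraction of failed requests out of all requests.
func errorRate(failed, total int) float64 {
	if total == 0 {
		return 0
	}
	return float64(failed) / float64(total)
}

func main() {
	latenciesMs := []float64{12, 15, 11, 90, 14, 13, 16, 12, 15, 13}
	fmt.Printf("median latency: %.1f ms\n", percentile(latenciesMs, 0.5))
	fmt.Printf("p95 latency:    %.1f ms\n", percentile(latenciesMs, 0.95))
	fmt.Printf("error rate:     %.2f\n", errorRate(1, len(latenciesMs)))
}
```

The `DirectQueryExecutor` spares you from writing this kind of post-processing yourself.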
71+
72+
### Step 3: Run the Test Again and Compare Reports
73+
74+
With the baseline report ready, let's run the load test again. This time, we'll use a wrapper function to automatically load the previous report, generate a new one, and ensure they are comparable.
6175

62-
With baseline report ready let's run the load test again, but this time let's use a wrapper function
63-
that will automatically load the previous report, generate a new one and make sure that they are actually comparable.
6476
```go
6577
// define a new generator using the same config values
6678
newGen, err := wasp.NewGenerator(&wasp.Config{
@@ -84,6 +96,7 @@ defer cancelFn()
8496
// currentReport is the report that we just created (baseLineReport)
8597
currentReport, previousReport, err := benchspy.FetchNewStandardReportAndLoadLatestPrevious(
8698
fetchCtx,
99+
// commit or tag of the new application version
87100
"e7fc5826a572c09f8b93df3b9f674113372ce925",
88101
benchspy.WithStandardQueries(benchspy.StandardQueryExecutor_Direct),
89102
benchspy.WithGenerators(newGen),
@@ -92,8 +105,9 @@ require.NoError(t, err, "failed to fetch current report or load the previous one
92105
```
93106

94107
> [!NOTE]
95-
> In real-world case, once you have the first report generated you should only need to use
96-
> `benchspy.FetchNewStandardReportAndLoadLatestPrevious` function.
108+
> In a real-world case, once you've generated the first report, you should only need to use the `benchspy.FetchNewStandardReportAndLoadLatestPrevious` function.
109+
110+
### What's Next?
97111

98-
Okay, so we have two reports now, that's great, but how do we make sure that application's performance is as expected?
99-
You'll find out in the [next chapter](./first_test_comparison.md).
112+
Now that we have two reports, how do we ensure that the application's performance meets expectations?
113+
Find out in the [next chapter](./simplest_metrics.md).
Lines changed: 9 additions & 9 deletions
@@ -1,14 +1,14 @@
1-
# BenchSpy - Getting started
1+
# BenchSpy - Getting Started
22

3-
All of the following examples assume that you have access to following applications:
4-
* Grafana
5-
* Loki
6-
* Prometheus
3+
The following examples assume you have access to the following applications:
4+
- Grafana
5+
- Loki
6+
- Prometheus
77

88
> [!NOTE]
9-
> The easiest way to run them locally is by using CTFv2's [observability stack](../../../framework/observability/observability_stack.md).
10-
> Just remember to first install the `CTF CLI` as described in [CTFv2 Getting Started](../../../framework/getting_started.md) chapter.
9+
> The easiest way to run these locally is by using CTFv2's [observability stack](../../../framework/observability/observability_stack.md).
10+
> Be sure to install the `CTF CLI` first, as described in the [CTFv2 Getting Started](../../../framework/getting_started.md) guide.
1111
12-
Since BenchSpy is tightly couplesd with WASP it's highly recommended that you [get familiar with it first](../overview.md), if you haven't yet.
12+
Since BenchSpy is tightly coupled with WASP, we highly recommend that you [get familiar with it first](../overview.md) if you haven't already.
1313

14-
Ready? [Let's go!](./first_test.md)
14+
Ready? [Let's get started!](./first_test.md)
Lines changed: 21 additions & 12 deletions
@@ -1,13 +1,14 @@
1-
# BenchSpy - Custom Loki metrics
1+
# BenchSpy - Custom Loki Metrics
22

3-
In this chapter we will see how to use custom LogQl queries in the performance report. For this more advanced use case
4-
we will need to compose the performance report manually.
3+
In this chapter, we’ll explore how to use custom `LogQL` queries in the performance report. For this more advanced use case, we’ll manually compose the performance report.
54

6-
Load-generation part is the same as in the standard Loki metrics example and thus will be skipped.
5+
The load generation part is the same as in the standard Loki metrics example and will be skipped.
76

8-
Let's define two illustrative metrics now:
9-
* `vu_over_time` - rate of virtual users generated by WASP, 10 seconds window
10-
* `responses_over_time` - number of AUT's responses, 1 second window
7+
## Defining Custom Metrics
8+
9+
Let’s define two illustrative metrics:
10+
- **`vu_over_time`**: The rate of virtual users generated by WASP, using a 10-second window.
11+
- **`responses_over_time`**: The number of AUT's responses, using a 1-second window.
1112

1213
```go
1314
lokiQueryExecutor := benchspy.NewLokiQueryExecutor(
@@ -20,19 +21,27 @@ lokiQueryExecutor := benchspy.NewLokiQueryExecutor(
2021
```
2122

2223
> [!NOTE]
23-
> These LogQl queries are using standard labels that `WASP` uses when sending data to Loki.
24+
> These `LogQL` queries use the standard labels that `WASP` applies when sending data to Loki.
25+
26+
## Creating a `StandardReport` with Custom Queries
27+
28+
Now, let’s create a `StandardReport` using our custom queries:
2429

25-
And create a `StandardReport` using our custom queries:
2630
```go
2731
baseLineReport, err := benchspy.NewStandardReport(
2832
"2d1fa3532656c51991c0212afce5f80d2914e34e",
29-
// notice the different functional option used to pass custom executors
33+
// notice the different functional option used to pass Loki executor with custom queries
3034
benchspy.WithQueryExecutors(lokiQueryExecutor),
3135
benchspy.WithGenerators(gen),
3236
)
3337
require.NoError(t, err, "failed to create baseline report")
3438
```
3539

36-
The rest of the code remains basically unchanged (apart from the name of metrics we are asserting on). You can find the full example [here](...).
40+
## Wrapping Up
3741

38-
Now it's time to look at the last of the bundled `QueryExecutors`. Proceed to the [next chapter to read about Prometheus](./prometheus.md).
42+
The rest of the code remains unchanged, except for the names of the metrics being asserted. You can find the full example [here](...).
43+
44+
Now it’s time to look at the last of the bundled `QueryExecutors`. Proceed to the [next chapter to read about Prometheus](./prometheus_std.md).
45+
46+
> [!NOTE]
47+
> You can find the full example [here](https://github.com/smartcontractkit/chainlink-testing-framework/tree/main/wasp/examples/benchspy/loki_query_executor/loki_query_executor_test.go).
Lines changed: 18 additions & 11 deletions
@@ -1,15 +1,22 @@
1-
# BenchSpy - To Loki or not to Loki?
1+
# BenchSpy - To Loki or Not to Loki?
22

3-
You might be asking yourself whether you should use `Loki` or `Direct` query executor if all you
4-
need are basic latency metrics.
3+
You might be wondering whether to use the `Loki` or `Direct` query executor if all you need are basic latency metrics.
54

6-
As a rule of thumb, if all you need is a single number that describes the median latency or error rate
7-
and you are not interested in directly comparing time series, minimum or maximum values or any kinds
8-
of more advanced calculation on raw data, then you should go with the `Direct`.
5+
## Rule of Thumb
96

10-
Why?
7+
If all you need is a single number, such as the median latency or error rate, and you're not interested in:
8+
- Comparing time series directly,
9+
- Examining minimum or maximum values, or
10+
- Performing advanced calculations on raw data,
1111

12-
Because it returns a single value for each of standard metrics using the same raw data that Loki would use
13-
(it accesses the data stored in the `WASP`'s generator that would later be pushed to Loki).
14-
This way you can run your load test without a Loki instance and save yourself the need of calculating the
15-
median and 95th percentile latency or the error ratio.
12+
then you should opt for the `Direct` query executor.
13+
14+
## Why Choose `Direct`?
15+
16+
The `Direct` executor returns a single value for each standard metric using the same raw data that Loki would use. It accesses data stored in the `WASP` generator, which is later pushed to Loki.
17+
18+
This means you can:
19+
- Run your load test without a Loki instance.
20+
- Avoid calculating metrics like the median, 95th percentile latency, or error ratio yourself.
21+
22+
By using `Direct`, you save resources and simplify the process when advanced analysis isn't required.
Lines changed: 37 additions & 33 deletions
@@ -1,18 +1,20 @@
1-
# BenchSpy - Standard Loki metrics
1+
# BenchSpy - Standard Loki Metrics
22

3-
> [!NOTE]
4-
> This example assumes you have access to Loki and Grafana instances. If you don't
5-
> find out how to launch them using CTFv2's [observability stack](../../../framework/observability/observability_stack.md).
3+
> [!WARNING]
4+
> This example assumes you have access to Loki and Grafana instances. If you don't, learn how to launch them using CTFv2's [observability stack](../../../framework/observability/observability_stack.md).
65
7-
Our Loki example, will vary from the previous one in just a couple of details:
8-
* generator will have Loki config
9-
* standard query executor type will be `benchspy.StandardQueryExecutor_Loki`
10-
* we will cast all results to `[]string`
11-
* and calculate medians for all metrics
6+
In this example, our Loki workflow will differ from the previous one in just a few details:
7+
- The generator will include a Loki configuration.
8+
- The standard query executor type will be `benchspy.StandardQueryExecutor_Loki`.
9+
- All results will be cast to `[]string`.
10+
- We'll calculate medians for all metrics.
1211

1312
Ready?
1413

15-
Let's define new load generation first:
14+
## Step 1: Define a New Load Generator
15+
16+
Let's start by defining a new load generator:
17+
1618
```go
1719
label := "benchspy-std"
1820

@@ -36,41 +38,43 @@ gen, err := wasp.NewGenerator(&wasp.Config{
3638
require.NoError(t, err)
3739
```
3840

39-
Now let's run the generator and save baseline report:
41+
## Step 2: Run the Generator and Save the Baseline Report
42+
4043
```go
4144
gen.Run(true)
4245

43-
fetchCtx, cancelFn := context.WithTimeout(context.Background(), 60*time.Second)
44-
defer cancelFn()
45-
4646
baseLineReport, err := benchspy.NewStandardReport(
4747
"c2cf545d733eef8bad51d685fcb302e277d7ca14",
48-
// notice the different standard executor type
48+
// notice the different standard query executor type
4949
benchspy.WithStandardQueries(benchspy.StandardQueryExecutor_Loki),
5050
benchspy.WithGenerators(gen),
5151
)
52-
require.NoError(t, err, "failed to create original report")
52+
require.NoError(t, err, "failed to create baseline report")
53+
54+
fetchCtx, cancelFn := context.WithTimeout(context.Background(), 60*time.Second)
55+
defer cancelFn()
5356

5457
fetchErr := baseLineReport.FetchData(fetchCtx)
55-
require.NoError(t, fetchErr, "failed to fetch data for original report")
58+
require.NoError(t, fetchErr, "failed to fetch data for baseline report")
5659

5760
path, storeErr := baseLineReport.Store()
58-
require.NoError(t, storeErr, "failed to store current report", path)
61+
require.NoError(t, storeErr, "failed to store baseline report", path)
5962
```
6063

61-
Since next steps are very similar to the ones used in the first test we will skip them and jump straight
62-
to metrics comparison.
64+
## Step 3: Skip to Metrics Comparison
65+
66+
Since the next steps are very similar to those in the first test, we’ll skip them and go straight to metrics comparison.
67+
68+
By default, the `LokiQueryExecutor` returns results as the `[]string` data type. Let’s use dedicated convenience functions to cast them from `interface{}` to string slices:
6369

64-
By default, `LokiQueryExecutor` returns `[]string` data type, so let's use dedicated convenience functions
65-
to cast them from `interface{}` to string slice:
6670
```go
6771
currentAsStringSlice := benchspy.MustAllLokiResults(currentReport)
6872
previousAsStringSlice := benchspy.MustAllLokiResults(previousReport)
6973
```
7074

71-
And finally, time to compare metrics. Since we have a `[]string` we will first convert it to `[]float64` and
72-
then calculate a median and assume it hasn't changed by more than 1%. Again, remember that this is just an illustration.
73-
You should decide yourself what's the best way to assert the metrics.
75+
## Step 4: Compare Metrics
76+
77+
Now, let’s compare metrics. Since we have `[]string`, we’ll first convert it to `[]float64`, calculate the median, and ensure the difference between the medians is less than 1%. Again, this is just an example—you should decide the best way to validate your metrics.
7478

7579
```go
7680
var compareMedian = func(metricName string) {
@@ -85,23 +89,23 @@ var compareMedian = func(metricName string) {
8589
require.NoError(t, err, "failed to convert %s results to float64 slice", metricName)
8690
previousMedian := benchspy.CalculatePercentile(previousFloatSlice, 0.5)
8791

88-
var diffPrecentage float64
92+
var diffPercentage float64
8993
if previousMedian != 0 {
90-
diffPrecentage = (currentMedian - previousMedian) / previousMedian * 100
94+
diffPercentage = (currentMedian - previousMedian) / previousMedian * 100
9195
} else {
92-
diffPrecentage = currentMedian * 100
96+
diffPercentage = 100 // treat any change from a zero baseline as a 100% difference
9397
}
94-
assert.LessOrEqual(t, math.Abs(diffPrecentage), 1.0, "%s medians are more than 1% different", metricName, fmt.Sprintf("%.4f", diffPrecentage))
98+
assert.LessOrEqual(t, math.Abs(diffPercentage), 1.0, "%s medians are more than 1% different", metricName, fmt.Sprintf("%.4f", diffPercentage))
9599
}
96100

97101
compareMedian(string(benchspy.MedianLatency))
98102
compareMedian(string(benchspy.Percentile95Latency))
99103
compareMedian(string(benchspy.ErrorRate))
100104
```
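The same comparison logic can be exercised outside a test harness. The helpers below (`toFloats`, `median`, `withinPercent`) are hypothetical names used for illustration, not BenchSpy APIs; note that this sketch handles the zero-baseline case by requiring both medians to be zero, a choice you may want to adjust:

```go
package main

import (
	"fmt"
	"math"
	"sort"
	"strconv"
)

// toFloats converts string samples (the shape a Loki query returns) to float64s.
func toFloats(samples []string) ([]float64, error) {
	out := make([]float64, 0, len(samples))
	for _, s := range samples {
		f, err := strconv.ParseFloat(s, 64)
		if err != nil {
			return nil, err
		}
		out = append(out, f)
	}
	return out, nil
}

// median returns the middle value of a sorted copy of values.
func median(values []float64) float64 {
	sorted := append([]float64(nil), values...)
	sort.Float64s(sorted)
	mid := len(sorted) / 2
	if len(sorted)%2 == 0 {
		return (sorted[mid-1] + sorted[mid]) / 2
	}
	return sorted[mid]
}

// withinPercent reports whether current deviates from previous by at most
// maxDiff percent. A zero baseline is only "within" if current is also zero.
func withinPercent(previous, current, maxDiff float64) bool {
	if previous == 0 {
		return current == 0
	}
	diff := math.Abs((current - previous) / previous * 100)
	return diff <= maxDiff
}

func main() {
	prev, _ := toFloats([]string{"101", "99", "100"})
	curr, _ := toFloats([]string{"100", "102", "100"})
	fmt.Println(withinPercent(median(prev), median(curr), 1.0)) // → true
}
```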
101105

102-
We have used standard metrics, which are the same as in the first test, now let's see how you can use your custom LogQl queries.
106+
## What’s Next?
103107

104-
> [!NOTE]
105-
> Don't know whether to use `Loki` or `Direct` query executors? [Read this!](./loki_dillema.md)
108+
In this example, we used standard metrics, which are the same as in the first test. Now, [let’s explore how to use your custom LogQL queries](./loki_custom.md).
106109

107-
You can find the full example [here](...).
110+
> [!NOTE]
111+
> You can find the full example [here](https://github.com/smartcontractkit/chainlink-testing-framework/tree/main/wasp/examples/benchspy/loki_query_executor/loki_query_executor_test.go).
Lines changed: 11 additions & 12 deletions
@@ -1,16 +1,15 @@
11
# BenchSpy
22

3-
BenchSpy (short for benchmark spy) is a WASP-coupled tool that allows for easy comparison of various performance metrics.
3+
BenchSpy (short for Benchmark Spy) is a [WASP](../overview.md)-coupled tool designed for easy comparison of various performance metrics.
44

5-
It's main characteristics are:
6-
* three built-in data sources:
7-
* `Loki`
8-
* `Prometheus`
9-
* `Direct`
10-
* standard/pre-defined metrics for each data source
11-
* ease of extensibility with custom metrics
12-
* ability to load latest performance report based on Git history
13-
* 88% unit test coverage
5+
## Key Features
6+
- **Three built-in data sources**:
7+
- `Loki`
8+
- `Prometheus`
9+
- `Direct`
10+
- **Standard/pre-defined metrics** for each data source.
11+
- **Ease of extensibility** with custom metrics.
12+
- **Ability to load the latest performance report** based on Git history.
13+
- **88% unit test coverage**.
1414

15-
It doesn't come with any comparation logic, other than making sure that performance reports are comparable (e.g. they mesure the same metrics in the same way),
16-
leaving total freedom to the user.
15+
BenchSpy does not include any built-in comparison logic beyond ensuring that performance reports are comparable (e.g., they measure the same metrics in the same way), offering complete freedom to the user for interpretation and analysis.
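Conceptually, the comparability guarantee boils down to verifying that both reports expose the same metric names before any user-defined assertions run. A hedged illustration in plain Go (not BenchSpy's actual check):

```go
package main

import (
	"fmt"
	"sort"
)

// sameMetricSet reports whether two reports cover exactly the same metric
// names, which is the minimal precondition for comparing them, and returns
// any names present in only one of the two.
func sameMetricSet(previous, current map[string]float64) (bool, []string) {
	var mismatched []string
	for name := range previous {
		if _, ok := current[name]; !ok {
			mismatched = append(mismatched, name)
		}
	}
	for name := range current {
		if _, ok := previous[name]; !ok {
			mismatched = append(mismatched, name)
		}
	}
	sort.Strings(mismatched)
	return len(mismatched) == 0, mismatched
}

func main() {
	prev := map[string]float64{"median_latency": 101, "error_rate": 0}
	curr := map[string]float64{"median_latency": 99, "95th_latency": 120}
	ok, mismatched := sameMetricSet(prev, curr)
	fmt.Println(ok, mismatched) // → false [95th_latency error_rate]
}
```

What to do once the sets match, e.g. asserting on medians or raw time series, is entirely up to you.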
