updated latency page (#508)

lukasgoetzweiss · web-flow · commit d458e42b2f6b · 2024-10-29T13:33:01.000-06:00
* updated latency page

* a few minor copy edits
diff --git a/docs/sdks/faqs/latency.md b/docs/sdks/faqs/latency.md
@@ -4,22 +4,26 @@ sidebar_position: 6
 
 # Latency
 
-Eppo’s feature flagging architecture enables faster delivery of client-side web experiments. By leveraging a "dumb server, smart client" approach and utilizing the power of the Fastly CDN, Eppo offers low-latency feature flag evaluations and efficient updates.
+## Introduction
 
-### Architecture Overview
+Eppo’s feature flags leverage a "dumb server, smart client" approach. This, paired with using Fastly's Global CDN, allows Eppo to provide low-latency SDK initialization and near instantaneous evaluation of feature flag and experiment variations.
 
-The Eppo feature flagging architecture is designed as a JSON delivery service. The server maintains a file containing feature flags and their corresponding assignment rules. The client downloads this file and determines which feature flags apply to its group.
+The system is designed as a JSON delivery service. Our CDN maintains a file containing feature flags and their corresponding assignment rules. The SDK client then downloads this file and determines what variant to apply for a specific subject (e.g., user).
 
-Our architecture follows the "smart client" principle, offloading the evaluation work to the client-side SDKs. This approach requires initial development effort to implement new SDKs and update existing ones when new targeting rules are introduced. However, it allows us to tap into the resources of the Fastly CDN, benefiting from its global infrastructure and distributed caching capabilities.
+Offloading the evaluation work to the SDK means that once the SDK is initialized all evaluation happens locally, typically in under 1 ms. Further, if user context changes mid-session, there is no need to reach out to Eppo's servers to understand their new targeting eligibility. All of the targeting happens locally, so Eppo's SDK will always ensure users see the right experiment given their unique targeting attributes.
 
-### Latency Considerations
+This page walks through some of the latency considerations associated with building a global feature flagging system, describes options for how to reach your internal SLAs, and presents performance benchmarks.
 
-When it comes to feature-flagging services, latency is a crucial factor for client experiences. We distinguish between two types of latency:
+## Latency Considerations
 
-1. **Evaluation Latency**: The time it takes for a client to determine which flag value applies to it.
-2. **Update Latency**: The time it takes for updated rules to reach the client.
+When it comes to feature-flagging services, latency is a crucial factor for end user experiences. We distinguish between two types of latency:
 
-We have made architectural tradeoffs to optimize these latency factors. Instead of a "smart server" architecture that requires frequent server polling, Eppo prioritizes fast evaluations by accepting relatively slower updates.
+1. **Evaluation Latency**: The time it takes to determine which flag value applies for a specific subject (user).
+2. **Update Latency**: The time it takes for updated rules to reach end users.
+
+Eppo's "smart client" approach allows us to give very impressive evaluation latencies (typically under 1ms), while still providing update latencies that satisfy internal SLAs for disabling problematic features. 
+
+This is in contrast to a server-side evaluation which requires frequent network requests each time a flag is evaluated, or a user's context changes.
 
 ### Leveraging the Fastly CDN
 
@@ -29,32 +33,68 @@ The diagram below illustrates our architecture, where requests enter from the ri
 
 ![Feature flag architecture](/img/feature-flagging/latency-1.png)
 
-This architectural choice allows us to achieve impressive latency figures. Let's explore the uncached and cached latency numbers to understand the client experience better.
-
-### Uncached Latency
-
-The following map displays uncached latency figures obtained using a generic latency testing tool. Clients can download the feature-flag file in less than a second from most locations worldwide.
-
-![Uncached latency](/img/feature-flagging/latency-2.png)
-
-### Cached Latency
-
-The cached latency numbers provide a more representative view of typical client experiences. These numbers correspond to requests made within a couple of minutes after the initial request.
-
-![Cached latency](/img/feature-flagging/latency-3.png)
-
-Most regions, including the US, Europe, South Korea, and Australia, experience latencies under 100ms. Even in locations further away, latencies are generally below half a second. To put it into perspective, the ping time between New York and London is approximately 72ms. From the client's perspective, it appears as though Eppo's servers are distributed globally, even though they are physically located in the corn fields and cow pastures of Iowa.
+### Latency benchmark
+
+This architectural choice allows us to achieve impressive latency figures. To understand initialization times around the world, we ran a benchmark with ~80 active feature flag using the open source tool Grafana k6. Two users (i.e., threads) executed 500 requests in a row. This is repeated for each region in GCP.
+
+The table below shows the measured percentiles for each region, reported in milliseconds.
+
+:::note
+These figures measure latency from the VMs used for the test, so they are data center to data center. These figures will not be representative of what an end-user would experience in the mentioned locations for each provider - those figures would be higher (due to having to traverse the open internet/low bandwidth connections). 
+:::
+
+
+| **Region**              | **Location**                  | **p50**   | **p90**   | **p99**  |
+| ----------------------- | ----------------------------- | ----- | ----- | ---- |
+| africa-south1           | Johannesburg, South Africa    | 2     | 2.4   | 5.1  |
+| asia-east1              | Changhua County, Taiwan       | 15.4  | 15.8  | 19.7 |
+| asia-east2              | Hong Kong                     | 2     | 2.6   | 5.6  |
+| asia-northeast1         | Tokyo, Japan                  | 1.2   | 1.5   | 2.6  |
+| asia-northeast2         | Osaka, Japan                  | 9.9   | 10.1  | 13   |
+| asia-northeast3         | Seoul, South Korea            | 2.8   | 4.2   | 4.9  |
+| asia-south1             | Mumbai, India                 | 24.8  | 35    | 35.6 |
+| asia-south2             | Delhi, India                  | 2.7   | 2.9   | 3.7  |
+| asia-southeast1         | Jurong West, Singapore        | 2.3   | 2.8   | 4    |
+| asia-southeast2         | Jakarta, Indonesia            | 19    | 19.9  | 29.3 |
+| australia-southeast1    | Sydney, Australia             | 2.3   | 3     | 3.9  |
+| australia-southeast2    | Melbourne, Australia          | 16.3  | 16.7  | 18.9 |
+| europe-central2         | Warsaw, Poland                | 24.8  | 25    | 27.7 |
+| europe-north1           | Hamina, Finland               | 3.4   | 4     | 6.1  |
+| europe-southwest1       | Madrid, Spain                 | 1.7   | 2.6   | 3.7  |
+| europe-west1            | St. Ghislain, Belgium         | 5.2   | 5.8   | 7.3  |
+| europe-west10           | Berlin, Germany               | 13.9  | 14.1  | 15.9 |
+| europe-west12           | Turin, Italy                  | 6.2   | 7.3   | 13   |
+| europe-west2            | London, England               | 1.6   | 2.1   | 3.1  |
+| europe-west3            | Frankfurt, Germany            | 1.4   | 1.8   | 2.7  |
+| europe-west4            | Eemshaven, Netherlands        | 4.7   | 5.1   | 6.4  |
+| europe-west6            | Zurich, Switzerland           | 5.6   | 6     | 7.3  |
+| europe-west8            | Milan, Italy                  | 1.5   | 1.8   | 2.7  |
+| europe-west9            | Paris, France                 | 1.8   | 2.9   | 3.9  |
+| me-central1             | Doha, Qatar                   | 126.3 | 126.8 | 130  |
+| me-west1                | Tel Aviv, Israel              | 48.1  | 48.8  | 49   |
+| northamerica-northeast1 | Montréal, Québec              | 9.3   | 10    | 13.3 |
+| northamerica-northeast2 | Toronto, Ontario              | 1.8   | 2.2   | 2.9  |
+| southamerica-east1      | Osasco, São Paulo             | 2.1   | 2.3   | 4.1  |
+| southamerica-west1      | Santiago, Chile               | 1.1   | 1.4   | 2.4  |
+| us-central1             | Council Bluffs, Iowa          | 12.6  | 12.9  | 14.2 |
+| us-east1                | Moncks Corner, South Carolina | 14.8  | 15    | 16.2 |
+| us-east4                | Ashburn, Virginia             | 1.8   | 2.2   | 3.1  |
+| us-east5                | Columbus, Ohio                | 12.2  | 12.6  | 17   |
+| us-south1               | Dallas, Texas                 | 1.8   | 2.3   | 3.3  |
+| us-west1                | The Dalles, Oregon            | 8.1   | 8.4   | 10.6 |
+| us-west2                | Los Angeles, California       | 1     | 1.3   | 2.3  |
+| us-west3                | Salt Lake City, Utah          | 18.4  | 19.4  | 19.7 |
+| us-west4                | Las Vegas, Nevada             | 7.9   | 8.2   | 9.6  |
+
+Most regions experience latencies under 20ms. Even in locations further away, latencies are well below half a second. To put this into perspective, the ping time between New York and London is approximately 72ms. From the client's perspective, it appears as though Eppo's servers are distributed globally, even though they are physically located in the corn fields and cow pastures of Iowa.
 
 ### Update Latency
 
-Update latency refers to the time required for updated feature-flagging rules (JSON file) to reach the clients.
-
-For client side SDKs (Android, iOS, React, etc), the rules are updated each time the Eppo SDK is initialized, which should happen only one time per application lifecycle. If the SDK is initialized more than once during an application’s lifecycle, an exception is thrown.
+Update latency refers to the time required for updated feature-flagging rules (JSON file) to reach the clients. Most of Eppo's SDKs have built-in polling that is configurable at initialization. This makes it easy to set a desired update latency: simply set the polling cadence to reach your internal SLA for changes to go live.
 
-For server side SDKs (Java, Node.js, Ruby, etc), the rules are first updated when the Eppo SDK is initialized, then again every 30 seconds for the duration of the application’s lifecycle.
+Some SDKs also offer a method to manually trigger a reload of configurations, allowing for a lot of flexibility in how to handle update latency.
 
-### Conclusion
+## Conclusion
 
-Eppo's feature-flag architecture offers a faster and cost-effective way to deliver client-side web experiments. By prioritizing client-side evaluation and leveraging the Fastly CDN, we achieve sub-100ms evaluation latencies while maintaining efficient update processes.
+Eppo's feature-flagging architecture is optimized for evaluation latency. Once the SDK is initialized (typically in under 20ms), all evaluation of flags happen effectively immediately. Eppo also provides flexibility in how to handle update latency, ensuring changes made in Eppo's UI reach end users within a specified time window.
 
-For any further questions or assistance, please refer to the Eppo Feature Flagging Service documentation or reach out to our support team.