openai · Rohan5commit · Apr 26, 2026 · Copilot · Apr 26, 2026
@@ -33,9 +33,10 @@ import java.util.function.Consumer
  *
  * This client performs best when you create a single instance and reuse it for all interactions
  * with the REST API. This is because each client holds its own connection pool and thread pools.
- * Reusing connections and threads reduces latency and saves memory. The client also handles rate
- * limiting per client. This means that creating and using multiple instances at the same time will
- * not respect rate limits.
+ * Reusing connections and threads reduces latency and saves memory. Each client also manages its
+ * own retries independently, so reusing one client keeps that retry behavior, connection pool, and
+ * thread pools in one place while multiple clients may compete separately against the same external
+ * rate limits.
- * Reusing connections and threads reduces latency and saves memory. Each client also manages its
- * own retries independently, so reusing one client keeps that retry behavior, connection pool, and
- * thread pools in one place while multiple clients may compete separately against the same external
- * rate limits.
+ * Reusing connections and threads reduces latency and saves memory. Retries are applied per
+ * request, reactively based on API responses such as rate limits, and are not coordinated across
+ * requests or client instances. The SDK does not proactively throttle requests or coordinate rate
+ * limiting across multiple clients, so separate clients may still compete against the same
+ * external rate limits.
- * Reusing connections and threads reduces latency and saves memory. Each client also manages its
- * own retries independently, so reusing one client keeps that retry behavior, connection pool, and
- * thread pools in one place while multiple clients may compete separately against the same external
- * rate limits.
+ * Reusing connections and threads reduces latency and saves memory. Retries are applied per
+ * request, reactively based on API responses such as rate limits, and are not coordinated across
+ * requests or client instances. The SDK does not proactively throttle requests or coordinate rate
+ * limiting across multiple clients, so separate clients may still compete against the same
+ * external rate limits.
  *
  * The threads and connections that are held will be released automatically if they remain idle. But
  * if you are writing an application that needs to aggressively release unused resources, then you