fix(amazonq): debounce inline suggestion requests. #7289

Hweinstock · 2025-05-12T16:14:09Z

Problem

When a user types a burst of text in quick succession, the language server's requests to the backend get throttled. This happens because we request recommendations on every keystroke, which causes the language server to make a backend request per keystroke. The backend is right to throttle this.

Solution

The ideal behavior is to make a request to the language server, and thus to the backend, only when typing stops. This is an ideal use case for debounce, although debounce needs a small extension, outlined below.
Apply debounce to the recommendations so that we wait 20 ms after typing stops before fetching the results.
Because this is applied at the recommendation level, none of the inline latency metrics are affected.

Debounce Changes

Let f be some debounced function that takes a string argument. Our current debounce does the following:

f('a') f('ab') f('abc') (pause for debounce delay) -> f would be called with 'a'

The issue is that, for suggestions, this means the language server request will be made with stale context (i.e. not including our most recent content). What we want instead is for the case above to call f with 'abc', not with 'a' or 'ab'.

We can accomplish this by adding a flag to debounce allowing us to choose whether we call it with the first args of the debounce interval (default, and 'a' in the example above), or the most recent args ('abc' in the example above).
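The flag described above can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the function name `debounce`, the parameter name `useLastCall`, and the shared-promise behavior are all assumptions made for the example.

```typescript
// Hypothetical sketch of a debounce with a "use last args" flag.
// Names and signature are assumptions, not the toolkit's actual API.
function debounce<Args extends unknown[], Out>(
    fn: (...args: Args) => Out | Promise<Out>,
    delayMs: number,
    useLastCall: boolean = false
): (...args: Args) => Promise<Out> {
    let timer: ReturnType<typeof setTimeout> | undefined
    let pending: Promise<Out> | undefined
    let storedArgs: Args | undefined
    let resolvePending: ((value: Out | PromiseLike<Out>) => void) | undefined
    let rejectPending: ((reason?: unknown) => void) | undefined

    return (...args: Args): Promise<Out> => {
        // Keep the first args of the interval by default,
        // or overwrite with the newest args when useLastCall is set.
        if (storedArgs === undefined || useLastCall) {
            storedArgs = args
        }
        // Every call restarts the quiet-period timer.
        if (timer !== undefined) {
            clearTimeout(timer)
        }
        // All calls within one interval share a single promise.
        if (pending === undefined) {
            pending = new Promise<Out>((resolve, reject) => {
                resolvePending = resolve
                rejectPending = reject
            })
        }
        const current = pending
        timer = setTimeout(() => {
            const callArgs = storedArgs!
            const resolve = resolvePending!
            const reject = rejectPending!
            // Reset state so the next call starts a fresh interval.
            timer = undefined
            pending = undefined
            storedArgs = undefined
            try {
                resolve(fn(...callArgs))
            } catch (e) {
                reject(e)
            }
        }, delayMs)
        return current
    }
}
```

With `useLastCall` set to true, calling the debounced function with 'a', 'ab', and 'abc' inside one delay window invokes `fn` once with 'abc'; with the default (false), it is invoked once with 'a'.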

Verification

I did not notice the added latency when testing inline. However, inline does seem slower than prod, both with and without this change.
I was not able to get a throttling exception.

Treat all work as PUBLIC. Private feature/x branches will not be squash-merged at release time.
Your code changes must meet the guidelines in CONTRIBUTING.md.
License: I confirm that my contribution is made under the terms of the Apache 2.0 license.

Hweinstock · 2025-05-12T19:17:11Z

packages/core/src/codewhisperer/models/constants.ts

- * the interval of the background thread invocation, which is triggered by the timer
+ * Delay for making requests once the user stops typing. Without a delay, inline suggestions request is triggered every keystroke.
 */
-export const defaultCheckPeriodMillis = 1000 * 60 * 5


I didn't see these used anymore, so I deleted them for clarity.


jpinkney-aws · 2025-05-12T19:21:58Z


These were probably missed when I was deleting code. One of the problems with exporting in TypeScript is that it makes it very hard to detect which exports are actually used.

Hweinstock · 2025-05-12T19:45:34Z

Could we use something like https://www.npmjs.com/package/ts-unused-exports? I just ran it and got 951 results, so this might be noisy. Perhaps something to investigate in the future if more bandwidth is available.

Hweinstock added 2 commits May 12, 2025 11:36

fix: add debounce to avoid request per keystroke

a9a1ac6

test: add tests to ensure request is debounced

48e7a65


Hweinstock changed the title ~~fix(amazonq): debounce inline suggestion requests.~~ fix(amazonq): debounce inline suggestion requests. (WIP) May 12, 2025

refactor: adjust delay to be smaller

142566b

Hweinstock force-pushed the fix/throttle branch from 1334647 to 142566b Compare May 12, 2025 17:01

fix: use last args to avoid improper suggestions

82cca2b

Hweinstock changed the title ~~fix(amazonq): debounce inline suggestion requests. (WIP)~~ fix(amazonq): debounce inline suggestion requests. May 12, 2025

fix: adjust tests to avoid lint error

5a65802

Hweinstock force-pushed the fix/throttle branch from f12d5e5 to 5a65802 Compare May 12, 2025 18:57


Hweinstock marked this pull request as ready for review May 12, 2025 19:17

Hweinstock requested review from a team as code owners May 12, 2025 19:17

jpinkney-aws approved these changes May 12, 2025


Hweinstock merged commit 0429352 into aws:feature/flare-inline May 12, 2025
32 of 37 checks passed

Hweinstock deleted the fix/throttle branch May 12, 2025 20:11
