This repository was archived by the owner on Jul 22, 2025. It is now read-only.

Conversation

@SamSaffron
Member

@SamSaffron SamSaffron commented Jan 2, 2025

Adds a comprehensive quota management system for LLM models that allows:

  • Setting per-group token and usage limits with configurable durations
  • Tracking and enforcing token/usage limits across user groups
  • Quota reset periods (hourly, daily, weekly, or custom)
  • Admin UI for managing quotas with real-time updates
  • Full test coverage for quota models and controllers

This system provides granular control over LLM API usage by allowing admins
to define limits on both total tokens and number of requests per group.
Supports multiple concurrent quotas per model and automatically handles
quota resets.
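The enforcement and reset behavior described above can be sketched roughly as follows. This is a minimal illustration only, with hypothetical class and method names; the actual models and API live in the discourse-ai plugin.

```ruby
# Minimal sketch of per-group quota enforcement. A quota carries a token
# limit, a usage (request) limit, and a reset window in seconds.
# Names here are hypothetical, not the plugin's actual API.
class LlmQuotaTracker
  Quota = Struct.new(:max_tokens, :max_usages, :duration_seconds, keyword_init: true)

  def initialize(quota)
    @quota = quota
    reset!(Time.now)
  end

  # Record a request; returns false (rejecting it) if either limit is hit.
  # The window resets automatically once duration_seconds has elapsed.
  def record!(tokens:, now: Time.now)
    reset!(now) if now - @window_started_at >= @quota.duration_seconds
    return false if @used_tokens + tokens > @quota.max_tokens
    return false if @used_usages + 1 > @quota.max_usages

    @used_tokens += tokens
    @used_usages += 1
    true
  end

  private

  def reset!(now)
    @window_started_at = now
    @used_tokens = 0
    @used_usages = 0
  end
end
```

A daily quota would use `duration_seconds: 86_400`; supporting multiple concurrent quotas per model would mean running every matching tracker and rejecting the request if any of them declines it.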

[screenshots of the quota admin UI]

@SamSaffron SamSaffron marked this pull request as ready for review January 10, 2025 05:56
@SaifMurtaza

Are we able to provide guidelines for setting the max tokens? Some tooltips would be helpful here, i.e. for the non-technical admin, concepts such as tokens might be missed. They are probably left wondering "How many words does this actually mean?" or "How many times can I message the persona?"
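For a tooltip like the one suggested, a common rule of thumb is that English text averages roughly 0.75 words per token, though the real ratio depends on the tokenizer, the model, and the language. A hedged helper for such an estimate might look like:

```ruby
# Rough heuristic only: English averages about 0.75 words per token.
# The actual ratio varies by tokenizer and language, so any tooltip
# using this should say "approximately".
WORDS_PER_TOKEN = 0.75

def approx_words(tokens)
  (tokens * WORDS_PER_TOKEN).round
end
```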

Member

@keegangeorge keegangeorge left a comment


LGTM overall, just a few suggestions here and there

SamSaffron and others added 24 commits January 13, 2025 17:53
This introduces a new feature for per group quotas
we need to route through llm model for simplicity
Co-authored-by: Keegan George <[email protected]>
@SamSaffron
Member Author

OK @keegangeorge I think it is all addressed! thanks for the review

@SamSaffron
Member Author

thanks @SaifMurtaza, will add more clarity. I am on the fence about implementing an optional "absolute quota" that is shared across the group... then you could guarantee you would never spend more than $N on AI a day, even if you have very large groups.

@davidtaylorhq
Member

As mentioned on dev, the use of { i18n } from "discourse-i18n"; should be reverted until after the next core version bump.

Member

@keegangeorge keegangeorge left a comment


LGTM! Just two small fixes needed before merging: the linting issue and the .discourse-compatibility update 🚀

import { getOwner } from "@ember/owner";
import { service } from "@ember/service";
import I18n from "discourse-i18n";
import { i18n } from "discourse-i18n";
Member


See internal: /t/124366/17. We should pin discourse-ai for core < beta4 in .discourse-compatibility

@SamSaffron SamSaffron merged commit d07cf51 into main Jan 14, 2025
6 checks passed
@SamSaffron SamSaffron deleted the quotas2 branch January 14, 2025 04:54
