improve the vram calculator #38

Open
Yuyz0112 wants to merge 1 commit into main from yz-calc

Conversation

@Yuyz0112 (Contributor)

Please check the internal design doc.

Copilot AI left a comment

Pull Request Overview

This PR centralizes inference and vLLM parameters into configuration constants, refactors core VRAM calculation functions to use those constants, and removes the CPU resources UI and related CPU calculations.

  • Introduce INFERENCE_CONFIG and VLLM_CONFIG to manage empirical parameters
  • Refactor calculateKVCache, calculateActivations, calculateFrameworkOverhead, calculateMultiDeviceOverhead, and estimateGenerationSpeed to use new configs
  • Remove CPU memory/cores calculation logic and the corresponding UI section
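The centralized constants and the refactored KV-cache math could look roughly like the sketch below. Only ACTIVE_USER_RATIO and MEMORY_POOL_OVERHEAD are named in the review; every other field name, the function signature, and all numeric values are illustrative assumptions, not the PR's actual code.

```typescript
// Sketch of the centralized config constants (placeholder values).
const VLLM_CONFIG = {
  ACTIVE_USER_RATIO: 0.3,    // fraction of concurrent users assumed active at once
  MEMORY_POOL_OVERHEAD: 1.1, // multiplier for vLLM's preallocated KV memory pool
} as const;

const INFERENCE_CONFIG = {
  BYTES_PER_ELEMENT_FP16: 2, // bytes per tensor element at fp16/bf16 precision
} as const;

// KV cache in GB: 2 tensors (K and V) per layer, hiddenSize elements per
// token, scaled by sequence length and the effective batch derived from
// concurrent users, then inflated by the memory-pool overhead.
function calculateKVCache(
  numLayers: number,
  hiddenSize: number,
  seqLen: number,
  concurrentUsers: number,
): number {
  const effectiveBatch = Math.max(
    1,
    Math.ceil(concurrentUsers * VLLM_CONFIG.ACTIVE_USER_RATIO),
  );
  const cachePerToken =
    2 * numLayers * hiddenSize * INFERENCE_CONFIG.BYTES_PER_ELEMENT_FP16;
  const bytes =
    cachePerToken * seqLen * effectiveBatch * VLLM_CONFIG.MEMORY_POOL_OVERHEAD;
  return bytes / 1024 ** 3; // GB
}
```

With these placeholder values, a Llama-7B-like shape (32 layers, hidden size 4096, 2048-token context, 10 users) yields an effective batch of 3 and roughly 3.3 GB of KV cache.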

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File descriptions:

  • src/pages/vram-calculator/VramCalculatorPage.tsx: Remove CPU resources block; drop unused type & Label imports
  • src/lib/vram-calculator.ts: Add config constants, refactor calculation functions, remove CPU logic
Comments suppressed due to low confidence (3)

src/lib/vram-calculator.ts:122

  • Consider adding unit tests for calculateKVCache to cover scenarios with different effective batch sizes based on VLLM_CONFIG.ACTIVE_USER_RATIO and MEMORY_POOL_OVERHEAD.
function calculateKVCache(

src/lib/vram-calculator.ts:136

  • [nitpick] The variable name cachePerToken is somewhat generic; consider renaming it to kvCachePerToken to clarify that it refers to KV cache byte size per token.
  const cachePerToken =

src/pages/vram-calculator/VramCalculatorPage.tsx:557

  • Since the CPU resources section has been removed, consider cleaning up related translation keys (e.g., pages.vramCalculator.sections.cpuResources) to avoid unused localization entries.
            <div className="border border-border rounded-lg p-6">

Comment on lines 201 to 207

  if (availableVramPerGpu >= 80) {
-   baseOverhead = 2.0;
+   baseOverhead = 2.0; // H100, A100 80GB
  } else if (availableVramPerGpu >= 40) {
-   baseOverhead = 1.5;
+   baseOverhead = 1.5; // A100 40GB, A6000
  } else if (availableVramPerGpu >= 24) {
-   baseOverhead = 1.2;
+   baseOverhead = 1.2; // RTX 4090, RTX 3090
  } else if (availableVramPerGpu >= 16) {
Copilot AI (Jul 17, 2025)
[nitpick] Consider extracting the GPU memory tier thresholds (80, 40, 24, 16) into a configuration object or enum to improve maintainability and avoid magic numbers.
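One sketch of that extraction. The 80/40/24 GB thresholds and their overhead values come from the diff above; the 16 GB tier's overhead, the sub-tier fallback, the example card names for those two, and the helper name are assumptions.

```typescript
interface GpuTier {
  minVramGb: number;      // lower bound for this tier (inclusive)
  baseOverheadGb: number; // framework overhead budgeted for this tier
  examples: string;       // representative cards
}

// Ordered high-to-low so the first matching tier wins.
const GPU_TIERS: readonly GpuTier[] = [
  { minVramGb: 80, baseOverheadGb: 2.0, examples: "H100, A100 80GB" },
  { minVramGb: 40, baseOverheadGb: 1.5, examples: "A100 40GB, A6000" },
  { minVramGb: 24, baseOverheadGb: 1.2, examples: "RTX 4090, RTX 3090" },
  { minVramGb: 16, baseOverheadGb: 1.0, examples: "RTX 4080 (assumed)" },
];

const FALLBACK_OVERHEAD_GB = 0.8; // below the smallest tier (assumed)

function baseOverheadFor(availableVramPerGpu: number): number {
  const tier = GPU_TIERS.find((t) => availableVramPerGpu >= t.minVramGb);
  return tier?.baseOverheadGb ?? FALLBACK_OVERHEAD_GB;
}
```

This keeps the tier table in one place, so adding a new GPU class means adding one row instead of another else-if branch.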

@samuel032khoury (Contributor) left a comment
OK


3 participants