-
Notifications
You must be signed in to change notification settings - Fork 0
Add performance optimizations for large repositories #11
Copy link
Copy link
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Feature Request: Performance Optimizations
Problem Statement
Large repositories with extensive history or many files can make dex slow, especially with full file extraction.
Proposed Solution
Implement performance optimizations for better scalability.
Implementation Details
Optimizations
-
Parallel Processing
- Concurrent file reading
- Parallel git operations
- Multi-threaded parsing
-
Caching
- Cache git operations
- Store parsed ASTs
- Remember file contents
- Token count cache
-
Lazy Loading
- Stream large files
- Incremental processing
- On-demand symbol resolution
-
Smart Limits
- File size thresholds
- Token count limits
- Automatic truncation
- Sampling strategies
CLI Options
dex --max-file-size=1MB # Skip large files
dex --max-tokens=100000 # Token limit
dex --cache # Enable caching
dex --parallel=4 # Worker threadsProgress Indicators
Analyzing changes... ████████░░ 80% | 1234/1543 files | ETA: 5s
Performance Metrics
## Metadata
- **Processing Time:** 2.3s
- **Files Analyzed:** 1543
- **Cache Hits:** 89%
- **Peak Memory:** 124MBBenefits
- 10x faster on large repos
- Lower memory usage
- Better user experience
- Scalable to monorepos
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request