fix: prevent Ollama indexing from freezing at 69% #6850
Conversation
- Add dynamic timeout for batch embedding operations based on batch size
- Improve error handling and recovery in batch processing
- Ensure errors are properly thrown to prevent silent failures
- Add immediate error state updates when batch processing fails
- Add better logging for debugging timeout and retry scenarios

Fixes #6849
```ts
texts.length > 1
	? t("embeddings:ollama.batchTimeoutError", {
			count: texts.length,
			timeout: Math.round(Math.max(OLLAMA_EMBEDDING_TIMEOUT_MS, texts.length * 2000) / 1000),
```
Suggestion: Instead of recalculating the batch timeout for the error message (using Math.max again), reuse the 'batchTimeout' variable calculated earlier to ensure consistency.
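A sketch of the suggested reuse, assuming `batchTimeout` is still in scope at the error site (as it is in the hunk further down):

```ts
t("embeddings:ollama.batchTimeoutError", {
	count: texts.length,
	// Reuse the batchTimeout computed earlier instead of repeating Math.max(...)
	timeout: Math.round(batchTimeout / 1000),
})
```

This keeps the reported timeout in sync if the scaling formula ever changes.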
Reviewed my own code and found it acceptable, but only because the alternative was infinite recursion.
```ts
// Add timeout to prevent indefinite hanging
// Use a longer timeout for batch operations as they can take more time
const batchTimeout = Math.max(OLLAMA_EMBEDDING_TIMEOUT_MS, texts.length * 2000) // At least 2 seconds per text
```
Is the 2-second-per-text multiplier intentional? This seems like a reasonable heuristic, but could we make this configurable or at least document why 2 seconds was chosen? Some models or systems might need different timeout scaling.
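One hedged way to make the scaling overridable; the constant and environment variable names here are hypothetical, not part of the PR:

```ts
// Hypothetical override knob; the PR hard-codes 2000ms per text.
const PER_TEXT_TIMEOUT_MS = Number(process.env.OLLAMA_PER_TEXT_TIMEOUT_MS ?? 2000)

// Same shape as the PR's calculation, with the multiplier made configurable.
const batchTimeout = Math.max(OLLAMA_EMBEDDING_TIMEOUT_MS, texts.length * PER_TEXT_TIMEOUT_MS)
```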
| "modelNotEmbeddingCapable": "Ollama model is not embedding capable: {{modelId}}", | ||
| "hostNotFound": "Ollama host not found: {{baseUrl}}" | ||
| "hostNotFound": "Ollama host not found: {{baseUrl}}", | ||
| "batchTimeoutError": "Ollama embedding timed out after {{timeout}} seconds while processing {{count}} texts. Consider reducing batch size or increasing timeout." |
The new 'batchTimeoutError' translation key is only added to the English locale. Should we add placeholder translations to all other locale files (fr, de, es, ca, hi, id, it, ja, ko, nl, pl, pt-BR, ru, tr, vi) to prevent missing translation errors?
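For illustration, each locale file would gain the same key, with the English string as a temporary placeholder until translated:

```json
"batchTimeoutError": "Ollama embedding timed out after {{timeout}} seconds while processing {{count}} texts. Consider reducing batch size or increasing timeout."
```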
```ts
if (embeddingError.message?.includes("timed out") || embeddingError.message?.includes("timeout")) {
	throw new Error(
		`Embedding timeout for batch of ${batchTexts.length} texts. This may indicate the Ollama service is overloaded or the model is too slow. ${embeddingError.message}`,
		{ cause: embeddingError },
```
When wrapping errors with the 'cause' option, could we lose stack traces in some environments? Consider preserving the original error more explicitly, perhaps by including the original stack in the message or using a custom error class that better preserves debugging information.
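A sketch of one such custom error class (the class name is hypothetical); it keeps the ES2022 `cause` option but also appends the original stack so it survives plain `error.stack` logging:

```ts
class EmbeddingTimeoutError extends Error {
	constructor(message: string, cause: Error) {
		super(message, { cause })
		this.name = "EmbeddingTimeoutError"
		// Append the original stack so it is visible even when the
		// environment or logger does not surface Error.cause.
		if (cause.stack) {
			this.stack = `${this.stack}\nCaused by: ${cause.stack}`
		}
	}
}
```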
```ts
texts.length > 1
	? t("embeddings:ollama.batchTimeoutError", {
			count: texts.length,
			timeout: Math.round(Math.max(OLLAMA_EMBEDDING_TIMEOUT_MS, texts.length * 2000) / 1000),
```
The timeout calculation logic appears both here (line 135) and earlier (line 73). Could we extract this to a helper function like 'calculateBatchTimeout(textCount: number)' for better maintainability?
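A possible shape for that helper, using only values already present in the diff:

```ts
// Single source of truth for the batch timeout, shared by the request
// path and the error-message path.
function calculateBatchTimeout(textCount: number): number {
	return Math.max(OLLAMA_EMBEDDING_TIMEOUT_MS, textCount * 2000) // at least 2s per text
}
```

Both call sites would then become `calculateBatchTimeout(texts.length)`.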
```ts
if (attempts < MAX_BATCH_RETRIES) {
	const delay = INITIAL_RETRY_DELAY_MS * Math.pow(2, attempts - 1)
	console.log(`[DirectoryScanner] Retrying batch processing in ${delay}ms...`)
```
For consistency with other error messages in this file, should this retry log message include the workspace context? Something like: `[DirectoryScanner] Retrying batch processing in ${delay}ms for workspace ...`
No repro steps or scope
Summary
This PR fixes an issue where codebase indexing would freeze at 69% when using Ollama with the mxbai-embed-large model.
Problem
The indexing process was hanging indefinitely due to:
- Embedding requests to Ollama having no timeout that scaled with batch size, so a single slow or stalled batch could block the scan forever
- Errors in batch processing being swallowed instead of thrown, so failures never surfaced and progress stalled silently
Solution
Apply a dynamic timeout to batch embedding operations (at least 2 seconds per text, with the global Ollama timeout as a floor) and propagate batch errors immediately so the orchestrator can report failure instead of hanging.
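As a minimal sketch of that pattern, with illustrative names (`embedWithTimeout`, `embedBatch`) rather than the PR's actual API; only `OLLAMA_EMBEDDING_TIMEOUT_MS` and the 2-seconds-per-text scaling come from the diff:

```ts
const OLLAMA_EMBEDDING_TIMEOUT_MS = 60_000 // assumed default; the real constant lives in the embedder

async function embedWithTimeout(
	texts: string[],
	embedBatch: (texts: string[]) => Promise<number[][]>,
): Promise<number[][]> {
	// Scale the timeout with batch size: at least 2 seconds per text.
	const batchTimeout = Math.max(OLLAMA_EMBEDDING_TIMEOUT_MS, texts.length * 2000)

	let timer: ReturnType<typeof setTimeout> | undefined
	const timeout = new Promise<never>((_, reject) => {
		timer = setTimeout(
			() => reject(new Error(`Embedding timed out after ${batchTimeout}ms for ${texts.length} texts`)),
			batchTimeout,
		)
	})

	try {
		// Whichever settles first wins, so a stalled request can no longer hang the scan.
		return await Promise.race([embedBatch(texts), timeout])
	} finally {
		clearTimeout(timer)
	}
}
```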
Changes
- Ollama Embedder (`src/services/code-index/embedders/ollama.ts`): adds a dynamic, batch-size-scaled timeout to embedding requests and a descriptive timeout error.
- Scanner (`src/services/code-index/processors/scanner.ts`): improves error handling and recovery in batch processing, including retry logging.
- Orchestrator (`src/services/code-index/orchestrator.ts`): updates the error state immediately when batch processing fails.
- Localization (`src/i18n/locales/en/embeddings.json`): adds the `batchTimeoutError` message.

Testing
Fixes #6849
Important
Fixes indexing freeze at 69% by adding dynamic timeout handling and improving error propagation in Ollama embedder.
- Adds dynamic timeout handling in `ollama.ts`.
- Improves error handling and propagation in `scanner.ts` and `orchestrator.ts`.
- Adds `batchTimeoutError` message to `embeddings.json`.
- Adds retry logging in `scanner.ts` and `orchestrator.ts`.

This description was created by
for 2eb26fc.