
Commit 08b5a5b

fix: remove device_map to prevent TableTransformer meta tensor errors
Remove the device_map parameter from from_pretrained() calls to fix meta tensor errors during concurrent processing.

Changes:
- Device normalization (cuda -> cuda:0) for consistent caching
- Remove device_map from DetrImageProcessor.from_pretrained()
- Remove device_map from TableTransformerForObjectDetection.from_pretrained()
- Add explicit .to(device, dtype=torch.float32) for proper placement
- Improve logging to show the target device

Fixes:
- "Trying to set a tensor of type Float but got Meta" errors
- AssertionError during concurrent PDF processing
- Finalization race conditions with device_map

Root cause: device_map causes models to initialize with meta tensors, which fail when the model is later moved explicitly to a device. Removing device_map and using an explicit .to() call ensures proper tensor placement.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
1 parent a00e748 commit 08b5a5b
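The failure mode described in the commit message can be illustrated with a short sketch (a minimal illustration, not code from this commit; the model id is shown only as an example, and the exact behavior depends on the installed transformers/accelerate versions):

```python
import torch
from transformers import TableTransformerForObjectDetection

# With device_map, accelerate may initialize weights as meta tensors
# (shape-only placeholders with no storage) and dispatch them lazily.
# Moving such a model afterwards is what raised "Trying to set a tensor
# of type Float but got Meta" under concurrent loads.
model = TableTransformerForObjectDetection.from_pretrained(
    "microsoft/table-transformer-structure-recognition",
    device_map="cuda:0",
)

# The fix applied in this commit: load without device_map, then place explicitly.
model = TableTransformerForObjectDetection.from_pretrained(
    "microsoft/table-transformer-structure-recognition",
)
model.to("cuda:0", dtype=torch.float32)
```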

2 files changed (+34, -7 lines)

CHANGELOG.md

Lines changed: 9 additions & 0 deletions
```diff
@@ -1,3 +1,12 @@
+## 1.1.9
+
+### Fix
+- **TableTransformer device_map fix**: Remove device_map parameter to prevent meta tensor errors
+  - Device normalization (cuda -> cuda:0) for consistent caching
+  - Load models without device_map, use explicit .to(device, dtype=torch.float32)
+  - Fixes concurrent PDF processing AssertionError
+  - Prevents "Trying to set a tensor of type Float but got Meta" errors
+
 ## 1.1.8
 
 - put `pdfium` call behind a thread lock
```
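The "consistent caching" bullet matters because "cuda" and "cuda:0" are different strings even though they name the same device; if loaded models are memoized per device string, the two spellings would produce duplicate cache entries. A sketch of the normalization (the `_model_cache` dict is hypothetical, added here only for illustration):

```python
import torch

_model_cache: dict[str, object] = {}  # hypothetical per-device-string model cache


def normalize_device(device: str | None) -> str:
    """Collapse 'cuda' and 'cuda:0' into one canonical key so both spellings
    hit the same cache entry."""
    if device is None:
        device = "cuda" if torch.cuda.is_available() else "cpu"
    if device.startswith("cuda") and ":" not in device:
        device = f"cuda:{torch.cuda.current_device()}"
    return device
```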

unstructured_inference/models/tables.py

Lines changed: 25 additions & 7 deletions
```diff
@@ -70,24 +70,42 @@ def initialize(
         model: Union[str, Path],
         device: Optional[str] = "cuda" if torch.cuda.is_available() else "cpu",
     ):
-        """Loads the donut model using the specified parameters"""
+        """Loads the table transformer model using the specified parameters.
+
+        Device placement strategy:
+        - Normalize device names (cuda -> cuda:0) for consistent caching
+        - Load models WITHOUT device_map to avoid meta tensor errors
+        - Use explicit .to(device, dtype=torch.float32) for proper placement
+        """
+        # Device normalization for consistent caching
+        if device is None:
+            device = "cuda" if torch.cuda.is_available() else "cpu"
+        if device.startswith("cuda") and ":" not in device:
+            device = f"cuda:{torch.cuda.current_device()}"
+
         self.device = device
-        self.feature_extractor = DetrImageProcessor.from_pretrained(model, device_map=self.device)
+
+        # Load feature extractor WITHOUT device_map
+        self.feature_extractor = DetrImageProcessor.from_pretrained(model)
         # value not set in the configuration and needed for newer models
         # https://huggingface.co/microsoft/table-transformer-structure-recognition-v1.1-all/discussions/1
         self.feature_extractor.size["shortest_edge"] = inference_config.IMG_PROCESSOR_SHORTEST_EDGE
         self.feature_extractor.size["longest_edge"] = inference_config.IMG_PROCESSOR_LONGEST_EDGE
 
         try:
-            logger.info("Loading the table structure model ...")
+            logger.info(f"Loading table structure model to {self.device}...")
             cached_current_verbosity = logging.get_verbosity()
             logging.set_verbosity_error()
-            self.model = TableTransformerForObjectDetection.from_pretrained(
-                model,
-                device_map=self.device,
-            )
+
+            # Load model WITHOUT device_map (prevents meta tensor errors)
+            self.model = TableTransformerForObjectDetection.from_pretrained(model)
+
+            # Explicit device placement with dtype
+            self.model.to(self.device, dtype=torch.float32)
+
             logging.set_verbosity(cached_current_verbosity)
             self.model.eval()
+            logger.info(f"Table model successfully loaded to {self.device}")
 
         except EnvironmentError:
             logger.critical("Failed to initialize the model.")
```
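After initialize() runs, a quick way to confirm the fix took effect is to check that no parameter is still on the meta device (a hypothetical sanity check, not part of the commit):

```python
import torch


def assert_fully_materialized(model: torch.nn.Module, device: str) -> None:
    # Meta tensors are shape-only placeholders with no storage; any left
    # behind would fail later at inference time.
    assert not any(p.is_meta for p in model.parameters()), "meta tensors remain"
    # Every parameter should live on the normalized target device, e.g. "cuda:0".
    assert all(str(p.device) == device for p in model.parameters())
```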
